Aarhus University Seal

Test Of Archiving Software - WebReaper

Back to main outline

Type

Name

Platform

Version

Price

URL

Remarks

Complete websites

WebReaper

Windows

9.8

Free

http://www.webreaper.net

aa


Conclusion

WebReaper archives websites’ source codes and other elements, as well as converting these files so that the archived elements can be used offline. Elements requiring an online connection for viewing cannot be archived with this programme. WebReaper archives rapidly but individual pages are often missing or appear defective.  A further complaint against the programme is that certain limitations of the material to be archived are not possible; among other things, external web pages (domain boundaries) can only be exempted from archiving with difficulty. In spite of this, it should be noted that the programme is capable of archiving many web pages correctly.


Recommended settings

The desired websiteís URL is entered in the field in the upper left corner of the programme. One or more filters (lower right corner) should be added, in order to limit what is to be archived. Typically it is most important to limit the number of levels of the hyperstructure to be included. In order to do this, choose Crawl depth, which is often best limited to 4-5 levels.† Next, the destination path is chosen from the programmeís preferences (accessed by activating the icon next to the red stop button on the toolbar) Next, archiving is begun by activating the green íplayí button on the toolbar.


Archiving speed

Archiving time (min)

File size (MB)

Archiving speed (MB/min)

Degree of presence required

12.2

76.7

6.3

Low


Test details


Test date and time: Friday November 5 2004, 8 a.m. – 12 a.m.

Tested by: Bo Hovgaard Thomasen

Tested by archiving: http://www.dr.dk/kroniken , http://www.dr.dk/nyheder , http://www.dr.dk/skum , http://www.dr.dk/skum/boogie, http://www.dr.dk/oline, http://tv2.dk/

Speed test carried out by archiving: http://www.dr.dk/nyheder/html/nyheder/baggrund/tema2003/krise/index.jhtml

Test results

The following have been evaluated according to the following scale for the number of archived elements: 0=none, 1=few, 2=average, 3=most, 4=all

Structure

aa

2

aa

aa

Cascading Style Sheets

3

The archived material usually appears as defined in CSS.

Page composition

2

Elements are usually correctly positioned on most of the archived web pages, but in some cases the web pageís composition is very poorly archived.

Background

3

Backgrounds are usually archived correctly

Pop-up-windows

1

Many pop-up windows are not active in the archived version, especially if they are activated from a link with JavaScript.

Archiving of all the desired web pages

2

The programme is unable to archive all the desired sub-levels correctly.

Movement between elements in the structure

Link

3

aaaa

Print/writing

Textual link

4

All textual links are archived. However, some textual links referring to JavaScript routines are not functional, which has consequences for such things as fact boxes on web pages like dr.dk/nyheder.

Pull-down menu

3

Pull-down menus are archived and often act as links in the archived version.

Formulas such as login

4

Formulas act as links but almost always refer to online elements (online elements are not archived).

Image

Animation

4

Animation (such as Macromedia Flash) acts as a link in the archived version.

Graphics

2

Graphics links are archived and are often active, but not in JavaScript links.

Photo

2

All photo links are archived and active except for JavaScript links.

Moving images

-

Not tested

Link target

3

Aaaa

Print/writing

Text

4

All text on archived pages is included in the archived material.

Image

Animation

2

Only animation not requiring an online connection is archived.

Graphics

3

Graphics are usually archived.

Photo

3

Most photos are archived.

Moving images

2

Only moving images not requiring an online connection are archived

Other

-

aa

Sound

a

2

Only sound not requiring an online connection is archived.

Automation

4

aa

aa

Automatic redirection

4

Automatic redirection is active.

Movement in elements in the structure

Automatic + inherent

3

aa

Print/writing

3

All movable text is usually archived.

Image

Animation

4

Flash- and Shockwave-elements are archived perfectly

Moving images

3

Moving images usually archived correctly

Banner ads

3

Banner ads usually archived correctly.

Sound

Background sound

3

Background sound usually archived correctly

Banner ads

4

Sound in banner ads archived correctly

Automatic + online

0

aa

Print/writing

Chat as reader

0

Elements requiring online connection cannot be archived using WebReaper

Image

Moving images

0

Elements requiring online connection cannot be archived using WebReaper

Sound

0

Elements requiring online connection cannot be archived using WebReaper

User intervention + inherent

3

aa

Print/writing

Archived chat

-

Not tested

Mouse-over

4

Mouse-over text is archived and active.

Quizzes

2

Active to some degree

Clickable maps

4

Clickable maps (such as Micromedia Flash) are archived and functional.

Image

Non-streamed image (such as slide show, clickable map)

3

Usually functional in the archived version

Games

1

Games are archived poorly, because they are usually constructed with online elements (reporting high scores to the website, etc.). However, some games are correctly archived.

Quizzes

1

Quizzes are archived poorly, because they are usually constructed with online elements (reporting high scores to the website, etc.). However, some quizzes are correctly archived.

Clickable maps (w. zoom or activation)

4

Clickable maps (Macromedia Flash) are archived and functional

Mouse-over

4

Mouse-over images are correctly archived and functional

Sound

Non-streamed sound (e.g. activated in games, quizzes, etc.)

3

Sound is archived and is usually functional in the archived version.

Mouse-over

3

Sound is archived and is usually functional in the archived version.

User intervention + online

0

aa

Print/writing

Chat (as participant)

0

Elements requiring online connection cannot be archived using WebReaper.

Polls

0

Elements requiring online connection cannot be archived using WebReaper.

Test-yourself

0

Elements requiring online connection cannot be archived using WebReaper.

Image

Streamed images

0

Elements requiring online connection cannot be archived using WebReaper.

Games

0

Elements requiring online connection cannot be archived using WebReaper.

Sound

Streaming (both archived and live)

0

Elements requiring online connection cannot be archived using WebReaper.

Non-movable elements

3

aa

Print/writing

ss

4

All print/writing is correctly archived.

Image

3

All images are correctly archived.

Sound


Back to main outline

The test was carried out by graduate student Bo Hovgaard Thomasen during the period from July- December 2004, and its premises and main results are explicated in the text Test of software and strategies for micro-archiving websites.

Note: We do not have the resources to offer technical support or other advice on the use of the tested archiving programme beyond what can be found on this web page.