Aarhus University Seal

Test Of Archiving Software - Webkit2png

Back to main outline

Type

Name

Platform

Version

Price

URL

Remarks

Individual web page in static form.

Webkit2png

Mac OS X

0.4

Free

http://www.paulham-
mond.org/webkit2png/

A command-line operated programme designed for screenshots of individual web pages. A similar programme, khtml2png, exists for the Linux operative system (not tested), as well as Paparazzi! for Mac OS X with a graphic interface ( see test ).

 

Conclusion

Webkit2png can make a screenshot of an individual web page. An advantage in using this programme to archive web pages is that the page visually appears 100% as experienced in the browser.  However, the programme does not archive dynamic elements. A strong point in Webkit2png is that the programme is operated from the command line, which enables automatic archiving with the aid of scripts. See an example of a simple UNIX-script for archiving 17 websites or the script photourl.sh that archives the URLs specified in a text file using the webkit2png programme

 

Recommended settings

In order to use Webkit2png with Mac OS X, a number of criteria must be fulfilled:
(1) X-window (X11) must be installed on the computer (can be done from Mac OS 10.3). X11 can be downloaded from Appleís websites at the following URL (44 MB): http://www.apple.com/macosx/features/x11/download/. A C-compiler must also be installed ñ the compiler from Appleís developer package, 'Xcode Tools', was used for this test). The programme package is included with Mac OS X after version 10.3 ( search for the 'Developer.mpkg' file or 'XcodeTools.mpkg', which will install the necessary programme packages. Note: uses 1.5 GB of hard disk space). 'Xcode Tools' can be downloaded from Apple after registering as a developer at (372 MB): http://developer.apple.com/tools/download/.

(2) PyObjC 1.1 (an Objective-C Bridge) must be installed on the computer. The package can be downloaded from (2,7 MB): http://pyobjc.sourceforge.net/software/ .

(3) The Safari browser must be installed on the computer (included with Mac OS X, or can be downloaded from http://www.apple.com/safari/ ).

To archive using Webkit2png, proceed as follows:
First start 'Terminal' (Applications/Utillities), and from the command line type 'python /usr/local/bin/webkit2png.txt' (for this test the programme was copied to the folder /usr/local/bin). Next, type the URL of the desired web page, followed by a space and any further parameters (such as: python /usr/local/bin/webkit2png.txt cfi.imv.au.dk -F -d'), then press ‘enter’. Possible parameters can be found by using the parameter '--help' (as in: 'python /usr/local/bin/webkit2png.txt --help'.''

The following parameters are recommended as a minimum for archiving:
To indicate where the archived material is to be stored, it is recommended to use the parameter '-D' followed by the desired path (e.g. 'python /usr/local/bin/webkit2png.txt cfi.imv.au.dk -DNetsteder'). The parameter -d allows the current date to be added to the archived photoís file name, and the parameter ño allows the file name to be specified.

 

Archiving speed

Archiving time (min)

File size (MB)

Archiving speed (MB/min)

Degree of presence required

Not tested

Not tested

Not tested

Not tested

 

Test details


Test date and time: Friday December 17 2004, 10 a.m. – 11,30 a.m.

Tested by: Bo Hovgaard Thomasen

Tested by archiving: http://www.dr.dk/skum/boogie, http://rundfunk.tv2.dk

Speed test carried out by archiving: http://www.dr.dk/nyheder/html/nyheder/baggrund/tema2003/krise/index.jhtml

Top

 

Test results

The following have been evaluated according to the following scale for the number of archived elements: 0=none, 1=few, 2=average, 3=most, 4=all

Structure

a

3

a

a

Cascading Style Sheets

4

The archived material follows the online web pageís CSS.

Page composition

4

All elements are correctly positioned on the archived web page.

Background

4

All backgrounds are archived

Pop-up-windows

0

The programme can only archive individual pages

Archiving of all the desired web pages

-

Not relevant for this programme

Movement between elements in the structure

Link

0

aa

Print/writing

Textual link

0

Links are not active, the archived material is static

Pull-down menu

0

Links are not active, the archived material is static

Formulas such as login

0

Links are not active, the archived material is static

Image

Animation

0

Links are not active, the archived material is static

Graphics

0

Links are not active, the archived material is static

Photo

0

Links are not active, the archived material is static

Moving images

0

Links are not active, the archived material is static

Link target

0

a

Print/writing

Text

0

Links are not active, the archived material is static

Image

Animation

0

Links are not active, the archived material is static

Graphics

0

Links are not active, the archived material is static

Photo

0

Links are not active, the archived material is static

Moving images

0

Links are not active, the archived material is static

Other

-

a

Sound

a

0

Links are not active, the archived material is static

Automation

0

a

aa

Automatic redirection

0

Links are not active, the archived material is static

Movement in elements in the structure

Automatic + inherent

0

a

Print/writing

0

Movable print/writing is archived only as snapshot

Image

Animation

0

Animation is archived only as screenshot

Moving images

0

Moving images archived only as screenshot

Banner ads

0

Banner ads archived only as screenshot

Sound

Background sound

0

Sound is not archived.

Banner ads

0

Sound is not archived.

Automatic + online

0

aa

Print/writing

Chat as reader

0

The archived material is static. Chat can only be archived as snapshots, with no movement.

Image

Moving images

0

The archived material is static

Sound

0

Sound is not archived.

User intervention + inherent

0

a

Print/writing

Archived chat

0

The archived material is static

Mouse-over

0

The archived material is static

Quizzes

0

The archived material is static

Clickable maps

0

The archived material is static

Image

Non-streamed image (such as slide show, clickable map)

0

The archived material is static

Games

0

The archived material is static

Quizzes

0

The archived material is static

Clickable maps (w. zoom or activation)

0

The archived material is static

Mouse-over

0

The archived material is static

Sound

Non-streamed sound (e.g. activated in games, quizzes, etc.)

0

Sound is not archived.

Mouse-over

0

Sound is not archived.

User intervention + online

0

a

Print/writing

Chat (as participant)

0

The archived material is static

Polls

0

The archived material is static

Test-yourself

0

The archived material is static

Image

Streamed images

0

The archived material is static

Games

0

The archived material is static

Sound

Streaming (both archived and live)

0

The archived material is static

Non-movable elements

aa

3

aa

Print/writing

ss

4

All text is archived.

Image

4

All images are correctly archived.

Sound

0

Sound is not archived.

Back to main outline

The test was carried out by graduate student Bo Hovgaard Thomasen during the period from July- December 2004, and its premises and main results are explicated in the text Test of software and strategies for micro-archiving websites.

Note: We do not have the resources to offer technical support or other advice on the use of the tested archiving programme beyond what can be found on this web page.