WarcManager

From COPTR

The WARC Manager is a web-based UI for managing and querying collections of web crawl data.
Homepage: https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
Platforms: Apache Tomcat, MySQL JDBC connector, context.xml, schema.sql, and the warc webapp
Function: File Management, Web Capture
Content type: Web


Description

The WARC Manager is a web-based UI for managing and querying collections of web crawl data. It allows libraries to locate pages, determine the completeness of a web collection, and view crawl statistics for a page or collection. It has been tested on collections containing over 177 million pages covering 37 million unique URLs. It was developed by the University of Maryland.

User Experiences

Development Activity