WarcManager

From COPTR
Revision as of 13:50, 12 November 2013 by COPTR Bot (talk | contribs) (Trial import from script.)
Jump to navigation Jump to search


The WARC Manager is a web-based UI for managing and querying collections of web crawl data.
Homepage:https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
Platforms:Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp


Description

The WARC Manager is a web-based UI for managing and querying collections of web crawl data. The WARC Manager allows libraries to easily locate pages, determine the completeness of a web collection, and view crawl statistics for a page or collection. The WARC Manager has been tested on collections containing over 177 million pages covering 37 million unique URL's. Developed by University of Maryland.

User Experiences

Development Activity