Difference between revisions of "WarcManager"
Jump to navigation
Jump to search
(Trial import from script.) |
Prwheatley (talk | contribs) |
||
Line 1: | Line 1: | ||
− | {{ | + | {{Infobox tool |
|purpose=The WARC Manager is a web-based UI for managing and querying collections of web crawl data. | |purpose=The WARC Manager is a web-based UI for managing and querying collections of web crawl data. | ||
− | |||
|homepage=https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager | |homepage=https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager | ||
− | |||
|platforms=Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp | |platforms=Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp | ||
+ | |function=Web Crawl, File Management | ||
+ | |content=Web | ||
+ | }} | ||
+ | {{Infobox tool details | ||
+ | |ohloh_id=WarcManager | ||
}} | }} | ||
− | |||
<!-- Delete the Categories that do not apply --> | <!-- Delete the Categories that do not apply --> | ||
[[Category:Web Crawl]] | [[Category:Web Crawl]] | ||
Line 19: | Line 21: | ||
= Development Activity = | = Development Activity = | ||
− | |||
− | |||
− | |||
− |
Revision as of 14:13, 21 April 2021
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt673f8336301bc3_86410543
Description
The WARC Manager is a web-based UI for managing and querying collections of web crawl data. The WARC Manager allows libraries to easily locate pages, determine the completeness of a web collection, and view crawl statistics for a page or collection. The WARC Manager has been tested on collections containing over 177 million pages covering 37 million unique URL's. Developed by University of Maryland.