Difference between revisions of "WarcManager"
Prwheatley (talk | contribs)
Line 3:
  |homepage=https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
  |platforms=Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp
- |function=
+ |function=File Management, Web Capture
  |content=Web
  }}
Latest revision as of 16:57, 26 November 2021
Description
The WARC Manager, developed by the University of Maryland, is a web-based UI for managing and querying collections of web crawl data. It allows libraries to locate pages, determine the completeness of a web collection, and view crawl statistics for a page or collection. It has been tested on collections containing over 177 million pages covering 37 million unique URLs.