Difference between revisions of "WarcManager"

From COPTR
Jump to navigation Jump to search
 
Line 3: Line 3:
 
|homepage=https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
 
|homepage=https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
 
|platforms=Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp
 
|platforms=Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp
|function=Web Crawl, File Management
+
|function=File Management, Web Capture
 
|content=Web
 
|content=Web
 
}}
 
}}

Latest revision as of 16:57, 26 November 2021




The WARC Manager is a web-based UI for managing and querying collections of web crawl data.
Homepage:https://wiki.umiacs.umd.edu/adapt/index.php/WarcManager
Platforms:Apache Tomcat, MySQL jdbc connector, context.xml, schema.sql, and the warc webapp
Function:File Management,Web Capture
Content type:Web
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt67436cf0a6a284_89733355


Description[edit]

The WARC Manager is a web-based UI for managing and querying collections of web crawl data. The WARC Manager allows libraries to easily locate pages, determine the completeness of a web collection, and view crawl statistics for a page or collection. The WARC Manager has been tested on collections containing over 177 million pages covering 37 million unique URL's. Developed by University of Maryland.

User Experiences[edit]

Development Activity[edit]