Difference between revisions of "Web Archive Discovery"
Jump to navigation
Jump to search
(Added initial webarchive-discovery outline) |
(No difference)
|
Revision as of 10:22, 14 February 2014
Description
Full-text indexing system, using Apache Solr as the search back-end. Supports command-line and large-scale map-reduce (Hadoop) processing of ARC and WARC files. Also integrates file format analysis and scans for some known preservation risks.
User Experiences
- Used by the UK Web Archive to provide access to their collections. More details TBA.
Development Activity
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt662153e1b4d498_61738314