Web Archive Discovery
Full-text indexing system that uses Apache Solr as its search back-end. Supports both command-line and large-scale MapReduce (Hadoop) processing of ARC and WARC files. Also integrates file format analysis and scans content for some known preservation risks.
- Used by the UK Web Archive to provide access to its collections. More details TBA.
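
To give a rough sense of what turning WARC records into index documents involves, here is a minimal Python sketch. It is not the tool's own code: it assumes an uncompressed WARC stream, uses a hand-rolled parser rather than any real library, and the output field names (`url`, `crawl_date`, `content_length`, `text`) are hypothetical and not taken from the project's Solr schema. Sending the resulting dictionaries to Solr is left out.

```python
import io


def read_warc_records(stream):
    """Yield (headers, block) pairs from an uncompressed WARC byte stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if not line.strip():
            continue  # skip the blank lines that separate records
        if not line.startswith(b"WARC/"):
            raise ValueError("expected a WARC version line")
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the WARC header section
            name, _, value = line.decode("utf-8").partition(":")
            headers[name.strip().lower()] = value.strip()
        length = int(headers.get("content-length", "0"))
        yield headers, stream.read(length)


def to_index_document(headers, block):
    """Map one response record to a flat dict (hypothetical field names)."""
    # The block of a response record is an HTTP message: drop its headers.
    _, _, body = block.partition(b"\r\n\r\n")
    return {
        "url": headers.get("warc-target-uri"),
        "crawl_date": headers.get("warc-date"),
        "content_length": len(body),
        "text": body.decode("utf-8", errors="replace"),
    }


def index_documents(stream):
    """Build index documents for every response record in the stream."""
    return [
        to_index_document(headers, block)
        for headers, block in read_warc_records(stream)
        if headers.get("warc-type") == "response"
    ]


def make_sample_warc():
    """Build a one-record WARC file in memory, for illustration only."""
    http = b"HTTP/1.1 200 OK\r\nContent-Type: text/plain\r\n\r\nhello archive"
    head = (
        b"WARC/1.0\r\n"
        b"WARC-Type: response\r\n"
        b"WARC-Target-URI: http://example.org/\r\n"
        b"WARC-Date: 2014-01-01T00:00:00Z\r\n"
        b"Content-Length: " + str(len(http)).encode("ascii") + b"\r\n\r\n"
    )
    return head + http + b"\r\n\r\n"


if __name__ == "__main__":
    for doc in index_documents(io.BytesIO(make_sample_warc())):
        print(doc)
```

A real pipeline would also need to handle gzip-compressed records, ARC files, non-text payloads, and character set detection, none of which this sketch attempts.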
Andy Jackson (100.0%)