Difference between revisions of "NutchWAX"
Jump to navigation
Jump to search
(Import from spreadsheet via script.) |
Ania Molenda (talk | contribs) |
||
Line 1: | Line 1: | ||
− | {{ | + | {{Infobox tool |
|purpose=NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. | |purpose=NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. | ||
− | |||
|homepage=http://archive-access.sourceforge.net/projects/nutchwax/ | |homepage=http://archive-access.sourceforge.net/projects/nutchwax/ | ||
|license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0. | |license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0. | ||
|platforms=Platform-independent Java, though only tested and primarily used on Linux machines. | |platforms=Platform-independent Java, though only tested and primarily used on Linux machines. | ||
+ | |function=Web Crawl | ||
+ | |content=Web | ||
+ | }} | ||
+ | {{Infobox tool details | ||
+ | |ohloh_id=NutchWAX | ||
}} | }} | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
= Description = | = Description = | ||
NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java. | NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java. | ||
Line 19: | Line 17: | ||
= Development Activity = | = Development Activity = | ||
− | |||
− | |||
− | |||
− |
Revision as of 16:23, 22 April 2021
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt67436d8720e5e9_69469992
Description
NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java.