Difference between revisions of "NutchWAX"
(Import from spreadsheet via script.) |
Ania Molenda (talk | contribs) |
| Line 1: | Line 1: | ||
| − | {{ | + | {{Infobox tool |
|purpose=NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. | |purpose=NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. | ||
|homepage=http://archive-access.sourceforge.net/projects/nutchwax/ | |homepage=http://archive-access.sourceforge.net/projects/nutchwax/ | ||
|license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0. | |license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0. | ||
|platforms=Platform-independent Java, though only tested and primarily used on Linux machines. | |platforms=Platform-independent Java, though only tested and primarily used on Linux machines. | ||
| + | |function=Web Crawl | ||
| + | |content=Web | ||
| + | }} | ||
| + | {{Infobox tool details | ||
| + | |ohloh_id=NutchWAX | ||
}} | }} | ||
= Description = | = Description = | ||
NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java. | NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java. | ||
| Line 19: | Line 17: | ||
= Development Activity = | = Development Activity = | ||
Revision as of 16:23, 22 April 2021
Description
NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full-text search. It is based on the open-source Web-search software Nutch, was developed by the Internet Archive, and is written in Java.