Difference between revisions of "NutchWAX"
(Trial import from script.)
Prwheatley (talk | contribs)
(3 intermediate revisions by 2 users not shown)
Line 1:
− {{
+ {{Infobox tool
  |purpose=NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search.
−
  |homepage=http://archive-access.sourceforge.net/projects/nutchwax/
− |license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0.
+ |license=GNU Lesser General Public License 2.1; Nutch itself is under Apache License 2.0.
  |platforms=Platform-independent Java, though only tested and primarily used on Linux machines.
+ |function=Web Capture
+ |content=Web
+ }}
+ {{Infobox tool details
+ |ohloh_id=NutchWAX
  }}
−
−
−
−
−
−
  = Description =
  NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search. NutchWAX is based on the open-source Web-search software, Nutch. Developed by Internet Archive. Written in Java.
Latest revision as of 16:06, 26 November 2021
Description
NutchWAX is software for indexing ARC files (archived websites gathered using Heritrix) for full-text search. It is based on Nutch, the open-source web-search software. NutchWAX was developed by the Internet Archive and is written in Java.