Difference between revisions of "The DeDuplicator (Heritrix add-on module)"
Jump to navigation
Jump to search
(Trial import from script.) |
(Import from spreadsheet via script.) |
||
Line 1: | Line 1: | ||
{{Infobox_tool | {{Infobox_tool | ||
− | |purpose= The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls. | + | |purpose=The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls. |
|image= | |image= | ||
− | |homepage= http://deduplicator.sourceforge.net/ | + | |homepage=http://deduplicator.sourceforge.net/ |
|license= | |license= | ||
|platforms= | |platforms= |
Revision as of 21:26, 13 November 2013
Description
The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
User Experiences
Development Activity
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt673fc2aad63c35_41922212