Difference between revisions of "The DeDuplicator (Heritrix add-on module)"

From COPTR
(Trial import from script.)
(Import from spreadsheet via script.)
Line 1: Line 1:
  {{Infobox_tool
− |purpose= The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
+ |purpose=The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
  |image=
− |homepage= http://deduplicator.sourceforge.net/
+ |homepage=http://deduplicator.sourceforge.net/
  |license=
  |platforms=

Revision as of 21:26, 13 November 2013

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
Homepage: http://deduplicator.sourceforge.net/


Description

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.

User Experiences

Development Activity
