Difference between revisions of "The DeDuplicator (Heritrix add-on module)"

From COPTR
Jump to navigation Jump to search
(Trial import from script.)
 
(Trial import from script.)
Line 19: Line 19:
  
 
= Development Activity =
 
= Development Activity =
 +
 +
{{Infobox_tool_details
 +
|ohloh_id=The DeDuplicator (Heritrix add-on module)
 +
}}

Revision as of 17:41, 12 November 2013

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
Homepage:http://deduplicator.sourceforge.net/


Description

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.

User Experiences

Development Activity