The DeDuplicator (Heritrix add-on module)

From COPTR
Jump to: navigation, search




The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
Homepage:http://deduplicator.sourceforge.net/


[edit] Description

The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.

[edit] User Experiences

[edit] Development Activity


Contributors

COPTR Bot (100.0%)