The DeDuplicator (Heritrix add-on module)
Revision as of 16:32, 26 November 2021 by Prwheatley (talk | contribs)
Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt6a3f6ce4e9a4a2_47754732
Description
The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.