Difference between revisions of "The DeDuplicator (Heritrix add-on module)"
Prwheatley (talk | contribs) |
Line 2:
  |purpose=The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
  |homepage=http://landsbokasafn.github.io/DeDuplicator/
− |function=
+ |function=De-Duplication, Web Capture
  |content=Web
  }}
Latest revision as of 16:32, 26 November 2021
Description
The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.