Jump to navigation Jump to search
|The DeDuplicator (Heritrix add-on module)
|The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
|Tree displays the directory structure of a path or of the disk in a drive graphically.
|Manage disk space and scan your hard disks.
|TubeKit is a toolkit for creating YouTube crawlers.
|Tufts Submission-Agreement Builder Tool
|Data capture and Deposit
|SABT is a web-based tool that guides records creators and records managers through the process of creating submission agreements, both for single transfers and for standing submissions.
|UKWA GSuite Add-On
GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against:
|UnArchiver is a native macOS utility which supports infinitely more archive formats then other common archiving utilities.
|Securely encrypts large amounts of files
|Virtual CloneDrive works and behaves just like a physical CD/DVD drive, but it exists only virtually.
|Data capture and Deposit
|Google Chrome browser extension for creating WARC files from web pages
|WAS (Web Archiving Service)
|The Web Archiving Service (WAS) is a Web-based curatorial tool that enables libraries and archivists to capture, curate, analyze, and preserve Web-based government and political information.
|WAXToolbar is a firefox extension to help users with common tasks encountered surfing a web archive.
|WCT (Web Curator Tool)
|Web Curator Tool (WCT) is a workflow management application for selective web archiving.
|The WARC Manager is a web-based UI for managing and querying collections of web crawl data.
|Warrick is a free utility for reconstructing (or recovering) a website from web archives.
|The Wayback Machine is a powerful search and discovery tool for use with collections of Web site "snapshots" collected through Web harvesting, usually with Heritrix (ARC or WARC files).
|Web Scraper Plus+
|Web Scraper Plus+ takes data from the web and puts it into a spreadsheet or database.
|Citation and Impact Tracking
|WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material.
|WebShot allows you to take screenshots of web pages and save them as full sized images or thumbnails.
|webkit2png is a command line tool that creates png screenshots of webpages.
|Webrecorder is a hosted web archiving tool with which users can capture what they see as they browse websites and save that information (locally or to a free account)
|XXCopy is an expanded version of Xcopy
|Xcopy copies files and directories, including subdirectories.
|Xenu's Link Sleuth
|The tool checks the hyperlinks on websites.
|YT-DLP (You Tube Download P)
|Supports download of youtube videos, based on the now defunct YT-DL