Semantic search
Tool | Function | Purpose |
---|---|---|
Teleport | Web Capture | Teleport is a web crawling tool that enables offline browsing |
TeraCopy | File Copy File Management Transfer | Performs file copying, whilst also logging and verifying accuracy and completeness by using checksums |
Tesseract-ocr | OCR | Open source OCR engine, accepting uncompressed TIFF files as input |
The DeDuplicator (Heritrix add-on module) | De-Duplication Web Capture | The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls. |
Tree | File Management Metadata Processing Appraisal | Tree displays the directory structure of a path or of the disk in a drive graphically. |
TreeSize | File Management Appraisal | Scans hard disks and reports how disk space is used, to help manage it. |
TubeKit | Web Capture | TubeKit is a toolkit for creating YouTube crawlers. |
Tufts Submission-Agreement Builder Tool | Data capture and Deposit Planning | SABT is a web-based tool that guides records creators and records managers through the process of creating submission agreements, both for single transfers and for standing submissions. |
UKWA GSuite Add-On | Validation Appraisal | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against: |
UnArchiver | Fixity Decryption Transfer | UnArchiver is a native macOS utility that supports many more archive formats than other common archiving utilities. |
VeraCrypt | Decryption Transfer | Securely encrypts large numbers of files |
Virtual CloneDrive | Disk Imaging | Virtual CloneDrive works and behaves just like a physical CD/DVD drive, but it exists only virtually. |
WARCreate | Data capture and Deposit Personal Archiving Web Capture | Google Chrome browser extension for creating WARC files from web pages |
WAS (Web Archiving Service) | Web Capture | The Web Archiving Service (WAS) is a Web-based curatorial tool that enables libraries and archivists to capture, curate, analyze, and preserve Web-based government and political information. |
WAXToolbar | Web Capture | WAXToolbar is a Firefox extension that helps users with common tasks encountered while browsing a web archive. |
WCT (Web Curator Tool) | Metadata Processing Web Capture | Web Curator Tool (WCT) is a workflow management application for selective web archiving. |
WarcManager | File Management Web Capture | The WARC Manager is a web-based UI for managing and querying collections of web crawl data. |
Warrick | Web Capture | Warrick is a free utility for reconstructing (or recovering) a website from web archives. |
Wayback Machine | Access Discovery Web Capture | The Wayback Machine is a powerful search and discovery tool for use with collections of Web site "snapshots" collected through Web harvesting, usually with Heritrix (ARC or WARC files). |
Web Scraper Plus+ | Web Capture | Web Scraper Plus+ takes data from the web and puts it into a spreadsheet or database. |
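
Several tools in the table, TeraCopy among them, are described as verifying a copy by comparing checksums. The general idea can be sketched as follows. This is a minimal illustration and not the implementation of any listed tool; the function names `sha256_of` and `verified_copy` are hypothetical, and the choice of SHA-256 is an assumption.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verified_copy(source: Path, destination: Path) -> str:
    """Copy source to destination and check that both checksums match.

    Hypothetical helper: returns the checksum on success and raises
    OSError if the copy does not match the original.
    """
    expected = sha256_of(source)
    shutil.copy2(source, destination)  # copies content and file metadata
    actual = sha256_of(destination)
    if actual != expected:
        raise OSError(f"checksum mismatch for {destination}")
    return expected
```

Calling `verified_copy(Path("original.tif"), Path("backup/original.tif"))` (both paths are placeholders) would return the digest, which could then be written to a log so that fixity can be rechecked later.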