Semantic search
| Tool | Function | Purpose |
|---|---|---|
| Storytracker | Web Capture | Tools for tracking stories on news homepages |
| Sumfolder1 | Fixity De-Duplication Appraisal | sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level. |
| TOMES (Transforming Online Mail with Embedded Semantics) | Content Profiling Data capture and Deposit File Format Migration Metadata Processing | A package of open source tools for handling the preservation of government email records |
| Tabula | Data capture and Deposit | Extract tabular data from PDF files |
| Taverna | Managing Active Research Data Workflow Workflow and Lab Notebook Management | Taverna is a scientific workflow management system designed to assemble, run, document and share sequences of web services and scripts. |
| Teleport | Web Capture | Teleport is a web crawling tool that enables offline browsing |
| TeraCopy | File Copy File Management Transfer | Performs file copying, whilst also logging and verifying accuracy and completeness by using checksums |
| Tesseract-ocr | OCR | Open source OCR engine, accepting uncompressed TIFF files as input |
| The DeDuplicator (Heritrix add-on module) | De-Duplication Web Capture | The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls. |
| Tree | File Management Metadata Processing Appraisal | Tree displays the directory structure of a path or of the disk in a drive graphically. |
| TreeSize | File Management Appraisal | Manage disk space and scan your hard disks. |
| TubeKit | Web Capture | TubeKit is a toolkit for creating YouTube crawlers. |
| Tufts Submission-Agreement Builder Tool | Data capture and Deposit Planning | SABT is a web-based tool that guides records creators and records managers through the process of creating submission agreements, both for single transfers and for standing submissions. |
| UKWA GSuite Add-On | Validation Appraisal | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against: |
| UnArchiver | Fixity Decryption Transfer | UnArchiver is a native macOS utility that supports many more archive formats than other common archiving utilities. |
| VeraCrypt | Decryption Transfer | Securely encrypts large numbers of files |
| Virtual CloneDrive | Disk Imaging | Virtual CloneDrive works and behaves just like a physical CD/DVD drive, but it exists only virtually. |
| WARCreate | Data capture and Deposit Personal Archiving Web Capture | Google Chrome browser extension for creating WARC files from web pages |
| WAS (Web Archiving Service) | Web Capture | The Web Archiving Service (WAS) is a Web-based curatorial tool that enables libraries and archivists to capture, curate, analyze, and preserve Web-based government and political information. |
| WAXToolbar | Web Capture | WAXToolbar is a Firefox extension that helps users with common tasks encountered when surfing a web archive. |
| WCT (Web Curator Tool) | Metadata Processing Web Capture | Web Curator Tool (WCT) is a workflow management application for selective web archiving. |
| WarcManager | File Management Web Capture | The WARC Manager is a web-based UI for managing and querying collections of web crawl data. |
| Warrick | Web Capture | Warrick is a free utility for reconstructing (or recovering) a website from web archives. |
| Wayback Machine | Access Discovery Web Capture | The Wayback Machine is a powerful search and discovery tool for use with collections of Web site "snapshots" collected through Web harvesting, usually with Heritrix (ARC or WARC files). |
| Web Scraper Plus+ | Web Capture | Web Scraper Plus+ takes data from the web and puts it into a spreadsheet or database. |
| WebCite | Citation and Impact Tracking Persistent Identification Web Capture | WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material. |
| WebShot | Web Capture | WebShot allows you to take screenshots of web pages and save them as full sized images or thumbnails. |
| Webkit2png | Web Capture | webkit2png is a command line tool that creates png screenshots of webpages. |
| Webrecorder | Web Capture | Webrecorder is a hosted web archiving tool with which users can capture what they see as they browse websites and save that information (locally or to a free account) |
| XXCopy | File Copy | XXCopy is an expanded version of Xcopy |
| Xcopy | File Copy | Xcopy copies files and directories, including subdirectories. |
| Xenu's Link Sleuth | Web Capture | The tool checks the hyperlinks on websites. |
| YT-DLP (You Tube Download P) | Web Capture | Supports downloading YouTube videos; based on the now-defunct YT-DL |
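
The Sumfolder1 entry above describes generating checksums for directories in support of folder-level de-duplication. The general idea can be illustrated with a minimal Python sketch. This is not sumfolder1's actual algorithm: the function names `file_digest` and `folder_digest`, the choice of SHA-256, and the decision to ignore file and folder names are all assumptions made for illustration.

```python
import hashlib
import os
import tempfile


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of one file's contents."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def folder_digest(root):
    """Return a single digest for a directory tree.

    Assumption for this sketch: the folder digest is the SHA-256 of the
    sorted digests of its children, so two folders with identical
    contents get the same digest even if names differ.
    """
    child_digests = []
    for entry in os.scandir(root):
        if entry.is_dir(follow_symlinks=False):
            # Prefix keeps an empty folder distinct from an empty file.
            child_digests.append("d:" + folder_digest(entry.path))
        elif entry.is_file(follow_symlinks=False):
            child_digests.append("f:" + file_digest(entry.path))
    h = hashlib.sha256()
    for digest in sorted(child_digests):
        h.update(digest.encode("ascii"))
    return h.hexdigest()


if __name__ == "__main__":
    # Build three small folders: "a" and "b" hold the same bytes, "c" differs.
    with tempfile.TemporaryDirectory() as tmp:
        for name, text in (("a", "same"), ("b", "same"), ("c", "different")):
            folder = os.path.join(tmp, name)
            os.makedirs(folder)
            with open(os.path.join(folder, name + ".txt"), "w") as f:
                f.write(text)
        for name in ("a", "b", "c"):
            print(name, folder_digest(os.path.join(tmp, name)))
```

Folders that print the same digest are candidates for de-duplication. A production tool would also need to decide whether names, empty folders, and symbolic links should affect the result.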