Property:Purpose

From COPTR
Jump to navigation Jump to search

This is a property of type Text.

Showing 355 pages using this property.
G
Google Cloud Storage allows users to store, access, and manage their data.  +
ARK identifiers minter and resolver  +
A suite of software for building and distributing digital library collections  +
JPEG 2000 SDK, includes encoder/decoder  +
Search interface for metadata extracted from forensic disk images.  +
gvfs-info - print information about files and directories  +
gzip produces files with a .gz extension. gunzip can decompress files created by gzip, compress or pack  +
H
The HP Integrated Archive Platform (HP IAP) was a solution for the long-term archival and disposition of information.  +
HTTrack is a website copying utility.  +
HTTrack2Arc is a tool that converts HTTrack crawls to ARC files, the file format used by the Internet Archive.  +
Converts video data into widely accepted formats.  +
Digital Evidence Laboratory specialists created the HashKeeper software in 1998 to expedite the analysis of electronic media by reducing the number of files to be analyzed during the course of an investigation.  +
bootable CD with Linux and forensic tools  +
Heritrix is an open-source web crawler, allowing users to target websites they wish to include in a collection and to harvest an instance of each site.  +
The Rich Media Capture module (RMC), developed in the LiWA (Living Web Archives) project, is designed to enhance the capturing capabilities of the crawler, with regards to different multimedia content types.  +
The Hex Workshop Hex Editor by BreakPoint Software is a complete set of hexadecimal development tools for Microsoft Windows 2000 and later.  +
Browser-based Online and Offline Hex Editing.  +
HoliRisk is a framework and online tool to support the development of a risk assessment based on principles from ISO31000.  +
Hoppla is an archiving solution that combines back-up and fully automated migration services for data collections in small office environments.  +
Free Hex- and Ram-Editor  +
I
I2 +
i2 is a provider of intelligence and investigation management software for law enforcement, defense, national security and private sector organizations.  +
IBM's Digital Asset Preservation Tool is a proof-of-concept demonstration of the Universal Virtual Computer solution that provides long-term access to JPEG and GIF87a files.  +
ICA-AtoM allows organisations to create standards-based descriptions of their archival holdings and subsequently publish them to the Web.  +
A set of APIs, tools and protocols for manipulating and providing interoperable access to digital images  +
IIPImage is an advanced high-performance imaging server and client for web-based streamed remote visualization of ultra resolution scientific imagery.  +
ILookPI provides a fully programmable IDE environment with customizable tool capabilities.  +
IMAGE is a DOS application capable of generating either highly compressed or "flat" images for forensic analysis.  +
iMacros makes it easy to test web-based applications.  +
IN-SPIRE Visual Document Analysis is powerful information visualization software developed by Pacific Northwest National Laboratory.  +
iRODS software was designed to allow curators utilising heterogeneous storage and computing facilities to define policies without being concerned with the technical detail of how the system implements those policies and without having to respond to changes in technical infrastructure.  +
PDF library for manipulation, content extraction and creation  +
ImageMagick® is a software suite to create, edit, compose, or convert bitmap images.  +
ImageVerifier (IV for short) traverses a hierarchy of folders looking for image files to verify. It can verify TIFFs, JPEGs. PSDs, DNGs, and non-DNG raws (e.g., NEF, CR2).  +
ImpactStory (previously Total-Impact) allows researchers and organisations to gather a wide range of impact metrics about multiple forms of scholarly output.  +
InBoxer is a next generation email archiving, IM archiving, e-discovery, and policy management system.  +
Index.dat Analyzer is a tool to view, examine and delete contents of index.dat files.  +
InfinaDyne's forensic products are focused on government and law enforcement examining various types of media and intent on collecting evidence in a thorough, secure and trustworthy manner.  +
ingestr is a command-line application that allows ingesting or copying data from any source into any destination database.  +
Invenio is a free software suite enabling you to run your own digital library or document repository on the web.  +
IrfanView is a very fast, small, compact and innovative FREEWARE (for non-commercial use) graphic viewer for Windows 9x, ME, NT, 2000, XP, 2003, 2008, Vista, Windows 7.  +
Recover data from CD, DVD, BD, HDD, Flash drive, USB stick, media card, SD and SSD.  +
J
JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects.  +
JHOVE2 allows data curators to characterise the digital objects in their repositories.  +
Pure Java implementation of a JPEG2000 decoder  +
JPC is the fast pure Java x86 PC emulator.  +
Java Web Archive Toolkit  +
Reference and bibliographic data manager  +
The PAIRTREE LIBRARY is a software library that supports the mapping between identifiers and filepaths according to the Pairtree Specification.  +
Simple JP2 file structure checker  +
JP2 validation + properties extraction  +
K
KEA is an algorithm for extracting keyphrases from text documents.  +
KEEP Emulation Framework (EF) allows users to view and interact with digital files that otherwise would require obsolete hardware and software.  +
The KOST-Simy application is used for Compare Images.  +
KOST-Val is an open source validator for different file formats and Submission Information Package (SIP).  +
Benefits Analysis Toolkit guides users through a process of identifying, assessing, and communicating the benefits from investing resources in the curation and long-term preservation of research data.  +
JPEG 2000 SDK, includes encoder/decoder  +
Karen's Directory Printer can print the name of every file on a drive, along with the file's size, date and time of last modification, and attributes (Read-Only, Hidden, System and Archive).  +
PhraseRate is a program, developed by Keith Humphreys, for extracting a set of meaningful, attractive keywords and key phrases from a web page describing the content of that page.  +
Kepler is a scientific workflow modelling and management system that enables users, regardless of programming experience, to set up data analysis pipelines.  +
KVM (for Kernel-based Virtual Machine) is a full virtualization solution for Linux on x86 hardware containing virtualization extensions (Intel VT or AMD-V).  +
khtml2png is a command line program to create screenshots of webpages.  +
The kopal Library for Retrieval and Ingest (koLibRI) represents a library of Java tools that have been developed for the interaction with the DIAS system of IBM within the kopal project.  +
Open Source turn-key OCR system forked from ocropus  +
Floppy disk controller software that accompanies a KryoFlux drive  +
L
LOCKSS software allows libraries to create preserved digital collections out of materials that would otherwise be accessible only through a licensed academic subscription.  +
LabTrove is a blogging platform specifically designed for use in a research environment.  +
Open-source JavaScript library for building interactive maps (OpenStreetMap)  +
Legacy Locker is a safe, secure repository for your vital digital property that lets you grant access to online assets for friends and loved ones in the event of loss, death, or disability.  +
Converts document formats to RTF  +
A tiered set of recommendations that help guide organizations in the planning and implementation of born-digital access provisions.  +
Libewf is a library for support of the Expert Witness Compression Format (EWF), it support both the SMART (EWF-S01) and EnCase (EWF-E01) format.  +
This library can be used to classify files according to magic number tests.  +
Media indexing multi-tool  +
The Library of Congress Newspaper Viewer is a web application used to ingest and view digitized newspaper pages meeting the National Digital Newspaper Program specification.  +
An office suite with command line options for PDF/A conversions  +
libsafe allows the organizations to create a full OAIS compliant Archive, including active and passive digital preservation workflows and is particularly suited for master image files of digitizing processes.  +
This is an implementation for libsharedmime.  +
Software for processing, enhancing and converting cultural heritage into digital cultural heritage  +
Lingfo provides a library for developers to use to extract information from Microsoft Excel spreadsheet files.  +
Linux-VServer provides virtualization for GNU/Linux systems.  +
LosslessCut is a video editing application  +
LuraDocument PDF Compressor is a document conversion engine.  +
M
Lightweight Windows Desktop application to create and check MD5 Digests for one or several files.  +
MDB Viewer opens Microsoft Access 1997-2013 databases on your Macintosh, and views and exports all tables in Access databases.  +
The METS API is a Java API designed to aid developers in the processing and assembly of METS Documents.  +
METS-based system for displaying and navigating sets of page images or other multi-part digital objects.  +
Python library for processing and outputting METS/PREMIS XML according to the Archivematica METS profile.  +
A web application for human-friendly exploration of Archivematica METS files  +
MIXED (Migration to Intermediate XML for Electronic Data) is a web service that converts tabular data files such as spreadsheets and databases to the Standard Data Format for Preservation (SDFP), a supplier-independent XML format.  +
MP3::Tag is a module for reading tags of MP3 audio files.  +
MP3val is a small, high-speed, free software tool for checking MPEG audio files' integrity.  +
mpg321 is a command-line mp3 player. mpg321 is used for frontends, as an mp3 player and as an mp3 to wave file decoder.  +
MPP Viewer is a viewer for Microsoft Project files  +
A movie player that runs on many systems, supports a wide range of formats and supports a wide range of output drivers  +
MRU-Blaster is a program made to do one large task - detect and clean MRU (most recently used) lists on your computer.  +
The MSIL Disassembler is a companion tool to the MSIL Assembler (Ilasm.  +
Extract Emails from many kinds of Mailbox formats  +
Unifies your private emails into one searchable, platform-independent repository  +
Matchbox: Duplicate detection tool for digital document collections.  +
Free Tools [See specifically Foresnic Tools]  +
md5deep is a set of programs to compute MD5, SHA-1, SHA-256, Tiger, or Whirlpool message digests on an arbitrary number of files. hashdeep is a program to compute, match, and <em>audit</em> hashsets.  +
md5sum computes a 128-bit checksum (or fingerprint or message-digest) for each specified file.  +
MD5summer is an application for Microsoft Windows 9x, NT, ME, 2000 and XP which generates and verifies md5 checksums.  +
Tool for managing and comparing digital asset metadata  +
MediaConch is a file validation software.  +
Supplies technical and tag information about a video or audio file.  +
Mendeley is a combination web service and desktop application that allows users to create, manage, and share collections of references.  +
Merritt is a cost-effective repository service from the University of California Curation Center (UC3) that lets the UC community manage, archive, and share its valuable digital content.  +
A tool for editing, cleaning, healing, inspecting, rendering, texturing, converting, and editing 3D triangular meshes.  +
Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.  +
The Metadata Interrogator is a standalone, offline GUI tool for extracting and analysing metadata from a wide variety of file formats.  +
A simple tool for creating new CSV and HTML reports based on the metadata files generated by the Data Accessioner  +
Freeware tool to view, edit, modify, extract, copy metadata of various formats.  +
Web-based EXIF data viewer  +
Metaproducts offers several commercial capture and off-line browsing tools.  +
Micr’Olonys is the software solution for long-term passive digital archiving on film and paper.  +
Use the Word 2003 Redaction Add-in to hide text within Microsoft Office Word 2003 documents.  +
With this add-in you can permanently remove hidden data and collaboration data, such as change tracking and comments, from Microsoft Word, Microsoft Excel, and Microsoft PowerPoint files.  +
The Minimum Preservation Tool (MPT) can be used to create an interim preservation storage environment for files awaiting preservation in a longer term repository solution. It supports checksum generation, fixity checking, and replication across two or more storage nodes.  +
Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user access to the copy  +
Download legacy file formats from the UK Web Archive and Warclight services  +
A movie player that runs on many systems, supports a wide range of formats and supports a wide range of output drivers  +
A tool used for personal archiving of email.  +
Multivalent works on digital documents research and development.  +
myExperiment is an online social networking service aimed at scientific researchers; the site fosters collaboration by allowing members to share scientific workflows, experiment plans, and other digital objects.  +
N
NARA File Analyzer and Metadata Harvester allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories.  +
NARA complete Preservation Plan collection for all their object types, requesting your interaction.  +
NARA Video Frame Analyzer analyzes technical properties of individual frames of a video file in order to detect quality issues within digitized video files.  +
The "Levels of Digital Preservation" are a tiered set of recommendations for how organizations should begin to build or enhance their digital preservation activities.  +
Nesstar suite is an online publishing platform for organisations wishing to share datasets both internally and with the wider web.  +
A version of [http://coptr.digipres.org/Nice_Opaque_Identifiers_%28NOID%29 NOID] in Ruby  +
The NSRL provides a large data set of metadata on computer files which can be used to identify the files and their provenance  +
This is a presrvation tool  +
Tool for METS/ALTO validation and quality control  +
A friendly swarm of format-identifying robots  +
NetarchiveSuite is a web archiving software package designed to plan, schedule and run web harvests of parts of the Internet.  +
Data-driven computational pipelines  +
Identifiers management tool to generate, bind and resolve different kinds of identifiers  +
A PDF handling tool including PDF/A  +
Identifiers management tool  +
Practical digital preservation training for beginners  +
Process/module manager for Windows, with features such as Kill/Resume/Suspend thread of a process and unload DLL files  +
Platform for digitization projects management  +
NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search.  +
O
Open Computer Forensics Architecture is a modular computer forensics framework.  +
ODF Validator is a tool that validates OpenDocument files and checks them for certain conformance criteria.  +
An RDF based list of basic RDM infrastructure components to make this infrastructure more visible and easier to identify  +
disk image file mounting  +
officerparser.py is a python script that parses the format of OLE compound documents used by Microsoft Office applications.  +
Analyses plain text files, looking for code (scripting languages etc.)  +
An open source tool for accessing and exploring web archives through emulated legacy browsers.  +
Omeka is a free open source web-publishing platform for the display of library, museum, archives, and scholarly collections and exhibitions.  +
Plugin for Omeka to assign ARK identifiers  +
Ontrack EasyRecovery software products offer home users or businesses complete solutions for their data recovery, file repair and disk diagnostic needs.  +
Ontrack Eraser software is an easy-to-use, highly flexible data erasure tool that erases all traces of data stored on a targeted media - ensuring that sensitive information does not fall into the wrong hands.  +
OpenOffice.org 3 is the leading open-source office software suite for word processing, spreadsheets, presentations, graphics, databases and more.  +
This tool is for video conversion, splitting and editing.  +
OpenDOAR is a simple, web-based tool that guides repository administrators through the process of creating basic policies for the submission, re-use, and preservation of digital materials.  +
The OpenJPEG library is an open-source JPEG 2000 codec written in C language.  +
For dealing with messy data, cleaning it and transforming it  +
OpenVZ is container-based virtualization for Linux.  +
The OpenWMS is a platform-independent, open source, web-accessible system that can be used as a standalone application or integrated with other repository architectures by a wide range of organizations.  +
The goal for this project is to provide translators to allow for interoperability between applications based on ODF (OpenDocument) 1.  +
Collates information into a CSV from log files for a batch optical media rip  +
Outside In Technology is a suite of software development kits (SDKs) that provides developers with a comprehensive solution to access, transform and control the contents of over 500 unstructured file formats.  +
P
software library that supports the mapping between identifiers and filepaths according to the Pairtree Curation Microservices Specification.  +
Tools for parsing and analysing PDF documents  +
PDF/A Manager is a PDF/A (ISO 19005) validation and conversion software.  +
PDFsam splits and merges PDF files  +
PDWIPE (Physical Drive WIPE) is a standalone DOS utility to wipe (zero) an entire physical hard drive.  +
A tool to capture contextual information in a sheer curation scenario  +
Plato is a preservation-planning tool for organisations charged with safeguarding digital materials.  +
The PREMIS Utility is a graphical program used to generate PREMIS metadata records for use in digital preservation systems and digital asset management systems in JSON and XML format, and attempts to cover gaps not programmatically generated by system logs.  +
Output DROID compatible file format signature files using PRONOM syntax  +
View, create, edit, and validate Swiss archival packages  +
pageVault supports the archiving of all unique responses generated by a web server.  +
Suite of tools for detecting changes in web pages and their rendering  +
A universal converter that converts files from one markup format into another  +
Paraben provides forensics tools.  +
"Use your CDROM drive to read audio tracks.... and have it actually work right!"  +
Passware software recovers or resets passwords for Windows, Word , Excel, QuickBooks, Access, Acrobat, and more than 180 document types.  +
pdfaPilot: Conversion of documents and emails into robust, searchable PDF or PDF/A files  +
A Go library and command line tool for PDF processing incl. validation  +
PDF manipulation tool  +
Pearl Crescent Page Saver is an extension for Mozilla Firefox that lets you capture images of web pages, including Flash content.  +
peepdf is a Python tool to explore PDF files in order to find out if the file can be harmful or not.  +
A tool that captures, stores, plays-back and provides a new URL for web citation. Built and maintained at the Harvard Law School Library.  +
PhotoRec is file data recovery software designed to recover lost files including video, documents and archives from hard disks, CD-ROMs, and lost pictures (thus the Photo Recovery name) from digital camera memory.  +
PhotoRescue is a picture and data recovery solution for digital film - sd cards, compact flash, memory sticks, microdrive, etc.  +
PREMIS in METS Toolbox was developed to support the implementation of PREMIS in the METS container format.  +
A free open-source WebGL based point cloud renderer for large point clouds.  +
PowerISO is a powerful CD/DVD image file processing tool, which allows you to open, extract, create, edit, compress, encrypt, split and convert ISO files, and mount these files with internal virtual drive.  +
A tool for generating an OAIS SIP for digital preservation. It produces METS document that contains metadata for digital preservation.  +
Preservica is a complete OAIS Digital Preservation system available on the cloud (hosted in US, EU, CA or AUS) and on-premise. It is trusted by over 1000 organisations across 5 continents to preserve collections both large (>6Pb) and small (few 100kb).  +
Prodiscover provides a set of features and toolkits for Computer Forensics and Incident Response  +
Blacklight is a free and open source ruby-on-rails based discovery interface (a.  +
Proofpoint Enterprise Archive is a SaaS email archiving solution that addresses three key challenges eDiscovery, regulatory compliance and email storage management without the headaches of managing archiving in-house.  +
Puremagic is a cross-platform pure python module that will identify a file based off it's magic numbers  +
FLAG (Forensic and Log Analysis GUI) is an advanced forensic tool for the analysis of large volumes of log files and forensic investigations.  +
A lightweight DPX file format validator.  +
Library for working with XMP metadata, as well as reading/writing XMP metadata stored in many different file formats  +
This is a Python implementation of the checkm specification.  +
Q
Digitized analog video analysis  +
QGIS is a Free and Open Source GIS application that supports a wide range of raster and vector spatial data types.  +
With QPxTool you can measure the quality of CDs and DVDs.  +
QPDF is a command-line program that does structural, content-preserving transformations on PDF files  +
View virtually all the files and e-mail attachments you need, instantly without purchasing numerous software programs.  +
R
RAID is a relational database used to record key pieces of information and to quickly identify links among people, places, businesses, financial accounts, telephone numbers, and other investigative information.  +
rARC is a distributed system that enables Internet users to provide storage space from their computers to replicate small parts of the archived data stored in the central repository of the Web archive.  +
Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks  +
The Revision Control System (RCS) manages multiple revisions of files.  +
Controlled renaming of file collections  +
RMCAS is an assessment tool for organisations wishing to map their current records management infrastructure against community best-practice.  +
Migrates databases to an XML schema, DBML. Can then provide access by dumping DBML to MySQL and showing it in phpMyAdmin.  +
The tool creates SIPs from files and folders available on the local file system.  +
RackSpace provices cloud based services to businesses of all sizes through the world.  +
The RapidRedact product range provides fast, easy to use redaction tools for irreversibly blanking out (redacting) selected information, author's changes and hidden data from all electronic document types.  +
A file audit and comparison tool using Microsoft Excel and VBA.  +
ReDBox and Mint are two complimentary applications designed to create, store, and provide access to research metadata.  +
ReNamer is a very powerful and flexible file renaming tool.  +
ReactOS is a free and open-source operating system for personal computers intended to be binary-compatible with computer programs and device drivers made for Windows Server 2003 and later versions of Windows.  +
ReaderMeter is a web-based service that compiles readership information about scientific content to create an estimate of the content's community impact.  +
Recollections is a free open source platform for generating and customizing views (interactive maps, timelines, facets, tag clouds) that allow scholars, librarians, and curators to explore digital collection.  +
Automatically generates "playable" virtual machines from source code on github  +
Recovery Is Possible (RIP) is a CD or USB boot/rescue/backup/maintenance system.  +
Provides Windows desktop and server redaction of PDF, Word, scanned TIFF images. Find, black out and remove content within documents, images or drawings.  +
Redax completely redacts (removes) text and graphics from the PDF page.  +
Regshot is an open-source (GPL) registry compare utility that allows you to quickly take a snapshot of your registry and then compare it with a second one - done after doing system changes or installing a new software product.  +
Removes empty directories  +
The ResCarta Tools software empowers users to create non-proprietary digital objects with LOC standard METS, MODS, MIX and AudioMD metadata from existing TIFF, JPEG, PDF and WAV data through user-friendly interfaces.  +
ResearchGate is an online professional network for scientists and researchers, particularly employed by those wishing to follow and track the publication outputs of others in their field.  +
Restorer Ultimate offers data recovery software.  +
RHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files.  +
Riprap is a PREMIS-compliant fixity checking microservice.  +
rocfl is a command line utility for interacting with OCFL repositories on the local filesystem or in S3.  +
RODA - Repository of Authentic Digital Objects  +
Ex Libris Rosetta enables institutions to preserve and provide access to the collections in their care.  +
S
Policy-based replication and Auditing of LOCKSS networks.  +
A brief description  +
SDelete is a command line utility that takes a number of options.  +
Social Feed Manager is open source software that provides a web interface to enable users to harvest social media data and web resources from Twitter and other social media platforms.  +
SIARD Suite is a freeware tool for the conversion of contents of relations databases into the SIARD format.  +
SIARD-Val is an open source validator for SIARD files.  +
SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files.  +
Plugin for QGIS. Fetches data from Wikidata and other Linked Data SPARQL endpoints and adds a new layer in a QGIS project. Just insert a SPARQL query for Geo-Items and get a new vector layer into QGIS.  +
The DICE Storage Resource Broker (SRB) supports shared collections that can be distributed across multiple organizations and heterogeneous storage systems.  +
Recursive piecewise hashing tool  +
SafeBack is used to create mirror-image (bit-stream) backup files of hard disks or to make a mirror-image copy of an entire hard disk drive or partition.  +
Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity, and OS file system metadata.  +
low level data recovery tool  +
SalvageData Recovery software tools and products are designed to empower both IT professionals and average personal computer users with all the functionalities and features needed to successfully salvage and recover data files from any kind of logical data loss situation.  +
A digital repository software that offers flexible and rich user interfaces tailored to distinct content types  +
screen-scraper is a tool for extracting data from websites.  +
Machine learning implementation package to generate descriptive metadata for digitized historical images.  +
SheepShaver is a MacOS run-time environment for BeOS and Linux that allows you to run classic MacOS applications inside the BeOS/Linux multitasking environment.  +
An open source photo manager capable of describing image collections for archival ingest.  +
A PRONOM based, command line, file format identification tool using Aho Corasick matching and no buffer limits.  +
Exhibit lets you easily create web pages with advanced text search and filtering functionalities, with interactive maps, timelines, and other visualizations.  +
SiteStory is a transactional web archive. It archives resources of a web server it is associated with.  +
Processing of 3D model, mesh, and texture data including the option to define custom processing workflows, where a set of files is processed by multiple tools.  +
Open source 3D explorer and authoring tool suite with a 3D viewer web component to be used for dissemination of 3D data.  +
Snagit is screen capture software to create interesting training documents, collaborative design work, IT bug reports, and more.  +
SobekCM is a digital repository and digital scholarship/publishing system which enables easy deposit, preservation, and access for all types of digital content, tailored to the needs of galleries, libraries, archives, museums, scholars, and researchers.  +
Creation of METS documents from a folder of items with bibliographic metadata.  +
Spadix Software can download websites from a starting URL, search engine results or web dirs, and is able to follow external links.  +
SpinRite is a magnetic storage data recovery, repair, and maintenance utility.  +
ssconvert is a command line utility to convert spreadsheet files between various spreadsheet file formats.  +
Tools for tracking stories on news homepages  +
Open source, fast, PDF and eBook viewer  +
sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level.  +
Switch is a universal audio converter that supports a wide range of formats.  +
T
TIFF-Val is an open source validator for TIFF files.  +
A package of open source tools for handling the preservation of government email records  +
Extract tabular data from PDF files  +
The Tar program provides the ability to create tar archives, as well as various other kinds of manipulation.  +
Taverna is a scientific workflow management system designed to assemble, run, document and share sequences sequences of web services and scripts.  +
Teleport is a web crawling tool that enables offline browsing  +
Performs file copying, whilst also logging and verifying accuracy and completeness by using checksums  +
Open source OCR engine, accepting uncompressed TIFF files as input  +
TestDisk is powerful free data recovery software that was primarily designed to help recover lost partitions and/or make non-booting disks bootable again when these symptoms are caused by faulty software, certain types of viruses or human error (such as accidentally deleting a Partition Table).  +
LibCarvPath is a library for computer forensics carving tools.  +
The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.  +
The Open Video Digital Library Toolkit project is intended to provide museums, libraries and other institutions holding moving image collections tools to more easily create Web-based digital video libraries.  +
Bulk renaming of files  +
Collection of command line computer forensics digital investigation tools.  +
The aDORe Federation is a federated repository framework and reference implementation which aims to address many of the scalability issues experienced by large scale digital object repositories.  +
TrID is a utility designed to identify file types from their binary signatures.  +
Tree displays the directory structure of a path or of the disk in a drive graphically.  +
Manage disk space and scan your hard disks.  +
TubeKit is a toolkit for creating YouTube crawlers.  +
SABT is a web-based tool that guides records creators and records managers through the process of creating submission agreements, both for single transfers and for standing submissions.  +
TweetSets provides a web interface that allows users to (1) select from existing datasets; (2) limit the dataset by querying on keywords, hashtags, and other parameters; (3) generate and download dataset derivatives such as the list of tweet ids and mention nodes/edges.  +
U
Web archives access API  +
GSuite functions for people working with web archives. The functions use the Memento API (specifically the [https://timegate.readthedocs.io/en/latest/big-picture.html?highlight=server#client-server-and-timegate TimeGate]) to look up whether a given archive holds a given URL. It currently supports checks against: *UK Web Archive *UK Government Web Archive *Internet Archive  +
This page links to information and tools from the USGS.  +
UnArchiver is a native macOS utility which supports infinitely more archive formats then other common archiving utilities.  +
a Universal vector graphics format translator  +
unrm is a small shell utility that can, under some circumstances, recover almost 99% of your erased data (similar to DOS's undelete).  +
Attempts to recover every readable piece of a file and puts the pieces together.  +
V
Cross platform audio and video player based primarily on the libavcodec.  +
VMware Player is the easiest way to run multiple operating systems at the same time on your PC.  +
vRenamer is a cross platform tool for batch renaming files  +
Securely encrypts large amounts of files  +
PDF/A validation tool  +
Online search, discovery, and display of digitized newspaper collections  +
Virtual CloneDrive works and behaves just like a physical CD/DVD drive, but it exists only virtually.  +
VirtualBox is a powerful x86 and AMD64/Intel64 virtualization product for enterprise as well as home use.  +
Vitam is an open source software able to manage and preserve digital records and archives (back-end).  +
System for indexing and retrieving multimedia data based on its content.  +
Voyeur is a web-based text analysis environment that can use texts in a variety of formats, from different locations to perform lexical analysis, export data to other tools, and embed live tools into remote websites.  +
W
This is the World Wide Web Consortium's validation tool.  +
Google Chrome browser extension for creating WARC files from web pages  +
The Web Archiving Service (WAS) is a Web-based curatorial tool that enables libraries and archivists to capture, curate, analyze, and preserve Web-based government and political information.  +
WAXToolbar is a firefox extension to help users with common tasks encountered surfing a web archive.  +
Web Curator Tool (WCT) is a workflow management application for selective web archiving.  +
WERA (Web ARchive Access) is a freely available solution for searching and navigating archived web document collections.  +
WMDecode is used for extracting files from winmail.  +
A proof-of-concept client side webapp for analyzing WARC data using Webrecorder's warcio.js. No WARC data is uploaded anywhere it runs on your machine. The idea is that it would be useful for archivists who have been given a pile of WARC data and they would like to quickly know what it contains.  +
Warc-proxy is a simple tool to view WARC content in Firefox  +
The WARC Manager is a web-based UI for managing and querying collections of web crawl data.  +
Warcit is a command-line tool that converts directories (including nested directories), files (including HTML or other web assets and data files) and ZIP files to Web Archives (WARC).  +
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)  +
Warrick is a free utility for reconstructing (or recovering) a website from web archives.  +
The Wayback Machine is a powerful search and discovery tool for use with collections of Web site "snapshots" collected through Web harvesting, usually with Heritrix (ARC or WARC files).  +
Indexing and discovery tools for web archives.  +
A tool that replays WARC files on your local computer.  +
Web Scraper Plus+ takes data from the web and puts it into a spreadsheet or database.  +
WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material.  +
WebShot allows you to take screenshots of web pages and save them as full sized images or thumbnails.  +
webkit2png is a command line tool that creates png screenshots of webpages.  +
Webrecorder is a hosted web archiving tool with which users can capture what they see as they browse websites and save that information (locally or to a free account)  +
WinHex is in its core a universal hexadecimal editor, particularly helpful in the realm of computer forensics, data recovery, low-level data processing, and IT security.  +
A visual tool for differencing and merging of file collections, images and texts.  +
WinZip is the world's most popular Windows Zip utility for file compression, file sharing, file encryption, and data backup.  +
Windows based forensic tools  +
Windows XP Mode and Windows Virtual PC, available on Windows 7 Professional and Windows 7 Ultimate, allow you to run multiple Windows environments, such as Windows XP Mode, from your Windows 7 desktop.  +
Wine lets you run Windows software on other operating systems.  +
Fast disk space analyzer  +
WordHoard is an application for the close reading and scholarly analysis of deeply tagged texts.  +
A free hex editor / disk editor  +
X
XArch is an archive management system that allows one to create, populate, and query archives of multiple database versions.  +
A set of command line utilities (tools) to transform, query, validate, and edit XML documents and files  +
A GUI platform for video conversion and cutting.  +
The XPAT engine is an SGML/XML-aware search engine that the University of Michigan has deployed with an extremely diverse set of digital library resources.  +
XXCopy is an expanded version of Xcopy  +
Xcopy copies files and directories, including subdirectories.  +
The xcorrSound package compares sound waves using cross correlation.  +
The Xen hypervisor, the powerful open source industry standard for virtualization, offers a powerful, efficient, and secure feature set for virtualization of x86, x86_64, IA64, ARM, and other CPU architectures.  +
Detecting the file formats of digital objects; converting digital objects into open formats for preservation.  +
The tool checks the hyperlinks on websites.  +
Cross-platform batch image converter  +
Open source PDF viewer that includes PDF information extractor and font analyzer  +
Y
Supports download of youtube videos, based on the now defunct YT-DL  +
YARA is a tool that allows the identification of files that match user-defined textual or binary patterns  +
Z
ZAR is Windows data recovery software.  +