Semantic search

Tool | Function | Purpose
3-Heights(TM) PDF Validator | Validation | 3-Heights(TM) PDF Validator from PDF-Tools AG.
7-Zip | Rendering, Transfer, Fixity | 7-Zip is a file archiver with a high compression ratio, and encryption and fixity check capabilities.
7train | Metadata Processing | XSLT 2.0 tool for generating METS files from XML input
ACE (Audit Control Environment) | Fixity | The Auditing Control Environment is a mature set of software designed to help libraries and archives prove their holdings are intact and trustworthy.
ADC Test | Validation | Tests and reports on audio analog-to-digital converters
ALTAG3D | File Management, Storage, Metadata Extraction, Personal Archiving | Open source archive software
ARK plugin for Omeka | Persistent Identification | Generating and resolving ARK identifiers for resources in Omeka
AVP Fixity | Fixity | Fixity monitoring for digital collections
Aaru Data Preservation Suite | Backup, Disk Imaging, Metadata Extraction | Media dump software and disc image manager
Adobe Photoshop Elements | Metadata Extraction, Metadata Processing, Personal Archiving | A commercial image editor with a metadata module (Organizer).
Apache PDFBox | Metadata Extraction, Repair, File Format Migration, Validation, Encryption Detection | Java PDF library for creation, manipulation, validation and content extraction of PDF documents
Apache POI - the Java API for Microsoft Documents | File Format Migration, Metadata Extraction, Encryption Detection | The Apache POI Project's mission is to create and maintain Java APIs for manipulating various file formats based upon the Office Open XML standards (OOXML) and Microsoft's OLE 2 Compound Document format (OLE2).
Apache Tika | Metadata Extraction, File Format Identification, Text Extraction, Embedded File Extraction | Java-based tool for identifying file formats using signatures and extracting metadata and text content from documents.
Archive::BagIt | Fixity, File Copy | BagIt API for Perl
ArchivesSpace | Access, Metadata Processing | ArchivesSpace is the next-generation web-based archives information management system, designed by archivists and supported by diverse archival repositories.
Archivists' Toolkit | Metadata Processing | An open source archival data management system to provide broad, integrated support for the management of archives
Archon | Access, Metadata Processing | Archon automatically publishes archival descriptive information and digital archival objects in a user-friendly website.
Ark service | Persistent Identification | ARK identifier generator in Python
AsTiffTagViewer | Quality Assurance | AsTiffTagViewer is a TIFF Tag Viewer application.
AudiAnnotate | Access, Annotation, Preservation System, Workflow, Academic Social Networking, Personal Archiving, Service, Version Control, Metadata Processing, Rendering, Discovery, Persistent Identification, Managing Active Research Data | Aims to make audio and its interpretations more discoverable and usable by extending the use of the newest IIIF (International Image Interoperability Framework) standard for audio. The AudiAnnotate web application, documented workflows and workshops facilitate the use of existing best-of-breed, open source tools for audio annotation (Sonic Visualiser), for public code and document repositories (GitHub), and for audio presentation (Universal Viewer) to produce, publish, and sustain shareable W3C Web Annotations for individual and collaborative audio projects.
BIL (BagIt Library) | Fixity, File Copy | BagIt Library is a Java software library that supports the creation, manipulation and validation of bags.
BWF MetaEdit | Metadata Extraction, Metadata Processing, Validation | BWF MetaEdit permits embedding, validating, and exporting of metadata in Broadcast WAVE Format (BWF) files.
Bad Peggy | Validation, Quality Assurance | Scans for damaged images and photos.
BagIt Transfer Utilities | Fixity, File Copy | BagIt Transfer Utilities are a collection of tools developed for the validation and transfer of bags.
Bagger | Fixity, Transfer | GUI application to facilitate the creation and verification of BagIt bags.
BitCurator | File Management, Fixity, Metadata Extraction, Metadata Processing, Quality Assurance, Validation, Workflow | The BitCurator Environment is an Ubuntu Linux distribution geared to the needs of archivists and librarians. It includes a suite of open source digital forensics and data analysis tools to help collecting institutions process born-digital materials.
BnL Mets Exporter | Metadata Processing | Command Line Interface (CLI) to export METS/ALTO documents to other formats.
Brunnhilde | Metadata Extraction, Content Profiling, Appraisal | Siegfried-based characterization of directories and disk images
C3PO | Content Profiling, Metadata Extraction | C3PO is a content profiling tool for visualization and preservation analysis
CSV Validator | Metadata Processing, Validation | Validation of CSV files against a user-defined schema
Checkit tiff | Quality Assurance, Validation | A tool to validate TIFF files against a given configuration profile
Checksum (by Corz) | Fixity | Fast hashing tool with a GUI
Cksum Unix command | Fixity | cksum computes a cyclic redundancy check (CRC) checksum for each given file, or standard input if none are given
Cloc | File Format Identification | Cloc (Count Lines of Code) not only counts lines of code but also guesses the programming language, so it can be used to identify files. It is an easy-to-use command line tool.
CloudCompare | File Format Migration, Multi Format Rendering, Metadata Extraction | CloudCompare is a tool for editing and processing 3D point clouds and triangular meshes.
Collectus | Metadata Processing | The UVa Library's Collectus digital object collector tool allows users to collect image or text objects from a repository.
ContextMiner | Metadata Processing, Web Capture | ContextMiner is a framework to collect, analyze, and present the contextual information along with the data.
Crazy-fast-image-scan | File Format Identification, Content Profiling, Forensic | A script to scan media very quickly to find out what kind of content it contains
Curator's Workbench | Metadata Processing | Curator's Workbench is a tool that automates and streamlines the process of preparing collections of digital materials for submission to a repository
CyberChef | Encryption Detection, File Management, Decryption, Metadata Extraction, Personal Archiving, Binary & Hexadecimal Editing, Discovery | A forensic tool with workflow capabilities to analyse files and containers
DART (Digital Archivist's Resource Tool) | Storage, File Management, Fixity, Transfer | Provides both a GUI and a command-line interface for packaging files and uploading them to remote repositories.
DBPTK Developer | Validation, File Format Migration | DBPTK Developer is a library and command-line tool for execution of database preservation actions
DIMAG | Metadata Extraction, Preservation System, Access, File Format Migration, Storage, Workflow | A software suite supporting archives with preservation of digital information for eternity
DIMAG IngestList | Metadata Extraction, Transfer | Accompanies the ingest process from donor to archive and logs process steps.
DNS | Metadata Processing, File Format Migration, Validation, Preservation System | DNS - DA NRW Software Suite
DPF Manager | Validation | A TIFF validity checker
DROID (Digital Record Object Identification) | File Format Identification, Metadata Extraction | DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats.
DUMPBIN Utility | Metadata Extraction, File Format Identification | The DUMPBIN utility, which is provided with the 32-bit version of Microsoft Visual C++, combines the abilities of the LINK, LIB, and EXEHDR utilities.
DV Analyzer | Quality Assurance, Metadata Processing | DV Analyzer is a technical quality control and reporting tool that examines DV streams in order to report errors in the tape-to-file transfer process.
DVRescue | Quality Assurance, Metadata Processing | DVRescue is archivist-made software that supports data migration from DV tapes into digital files suitable for long-term preservation.
DataCite | Persistent Identification, Managing Active Research Data, Citation and Impact Tracking | DataCite works with data centres to assign persistent identifiers to datasets using the Digital Object Identifier (DOI) infrastructure.
DbDIPview | Access, Quality Assurance, Redaction | Framework for packaging the database Representation Information and pre-configured user-friendly access. Different combinations of Content Data Objects are supported by an automated deployment mechanism. Enables access to the archived databases in the archive reading room for non-technical users.
Demystify | Metadata Extraction, Content Profiling, De-Duplication | Format Identification Analysis and Reporting
Dependency Discovery Tool | Dependency Analysis | The Dependency Discovery Tool searches through binary office files (.doc, .xls and .ppt) and tries to find any documents or files that are linked to the document.
Developer Tools in QA: Novice's Toolkit | Quality Assurance | A collaborative document which non-developers can adapt to record QA methods using built-in browser developer tools.
DiPS (Digital Preservation Solution) | Access, Active Data Storage, File Format Identification, File Format Migration, File Management, Metadata Extraction, Preservation System, Secure Deletion, Service, Storage, Transfer, Validation, Workflow | DiPS (OAIS compliant Digital Preservation Solution)
Directory List & Print | Metadata Extraction, File Management | A universal metadata extractor
Directory Report | Metadata Processing | Show disk usage, directory printer, find duplicate files, rename files, show file CRC and maintain your files - all in 1 tool
DiskFormatID | Disk Imaging, File Format Identification | Identify floppy disk formats from KryoFlux stream files
Disktype | Metadata Extraction, Disk Imaging | Tool for detecting the content format of a disk or disk image. It knows about common file systems, partition tables, and boot codes.
Docuteam packer | Appraisal, Data capture and Deposit, File Management, Fixity, Metadata Processing | Creates and edits SIPs
Docworks | OCR, Workflow, Quality Assurance | Document digitization workflow software
Double Commander | Fixity, File Copy, Batch Rename, File Management, De-Duplication | Open source file manager with two panels side by side
Duke Data Accessioner | File Copy, File Format Identification, Metadata Extraction, Transfer, Validation | Data Accessioner provides a graphical user interface to aid in migrating data from physical media to a dedicated file server, documenting the process and using MD5 checksums to identify any errors introduced in transfer.
EMET (Embedded Metadata Extraction Tool) | Metadata Extraction | EMET is a stand-alone tool designed to extract metadata embedded in JPEG and TIFF files.
EPADD | Metadata Processing, Metadata Extraction, Content Profiling, Access, Appraisal | ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.
EXE Explorer | Metadata Extraction | EXE Explorer reads and displays executable file properties and structure.
EXIF to DC XML normaliser | File Format Migration, Metadata Extraction | Extract EXIF data and normalise it to DC XML.
EZARK | Persistent Identification | ARK identifiers management tool and sub-publishers registry
EZID | Persistent Identification | EZID (easy-eye-dee) makes it easy to create and manage unique, persistent identifiers.
Easy CD-DA Extractor | File Format Migration, Metadata Extraction, Disk Imaging | Easy CD-DA Extractor is CD Ripper, Music Converter, Audio Converter, Metadata Editor, and CD/DVD burning software.
EchoDep Hub and Spoke Framework Tool Suite | Metadata Processing | Tool suite to manage digital content in multiple repository systems.
EmbARC | Metadata Processing, Quality Assurance | Internal file metadata management including embedding and validation
EpubCheck | Validation, Metadata Extraction, Encryption Detection | Validator for EPUB files
Exact Audio Copy | File Format Migration, Metadata Extraction, Disk Imaging | Exact Audio Copy is an audio grabber for audio CDs using standard CD and DVD-ROM drives on Windows only.
ExactFile | Fixity | Making sure that what you hash is what you get
Exempi | Metadata Extraction, Metadata Processing | Exempi is a library for handling XMP metadata, based on the Adobe XMP SDK
ExifTool | Metadata Processing, Metadata Extraction, Repair | Properties extraction, identification, metadata editing
Exiv2 | Metadata Processing | Exiv2 is a C++ library and a command line utility to manage image metadata.
FCIV | Fixity, Transfer | Generates and compares MD5 values stored in an XML file.
FFAStrans | Metadata Extraction, File Format Identification, File Format Migration, Quality Assurance, Workflow, Planning | Task automation engine, mostly used in audio and video content management.
FIDO (Format Identification for Digital Objects) | Metadata Extraction, File Format Identification | A PRONOM-based, command line, file format identification tool written in Python
FITS (File Information Tool Set) | File Format Identification, Validation, Metadata Extraction, Encryption Detection | FITS allows data curators to identify, validate, and extract technical metadata for the objects in their digital repository.
File Analyzer and Metadata Harvester V2 | File Management, Fixity, Metadata Extraction, Metadata Processing, Quality Assurance, Validation, Workflow | The File Analyzer is a general purpose desktop (and command line) tool designed to automate simple, file-based operations. The File Analyzer assembles a toolkit of tasks a user can perform. The tasks that have been written into the File Analyzer code base have been optimized for use by libraries, archives, and other cultural heritage institutions.
File Format Identification Pronom | File Format Identification | Perl API to analyze and handle DROID (PRONOM) signatures
FileAlyzer | Metadata Extraction | FileAlyzer allows a basic analysis of files (showing file properties and file contents in hex dump form) and is able to interpret common file contents such as resource structures (text, graphics, HTML, media and PE).
FileTrove | Metadata Extraction | FileTrove indexes files and creates metadata from them. The single binary application walks a directory tree and identifies all regular files by type with Siegfried.
FileVerifier++ | Fixity, De-Duplication | Windows utility for verifying file contents
Filestar | File Format Migration, Metadata Extraction, File Format Identification | Universal file converter for 900+ file types.
Fine Free File Command | File Format Identification | The open source implementation of the file(1) command that ships with every free operating system (OpenBSD, Linux, NetBSD, FreeBSD, etc.).
Fingerdet | Quality Assurance | QA tool for detecting fingers on digitised pages
Fixi | Fixity | Fixi is a command-line utility that indexes, verifies, and updates checksum information for collections of files.
Fixity Pro | Fixity | Fixity Pro is a desktop application for Windows and Mac that provides simple automated monitoring and reporting on the data integrity of files stored on your computer, removable storage devices, and mounted network storage locations. Use Fixity Pro to schedule routine scans that will tell you if your files have been changed and if any files have been added, removed, or moved/renamed since the last scan.
Flint | Validation, Encryption Detection | Validates a file against a policy, using common validation tools
Fq | Access, Validation, Binary & Hexadecimal Editing, Discovery, Repair, Quality Assurance, Policy, File Format Identification, File Recovery, Forensic, Metadata Extraction | Tool, language and decoders for working with binary data.
FreeCommander | File Management, De-Duplication, Fixity, File Copy | Split-screen file manager with desirable extras
GNU Diffutils | De-Duplication, Quality Assurance | GNU Diffutils is a package of several programs related to finding differences between files.
GNU libextractor | Metadata Extraction | GNU libextractor is a library used to extract metadata from files of arbitrary type.
Geosetter | Metadata Extraction, Metadata Processing, Personal Archiving | A tool that sets coordinates and edits all kinds of embedded image metadata.
GetID3() | Metadata Extraction | Extracts technical and embedded descriptive metadata from common multimedia file formats.
Goobi | Workflow, OCR, Planning, Quality Assurance | Workflow Management Tool
Greens | Persistent Identification | ARK identifiers minter and resolver
Greenstone | Access, Metadata Processing | A suite of software for building and distributing digital library collections
Gumshoe | Metadata Processing, Forensic | Search interface for metadata extracted from forensic disk images.
Gvfs-info | File Format Identification | gvfs-info - print information about files and directories
ICA-AtoM | Metadata Processing | ICA-AtoM allows organisations to create standards-based descriptions of their archival holdings and subsequently publish them to the Web.
IMacros | Quality Assurance, Web Capture | iMacros makes it easy to test web-based applications.
IText | Metadata Extraction | PDF library for manipulation, content extraction and creation
ImageVerifier | Metadata Processing, Validation, Quality Assurance | ImageVerifier (IV for short) traverses a hierarchy of folders looking for image files to verify. It can verify TIFFs, JPEGs, PSDs, DNGs, and non-DNG raws (e.g., NEF, CR2).
InBoxer | Metadata Extraction, Metadata Processing, Access | InBoxer is a next generation email archiving, IM archiving, e-discovery, and policy management system.
Index.dat Analyzer v2.5 | Metadata Extraction, Forensic | Index.dat Analyzer is a tool to view, examine and delete contents of index.dat files.
JHOVE (Harvard Object Validation Environment) | Validation, Metadata Extraction, File Format Identification, Encryption Detection | JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects.
JHOVE2 | Validation, Metadata Extraction, File Format Identification, Encryption Detection | JHOVE2 allows data curators to characterise the digital objects in their repositories.
JWAT | Metadata Extraction, Validation, File Format Migration | Java Web Archive Toolkit
JabRef | Metadata Processing, Annotation | Reference and bibliographic data manager
Jp2StructCheck | Validation, Quality Assurance, Metadata Extraction | Simple JP2 file structure checker
Jpylyzer | Validation, Quality Assurance, Metadata Extraction | JP2 validation and properties extraction
KOST-Simy | Validation, Quality Assurance | The KOST-Simy application is used to compare images.
KOST-Val | Validation, Quality Assurance | KOST-Val is an open source validator for different file formats and Submission Information Packages (SIPs).
Karen's Directory Printer | Metadata Processing | Karen's Directory Printer can print the name of every file on a drive, along with the file's size, date and time of last modification, and attributes (Read-Only, Hidden, System and Archive).
Keith Humphreys' PhraseRate | Metadata Extraction | PhraseRate is a program, developed by Keith Humphreys, for extracting a set of meaningful, attractive keywords and key phrases from a web page describing the content of that page.
KoLibRI (Kopal Library for Retrieval and Ingest) | Metadata Processing, Pre-ingest SIP Builder | The kopal Library for Retrieval and Ingest (koLibRI) represents a library of Java tools that have been developed for the interaction with the DIAS system of IBM within the kopal project.
Libmagic-dev | File Format Identification | This library can be used to classify files according to magic number tests.
Library (xklb) | File Management, Quality Assurance, Web Capture | Media indexing multi-tool
Libsharedmime | File Format Identification | This is an implementation for libsharedmime.
Limb Processing | Metadata Processing, OCR | Software for processing, enhancing and converting cultural heritage into digital cultural heritage
Lingfo | File Format Migration, Metadata Extraction | Lingfo provides a library for developers to use to extract information from Microsoft Excel spreadsheet files.
MD5Checker | Fixity | Lightweight Windows Desktop application to create and check MD5 Digests for one or several files.
METS API | Metadata Processing | The METS API is a Java API designed to aid developers in the processing and assembly of METS Documents.
METS Navigator | Rendering, Metadata Processing | METS-based system for displaying and navigating sets of page images or other multi-part digital objects.
METS Reader Writer | Metadata Extraction, Metadata Processing | Python library for processing and outputting METS/PREMIS XML according to the Archivematica METS profile.
MP3::Tag | Metadata Extraction | MP3::Tag is a module for reading tags of MP3 audio files.
MP3val | Validation, Quality Assurance | MP3val is a small, high-speed, free software tool for checking MPEG audio files' integrity.
MailStore Home | Metadata Extraction, Metadata Processing, File Management, Discovery | Unifies your private emails into one searchable, platform-independent repository
Matchbox Tool | Quality Assurance, De-Duplication | Matchbox: Duplicate detection tool for digital document collections.
Md5deep and hashdeep | Fixity | md5deep is a set of programs to compute MD5, SHA-1, SHA-256, Tiger, or Whirlpool message digests on an arbitrary number of files. hashdeep is a program to compute, match, and audit hashsets.
Md5sum Unix command | Fixity | md5sum computes a 128-bit checksum (or fingerprint or message-digest) for each specified file.
Md5summer | Fixity | MD5summer is an application for Microsoft Windows 9x, NT, ME, 2000 and XP which generates and verifies MD5 checksums.
Mdqc | Metadata Extraction, Metadata Processing, Quality Assurance | Tool for managing and comparing digital asset metadata
MediaConch | File Format Identification, Policy, Validation | MediaConch is file validation software.
MediaInfo | Metadata Extraction | Supplies technical and tag information about a video or audio file.
Metadata Extraction Tool | Metadata Extraction | Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
Metadata Interrogator | Metadata Extraction | The Metadata Interrogator is a standalone, offline GUI tool for extracting and analysing metadata from a wide variety of file formats.
Metadata transformer | Metadata Extraction | A simple tool for creating new CSV and HTML reports based on the metadata files generated by the Data Accessioner
Metadata++ | Metadata Extraction, Metadata Processing, Personal Archiving | Freeware tool to view, edit, modify, extract, copy metadata of various formats.
Metadata2Go | Metadata Extraction | Web-based EXIF data viewer
Minimum Preservation Tool | Fixity, Preservation System | The Minimum Preservation Tool (MPT) can be used to create an interim preservation storage environment for files awaiting preservation in a longer term repository solution. It supports checksum generation, fixity checking, and replication across two or more storage nodes.
NARA File Analyzer and Metadata Harvester | Fixity, Metadata Extraction, File Format Identification | NARA File Analyzer and Metadata Harvester allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories.
NARA Video Frame Analyzer | Metadata Extraction, Quality Assurance | NARA Video Frame Analyzer analyzes technical properties of individual frames of a video file in order to detect quality issues within digitized video files.
NESSTAR | Metadata Processing, Service | Nesstar suite is an online publishing platform for organisations wishing to share datasets both internally and with the wider web.
NOID (Ruby) | Persistent Identification | A version of NOID in Ruby
Namalysator | Metadata Processing, Quality Assurance, Validation | Tool for METS/ALTO validation and quality control
Nanite | File Format Identification, Metadata Extraction | A friendly swarm of format-identifying robots
Nice Opaque Identifiers (NOID) | Persistent Identification | Identifiers management tool to generate, bind and resolve different kinds of identifiers
Noids | Persistent Identification | Identifiers management tool
Nuclear Processor | Dependency Analysis | Process/module manager for Windows, with features such as Kill/Resume/Suspend thread of a process and unload DLL files
NumaHOP | Quality Assurance, OCR | Platform for digitization projects management
ODF Validator | Validation, Metadata Extraction | ODF Validator is a tool that validates OpenDocument files and checks them for certain conformance criteria.
Officeparser.py | Metadata Extraction, File Format Identification | officeparser.py is a Python script that parses the format of OLE compound documents used by Microsoft Office applications.
Ohcount | File Format Identification | Analyses plain text files, looking for code (scripting languages etc.)
Omeka Identity plugin | Persistent Identification | Plugin for Omeka to assign ARK identifiers
OpenJPEG | File Format Migration, Metadata Extraction | The OpenJPEG library is an open-source JPEG 2000 codec written in C.
OpenRefine | Metadata Processing | For dealing with messy data, cleaning it and transforming it
OpenWMS (Workflow Management System for Digital Objects) | Metadata Processing | The OpenWMS is a platform-independent, open source, web-accessible system that can be used as a standalone application or integrated with other repository architectures by a wide range of organizations.
PAIRTREE Library | Metadata Processing | Software library that supports the mapping between identifiers and filepaths according to the Pairtree Curation Microservices Specification.
PDF Tools (by Didier Stevens) | Metadata Extraction, Dependency Analysis, Validation | Tools for parsing and analysing PDF documents
PDFTron PDF-A Manager | Validation, File Format Migration | PDF/A Manager is a PDF/A (ISO 19005) validation and conversion software.
PET (PERICLES Extraction Tool) | Metadata Extraction, Dependency Analysis | A tool to capture contextual information in a sheer curation scenario
PREMIS Utility | Metadata Processing | The PREMIS Utility is a graphical program used to generate PREMIS metadata records for use in digital preservation systems and digital asset management systems in JSON and XML format, and attempts to cover gaps not programmatically generated by system logs.
PRONOM Signature Development Utility | File Format Identification | Output DROID compatible file format signature files using PRONOM syntax
Package Handler | File Management, Metadata Processing, Personal Archiving, Validation, Appraisal | View, create, edit, and validate Swiss archival packages
Pagelyzer | Metadata Extraction, Quality Assurance | Suite of tools for detecting changes in web pages and their rendering
PdfaPilot | Validation, Metadata Extraction, File Format Migration | pdfaPilot: Conversion of documents and emails into robust, searchable PDF or PDF/A files
Pdfcpu | Validation, Metadata Extraction | A Go library and command line tool for PDF processing, including validation
Pdftk | Metadata Extraction, Repair | PDF manipulation tool
Peepdf | Metadata Extraction | peepdf is a Python tool to explore PDF files in order to find out if the file can be harmful or not.
PiM (PREMIS in METS) Toolbox | File Format Migration, Metadata Processing, Validation | PREMIS in METS Toolbox was developed to support the implementation of PREMIS in the METS container format.
Pre-Ingest Tool | Metadata Extraction, Metadata Processing, Validation | A tool for generating an OAIS SIP for digital preservation. It produces a METS document that contains metadata for digital preservation.
Puremagic | File Format Identification | Puremagic is a cross-platform pure Python module that will identify a file based on its magic numbers
Python DPX validator | Validation | A lightweight DPX file format validator.
Python XMP Toolkit | Metadata Extraction, Metadata Processing | Library for working with XMP metadata, as well as reading/writing XMP metadata stored in many different file formats
Python checkm package | Fixity | This is a Python implementation of the checkm specification.
QCTools | Quality Assurance | Digitized analog video analysis
Qpdf | Metadata Extraction, Decryption | QPDF is a command-line program that does structural, content-preserving transformations on PDF files
RATOM | Appraisal, Discovery, Metadata Extraction | Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks
RE (Rename Expert) | Metadata Processing, File Management | Controlled renaming of file collections
RODA-In | Transfer, Fixity, Appraisal | The tool creates SIPs from files and folders available on the local file system.
ReACT (Resource Audit and Comparison Tool) | File Management, Quality Assurance | A file audit and comparison tool using Microsoft Excel and VBA.
ReDBox | Metadata Processing, Managing Active Research Data | ReDBox and Mint are two complementary applications designed to create, store, and provide access to research metadata.
Rhash | Fixity | RHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files.
Riprap | Fixity | Riprap is a PREMIS-compliant fixity checking microservice.
Rosetta | Preservation System, Access, Metadata Processing, File Format Migration | Ex Libris Rosetta enables institutions to preserve and provide access to the collections in their care.
SAFE Archive Audit System | Fixity, Storage | Policy-based replication and auditing of LOCKSS networks.
SIARD-VAL | Validation, Quality Assurance | SIARD-Val is an open source validator for SIARD files.
SIARDexcerpt | Quality Assurance, Access | SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files.
SSDeep | Fixity, De-Duplication | Recursive piecewise hashing tool
SafeMover | Data capture and Deposit, Transfer, Fixity | Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity and OS file system metadata.
Sheeko | Annotation, Metadata Extraction | Machine learning implementation package to generate descriptive metadata for digitized historical images.
Shotwell | Metadata Processing, Personal Archiving, Annotation | An open source photo manager capable of describing image collections for archival ingest.
Siegfried | File Format Identification | A PRONOM-based, command line, file format identification tool using Aho-Corasick matching and no buffer limits.
Smithsonian Cook | File Format Migration, Metadata Extraction, Metadata Processing, Workflow, Rendering | Processing of 3D model, mesh, and texture data including the option to define custom processing workflows, where a set of files is processed by multiple tools.
SobekCM | Access, Discovery, Metadata Processing, Preservation System, Quality Assurance | SobekCM is a digital repository and digital scholarship/publishing system which enables easy deposit, preservation, and access for all types of digital content, tailored to the needs of galleries, libraries, archives, museums, scholars, and researchers.
SobekCM METS Editor | Metadata Processing | Creation of METS documents from a folder of items with bibliographic metadata.
Sumfolder1 | Appraisal, De-Duplication, Fixity | sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level.
TIFF-Val | Validation, Quality Assurance | TIFF-Val is an open source validator for TIFF files.
TOMES (Transforming Online Mail with Embedded Semantics) | File Format Migration, Content Profiling, Metadata Processing, Data capture and Deposit | A package of open source tools for handling the preservation of government email records
TrID File Identifier | File Format Identification | TrID is a utility designed to identify file types from their binary signatures.
Tree | Metadata Processing, File Management, Appraisal | Tree displays the directory structure of a path or of the disk in a drive graphically.
UKWA Access API | Persistent Identification, Service | Web archives access API
UKWA GSuite Add-On | Appraisal, Validation | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against the UK Web Archive, the UK Government Web Archive, and the Internet Archive.
USGS Formal metadata: information and software | Metadata Processing | This page links to information and tools from the USGS.
UnArchiver | Decryption, Fixity, Transfer | UnArchiver is a native macOS utility which supports many more archive formats than other common archiving utilities.
VRenamer | File Management, Metadata Processing | vRenamer is a cross-platform tool for batch renaming files
VeraPDF | Validation | PDF/A validation tool
Voyeur | Access, Metadata Processing | Voyeur is a web-based text analysis environment that can use texts in a variety of formats, from different locations to perform lexical analysis, export data to other tools, and embed live tools into remote websites.
W3C Markup Validation Service | Validation | This is the World Wide Web Consortium's validation tool.
WCT (Web Curator Tool) | Metadata Processing, Web Capture | Web Curator Tool (WCT) is a workflow management application for selective web archiving.
Warctools | Metadata Extraction, Validation, File Format Migration | Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
Web Archive Discovery | Metadata Extraction, File Format Identification, Content Profiling, Discovery | Indexing and discovery tools for web archives.
WebCite | Persistent Identification, Web Capture, Citation and Impact Tracking | WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material.
WordHoard | Metadata Extraction, Access | WordHoard is an application for the close reading and scholarly analysis of deeply tagged texts.
XMLstarlet | Metadata Processing | A set of command line utilities (tools) to transform, query, validate, and edit XML documents and files
XcorrSound | De-Duplication, Quality Assurance | The xcorrSound package compares sound waves using cross correlation.
Xpdf | Metadata Extraction, Rendering | Open source PDF viewer that includes PDF information extractor and font analyzer
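
Many of the Fixity entries in this list describe the same basic routine: record a checksum for every file, then recompute the checksums later and report what has changed, gone missing, or been added. The following is a minimal Python sketch of that routine using only the standard library. It is an illustration under stated assumptions, not the implementation of any tool listed here; the function names (`file_digest`, `build_manifest`, `audit`) and the choice of SHA-256 are hypothetical.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, algorithm: str = "sha256") -> str:
    """Return the hex digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path, algorithm: str = "sha256") -> dict[str, str]:
    """Map each file path (relative to root, POSIX style) to its checksum."""
    return {
        p.relative_to(root).as_posix(): file_digest(p, algorithm)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def audit(root: Path, manifest: dict[str, str], algorithm: str = "sha256") -> dict[str, list[str]]:
    """Compare the current state of root with a previously stored manifest."""
    current = build_manifest(root, algorithm)
    return {
        # Present in both, but the checksum no longer matches.
        "changed": sorted(k for k in manifest if k in current and current[k] != manifest[k]),
        # Recorded in the manifest but no longer on disk.
        "missing": sorted(k for k in manifest if k not in current),
        # On disk now but absent from the manifest.
        "added": sorted(k for k in current if k not in manifest),
    }
```

In use, `build_manifest(Path("collection"))` would be run once and its result stored (for example as JSON), and `audit` would be run against that stored manifest on a schedule. The directory name is a placeholder. A moved or renamed file shows up here as one "missing" and one "added" entry; detecting renames would require an extra step of matching entries by checksum.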