Semantic search

| Tool | Function | Purpose |
|---|---|---|
| Python XMP Toolkit | Metadata Extraction; Metadata Processing | Library for working with XMP metadata, as well as reading/writing XMP metadata stored in many different file formats. |
| Python checkm package | Fixity | This is a Python implementation of the checkm specification. |
| QCTools | Quality Assurance | Digitized analog video analysis. |
| Qpdf | Metadata Extraction; Decryption | QPDF is a command-line program that does structural, content-preserving transformations on PDF files. |
| RATOM | Appraisal; Discovery; Metadata Extraction | Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks. |
| RE (Rename Expert) | Metadata Processing; File Management | Controlled renaming of file collections. |
| RODA-In | Transfer; Fixity; Appraisal | The tool creates SIPs from files and folders available on the local file system. |
| ReACT (Resource Audit and Comparison Tool) | File Management; Quality Assurance | A file audit and comparison tool using Microsoft Excel and VBA. |
| ReDBox | Metadata Processing; Managing Active Research Data | ReDBox and Mint are two complementary applications designed to create, store, and provide access to research metadata. |
| Rhash | Fixity | RHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files. |
| Riprap | Fixity | Riprap is a PREMIS-compliant fixity checking microservice. |
| Rosetta | Preservation System; Access; Metadata Processing; File Format Migration | Ex Libris Rosetta enables institutions to preserve and provide access to the collections in their care. |
| SAFE Archive Audit System | Fixity; Storage | Policy-based replication and auditing of LOCKSS networks. |
| SIARD-VAL | Validation; Quality Assurance | SIARD-Val is an open source validator for SIARD files. |
| SIARDexcerpt | Quality Assurance; Access | SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files. |
| SSDeep | Fixity; De-Duplication | Recursive piecewise hashing tool. |
| SafeMover | Data capture and Deposit; Transfer; Fixity | Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity and OS file system metadata. |
| Sheeko | Annotation; Metadata Extraction | Machine learning implementation package to generate descriptive metadata for digitized historical images. |
| Shotwell | Metadata Processing; Personal Archiving; Annotation | An open source photo manager capable of describing image collections for archival ingest. |
| Siegfried | File Format Identification | A PRONOM-based, command-line file format identification tool using Aho-Corasick matching and no buffer limits. |
| Smithsonian Cook | File Format Migration; Metadata Extraction; Metadata Processing; Workflow; Rendering | Processing of 3D model, mesh, and texture data, including the option to define custom processing workflows in which a set of files is processed by multiple tools. |
| SobekCM | Access; Discovery; Metadata Processing; Preservation System; Quality Assurance | SobekCM is a digital repository and digital scholarship/publishing system which enables easy deposit, preservation, and access for all types of digital content, tailored to the needs of galleries, libraries, archives, museums, scholars, and researchers. |
| SobekCM METS Editor | Metadata Processing | Creation of METS documents from a folder of items with bibliographic metadata. |
| Sumfolder1 | Appraisal; De-Duplication; Fixity | sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level. |
| TIFF-Val | Validation; Quality Assurance | TIFF-Val is an open source validator for TIFF files. |
| TOMES (Transforming Online Mail with Embedded Semantics) | File Format Migration; Content Profiling; Metadata Processing; Data capture and Deposit | A package of open source tools for handling the preservation of government email records. |
| TrID File Identifier | File Format Identification | TrID is a utility designed to identify file types from their binary signatures. |
| Tree | Metadata Processing; File Management; Appraisal | Tree graphically displays the directory structure of a path or of the disk in a drive. |
| UKWA Access API | Persistent Identification; Service | Web archives access API. |
| UKWA GSuite Add-On | Appraisal; Validation | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against the UK Web Archive, the UK Government Web Archive, and the Internet Archive. |
| USGS Formal metadata: information and software | Metadata Processing | This page links to information and tools from the USGS. |
| UnArchiver | Decryption; Fixity; Transfer | UnArchiver is a native macOS utility which supports many more archive formats than other common archiving utilities. |
| VRenamer | File Management; Metadata Processing | vRenamer is a cross-platform tool for batch renaming files. |
| VeraPDF | Validation | PDF/A validation tool. |
| Voyeur | Access; Metadata Processing | Voyeur is a web-based text analysis environment that can use texts in a variety of formats and from different locations to perform lexical analysis, export data to other tools, and embed live tools into remote websites. |
| W3C Markup Validation Service | Validation | This is the World Wide Web Consortium's validation tool. |
| WCT (Web Curator Tool) | Metadata Processing; Web Capture | Web Curator Tool (WCT) is a workflow management application for selective web archiving. |
| Warctools | Metadata Extraction; Validation; File Format Migration | Command-line tools and libraries for handling and manipulating WARC files (and HTTP contents). |
| Web Archive Discovery | Metadata Extraction; File Format Identification; Content Profiling; Discovery | Indexing and discovery tools for web archives. |
| WebCite | Persistent Identification; Web Capture; Citation and Impact Tracking | WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on its own servers and assigning unique identifiers to those instances of the material. |
| WordHoard | Metadata Extraction; Access | WordHoard is an application for the close reading and scholarly analysis of deeply tagged texts. |
| XMLstarlet | Metadata Processing | A set of command-line utilities (tools) to transform, query, validate, and edit XML documents and files. |
| XcorrSound | De-Duplication; Quality Assurance | The xcorrSound package compares sound waves using cross correlation. |
| Xpdf | Metadata Extraction; Rendering | Open source PDF viewer that includes a PDF information extractor and font analyzer. |
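
Several of the fixity entries above, sumfolder1 in particular, rest on the idea of deriving a checksum for a whole directory from the checksums of its contents. The following is a minimal Python sketch of that general idea only. It is not sumfolder1's actual algorithm or output format: the function names `file_digest` and `directory_digests`, the choice of SHA-256, and the rule of hashing the sorted child digests are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 hex digest of one file, read in chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def directory_digests(root: Path) -> dict[str, str]:
    """Map each directory under root (relative POSIX path) to a content digest.

    Hypothetical scheme: a directory's digest is the SHA-256 of the
    concatenated, sorted digests of its files and subdirectories.
    File and folder names are deliberately ignored.
    """
    digests: dict[str, str] = {}

    def walk(directory: Path) -> str:
        children = []
        for entry in directory.iterdir():
            if entry.is_symlink():
                continue  # skip symlinks to avoid loops and double counting
            if entry.is_dir():
                children.append(walk(entry))
            elif entry.is_file():
                children.append(file_digest(entry))
        # Sorting makes the result independent of listing order and of names.
        combined = "".join(sorted(children)).encode("ascii")
        digest = hashlib.sha256(combined).hexdigest()
        digests[directory.relative_to(root).as_posix()] = digest
        return digest

    walk(root)
    return digests
```

In this sketch the entry keyed `"."` plays the role of an overall "collection" checksum for the tree, and two folders that hold the same file contents receive the same digest even if the file names differ, which is what makes folder-level de-duplication checks possible. A real tool may well include names or other metadata in the hash, so digests from this sketch should not be compared with those produced by any of the listed utilities.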