Semantic search
Jump to navigation
Jump to search
Tool | Function | Purpose |
---|---|---|
Goobi | Workflow OCR Planning Quality Assurance | Workflow Management Tool |
Greens | Persistent Identification | ARK identifiers minter and resolver |
Greenstone | Access Metadata Processing | A suite of software for building and distributing digital library collections |
Gumshoe | Metadata Processing Forensic | Search interface for metadata extracted from forensic disk images. |
Gvfs-info | File Format Identification | gvfs-info - print information about files and directories |
ICA-AtoM | Metadata Processing | ICA-AtoM allows organisations to create standards-based descriptions of their archival holdings and subsequently publish them to the Web. |
IMacros | Quality Assurance Web Capture | iMacros makes it easy to test web-based applications. |
IText | Metadata Extraction | PDF library for manipulation, content extraction and creation |
ImageVerifier | Metadata Processing Validation Quality Assurance | ImageVerifier (IV for short) traverses a hierarchy of folders looking for image files to verify. It can verify TIFFs, JPEGs. PSDs, DNGs, and non-DNG raws (e.g., NEF, CR2). |
InBoxer | Metadata Extraction Metadata Processing Access | InBoxer is a next generation email archiving, IM archiving, e-discovery, and policy management system. |
Index.dat Analyzer v2.5 | Metadata Extraction Forensic | Index.dat Analyzer is a tool to view, examine and delete contents of index.dat files. |
JHOVE (Harvard Object Validation Environment) | Validation Metadata Extraction File Format Identification Encryption Detection | JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. |
JHOVE2 | Validation Metadata Extraction File Format Identification Encryption Detection | JHOVE2 allows data curators to characterise the digital objects in their repositories. |
JWAT | Metadata Extraction Validation File Format Migration | Java Web Archive Toolkit |
JabRef | Metadata Processing Annotation | Reference and bibliographic data manager |
Jp2StructCheck | Validation Quality Assurance Metadata Extraction | Simple JP2 file structure checker |
Jpylyzer | Validation Quality Assurance Metadata Extraction | JP2 validation + properties extraction |
KOST-Simy | Validation Quality Assurance | The KOST-Simy application is used for Compare Images. |
KOST-Val | Validation Quality Assurance | KOST-Val is an open source validator for different file formats and Submission Information Package (SIP). |
Karen's Directory Printer | Metadata Processing | Karen's Directory Printer can print the name of every file on a drive, along with the file's size, date and time of last modification, and attributes (Read-Only, Hidden, System and Archive). |
Keith Humphreys' PhraseRate | Metadata Extraction | PhraseRate is a program, developed by Keith Humphreys, for extracting a set of meaningful, attractive keywords and key phrases from a web page describing the content of that page. |
KoLibRI (Kopal Library for Retrieval and Ingest) | Metadata Processing Pre-ingest SIP Builder | The kopal Library for Retrieval and Ingest (koLibRI) represents a library of Java tools that have been developed for the interaction with the DIAS system of IBM within the kopal project. |
Libmagic-dev | File Format Identification | This library can be used to classify files according to magic number tests. |
Library (xklb) | File Management Quality Assurance Web Capture | Media indexing multi-tool |
Libsharedmime | File Format Identification | This is an implementation for libsharedmime. |
Limb Processing | Metadata Processing OCR | Software for processing, enhancing and converting cultural heritage into digital cultural heritage |
Lingfo | File Format Migration Metadata Extraction | Lingfo provides a library for developers to use to extract information from Microsoft Excel spreadsheet files. |
MD5Checker | Fixity | Lightweight Windows Desktop application to create and check MD5 Digests for one or several files. |
METS API | Metadata Processing | The METS API is a Java API designed to aid developers in the processing and assembly of METS Documents. |
METS Navigator | Rendering Metadata Processing | METS-based system for displaying and navigating sets of page images or other multi-part digital objects. |
METS Reader Writer | Metadata Extraction Metadata Processing | Python library for processing and outputting METS/PREMIS XML according to the Archivematica METS profile. |
MP3::Tag | Metadata Extraction | MP3::Tag is a module for reading tags of MP3 audio files. |
MP3val | Validation Quality Assurance | MP3val is a small, high-speed, free software tool for checking MPEG audio files' integrity. |
MailStore Home | Metadata Extraction Metadata Processing File Management Discovery | Unifies your private emails into one searchable, platform-independent repository |
Matchbox Tool | Quality Assurance De-Duplication | Matchbox: Duplicate detection tool for digital document collections. |
Md5deep and hashdeep | Fixity | md5deep is a set of programs to compute MD5, SHA-1, SHA-256, Tiger, or Whirlpool message digests on an arbitrary number of files. hashdeep is a program to compute, match, and audit hashsets. |
Md5sum Unix command | Fixity | md5sum computes a 128-bit checksum (or fingerprint or message-digest) for each specified file. |
Md5summer | Fixity | MD5summer is an application for Microsoft Windows 9x, NT, ME, 2000 and XP which generates and verifies md5 checksums. |
Mdqc | Metadata Extraction Metadata Processing Quality Assurance | Tool for managing and comparing digital asset metadata |
MediaConch | File Format Identification Policy Validation | MediaConch is a file validation software. |
MediaInfo | Metadata Extraction | Supplies technical and tag information about a video or audio file. |
Metadata Extraction Tool | Metadata Extraction | Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files. |
Metadata Interrogator | Metadata Extraction | The Metadata Interrogator is a standalone, offline GUI tool for extracting and analysing metadata from a wide variety of file formats. |
Metadata transformer | Metadata Extraction | A simple tool for creating new CSV and HTML reports based on the metadata files generated by the Data Accessioner |
Metadata++ | Metadata Extraction Metadata Processing Personal Archiving | Freeware tool to view, edit, modify, extract, copy metadata of various formats. |
Metadata2Go | Metadata Extraction | Web-based EXIF data viewer |
Minimum Preservation Tool | Fixity Preservation System | The Minimum Preservation Tool (MPT) can be used to create an interim preservation storage environment for files awaiting preservation in a longer term repository solution. It supports checksum generation, fixity checking, and replication across two or more storage nodes. |
NARA File Analyzer and Metadata Harvester | Fixity Metadata Extraction File Format Identification | NARA File Analyzer and Metadata Harvester allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories. |
NARA Video Frame Analyzer | Metadata Extraction Quality Assurance | NARA Video Frame Analyzer analyzes technical properties of individual frames of a video file in order to detect quality issues within digitized video files. |
NESSTAR | Metadata Processing Service | Nesstar suite is an online publishing platform for organisations wishing to share datasets both internally and with the wider web. |
NOID (Ruby) | Persistent Identification | A version of NOID in Ruby |
Namalysator | Metadata Processing Quality Assurance Validation | Tool for METS/ALTO validation and quality control |
Nanite | File Format Identification Metadata Extraction | A friendly swarm of format-identifying robots |
Nice Opaque Identifiers (NOID) | Persistent Identification | Identifiers management tool to generate, bind and resolve different kinds of identifiers |
Noids | Persistent Identification | Identifiers management tool |
Nuclear Processor | Dependency Analysis | Process/module manager for Windows, with features such as Kill/Resume/Suspend thread of a process and unload DLL files |
NumaHOP | Quality Assurance OCR | Platform for digitization projects management |
ODF Validator | Validation Metadata Extraction | ODF Validator is a tool that validates OpenDocument files and checks them for certain conformance criteria. |
Officeparser.py | Metadata Extraction File Format Identification | officerparser.py is a python script that parses the format of OLE compound documents used by Microsoft Office applications. |
Ohcount | File Format Identification | Analyses plain text files, looking for code (scripting languages etc.) |
Omeka Identity plugin | Persistent Identification | Plugin for Omeka to assign ARK identifiers |
OpenJPEG | File Format Migration Metadata Extraction | The OpenJPEG library is an open-source JPEG 2000 codec written in C language. |
OpenRefine | Metadata Processing | For dealing with messy data, cleaning it and transforming it |
OpenWMS (Workflow Management System for Digital Objects) | Metadata Processing | The OpenWMS is a platform-independent, open source, web-accessible system that can be used as a standalone application or integrated with other repository architectures by a wide range of organizations. |
PAIRTREE Library | Metadata Processing | software library that supports the mapping between identifiers and filepaths according to the Pairtree Curation Microservices Specification. |
PDF Tools (by Didier Stevens) | Metadata Extraction Dependency Analysis Validation | Tools for parsing and analysing PDF documents |
PDFTron PDF-A Manager | Validation File Format Migration | PDF/A Manager is a PDF/A (ISO 19005) validation and conversion software. |
PET (PERICLES Extraction Tool) | Metadata Extraction Dependency Analysis | A tool to capture contextual information in a sheer curation scenario |
PREMIS Utility | Metadata Processing | The PREMIS Utility is a graphical program used to generate PREMIS metadata records for use in digital preservation systems and digital asset management systems in JSON and XML format, and attempts to cover gaps not programmatically generated by system logs. |
PRONOM Signature Development Utility | File Format Identification | Output DROID compatible file format signature files using PRONOM syntax |
Package Handler | File Management Metadata Processing Personal Archiving Validation Appraisal | View, create, edit, and validate Swiss archival packages |
Pagelyzer | Metadata Extraction Quality Assurance | Suite of tools for detecting changes in web pages and their rendering |
PdfaPilot | Validation Metadata Extraction File Format Migration | pdfaPilot: Conversion of documents and emails into robust, searchable PDF or PDF/A files |
Pdfcpu | Validation Metadata Extraction | A Go library and command line tool for PDF processing incl. validation |
Pdftk | Metadata Extraction Repair | PDF manipulation tool |
Peepdf | Metadata Extraction | peepdf is a Python tool to explore PDF files in order to find out if the file can be harmful or not. |
PiM (PREMIS in METS) Toolbox | File Format Migration Metadata Processing Validation | PREMIS in METS Toolbox was developed to support the implementation of PREMIS in the METS container format. |
Pre-Ingest Tool | Metadata Extraction Metadata Processing Validation | A tool for generating an OAIS SIP for digital preservation. It produces METS document that contains metadata for digital preservation. |
Puremagic | File Format Identification | Puremagic is a cross-platform pure python module that will identify a file based off it's magic numbers |
Python DPX validator | Validation | A lightweight DPX file format validator. |
Python XMP Toolkit | Metadata Extraction Metadata Processing | Library for working with XMP metadata, as well as reading/writing XMP metadata stored in many different file formats |
Python checkm package | Fixity | This is a Python implementation of the checkm specification. |
QCTools | Quality Assurance | Digitized analog video analysis |
Qpdf | Metadata Extraction Decryption | QPDF is a command-line program that does structural, content-preserving transformations on PDF files |
RATOM | Appraisal Discovery Metadata Extraction | Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks |
RE (Rename Expert) | Metadata Processing File Management | Controlled renaming of file collections |
RODA-In | Transfer Fixity Appraisal | The tool creates SIPs from files and folders available on the local file system. |
ReACT (Resource Audit and Comparison Tool) | File Management Quality Assurance | A file audit and comparison tool using Microsoft Excel and VBA. |
ReDBox | Metadata Processing Managing Active Research Data | ReDBox and Mint are two complimentary applications designed to create, store, and provide access to research metadata. |
Rhash | Fixity | RHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files. |
Riprap | Fixity | Riprap is a PREMIS-compliant fixity checking microservice. |
Rosetta | Preservation System Access Metadata Processing File Format Migration | Ex Libris Rosetta enables institutions to preserve and provide access to the collections in their care. |
SAFE Archive Audit System | Fixity Storage | Policy-based replication and Auditing of LOCKSS networks. |
SIARD-VAL | Validation Quality Assurance | SIARD-Val is an open source validator for SIARD files. |
SIARDexcerpt | Quality Assurance Access | SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files. |
SSDeep | Fixity De-Duplication | Recursive piecewise hashing tool |
SafeMover | Data capture and Deposit Transfer Fixity | Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity, and OS file system metadata. |
Sheeko | Annotation Metadata Extraction | Machine learning implementation package to generate descriptive metadata for digitized historical images. |
Shotwell | Metadata Processing Personal Archiving Annotation | An open source photo manager capable of describing image collections for archival ingest. |
Siegfried | File Format Identification | A PRONOM based, command line, file format identification tool using Aho Corasick matching and no buffer limits. |
Smithsonian Cook | File Format Migration Metadata Extraction Metadata Processing Workflow Rendering | Processing of 3D model, mesh, and texture data including the option to define custom processing workflows, where a set of files is processed by multiple tools. |
SobekCM | Access Discovery Metadata Processing Preservation System Quality Assurance | SobekCM is a digital repository and digital scholarship/publishing system which enables easy deposit, preservation, and access for all types of digital content, tailored to the needs of galleries, libraries, archives, museums, scholars, and researchers. |
SobekCM METS Editor | Metadata Processing | Creation of METS documents from a folder of items with bibliographic metadata. |
Sumfolder1 | Appraisal De-Duplication Fixity | sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level. |
TIFF-Val | Validation Quality Assurance | TIFF-Val is an open source validator for TIFF files. |
TOMES (Transforming Online Mail with Embedded Semantics) | File Format Migration Content Profiling Metadata Processing Data capture and Deposit | A package of open source tools for handling the preservation of government email records |
TrID File Identifier | File Format Identification | TrID is a utility designed to identify file types from their binary signatures. |
Tree | Metadata Processing File Management Appraisal | Tree displays the directory structure of a path or of the disk in a drive graphically. |
UKWA Access API | Persistent Identification Service | Web archives access API |
UKWA GSuite Add-On | Appraisal Validation | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against:
|
USGS Formal metadata: information and software | Metadata Processing | This page links to information and tools from the USGS. |
UnArchiver | Decryption Fixity Transfer | UnArchiver is a native macOS utility which supports infinitely more archive formats then other common archiving utilities. |
VRenamer | File Management Metadata Processing | vRenamer is a cross platform tool for batch renaming files |
VeraPDF | Validation | PDF/A validation tool |
Voyeur | Access Metadata Processing | Voyeur is a web-based text analysis environment that can use texts in a variety of formats, from different locations to perform lexical analysis, export data to other tools, and embed live tools into remote websites. |
W3C Markup Validation Service | Validation | This is the World Wide Web Consortium's validation tool. |
WCT (Web Curator Tool) | Metadata Processing Web Capture | Web Curator Tool (WCT) is a workflow management application for selective web archiving. |
Warctools | Metadata Extraction Validation File Format Migration | Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) |
Web Archive Discovery | Metadata Extraction File Format Identification Content Profiling Discovery | Indexing and discovery tools for web archives. |
WebCite | Persistent Identification Web Capture Citation and Impact Tracking | WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material. |
WordHoard | Metadata Extraction Access | WordHoard is an application for the close reading and scholarly analysis of deeply tagged texts. |
XMLstarlet | Metadata Processing | A set of command line utilities (tools) to transform, query, validate, and edit XML documents and files |
XcorrSound | De-Duplication Quality Assurance | The xcorrSound package compares sound waves using cross correlation. |
Xpdf | Metadata Extraction Rendering | Open source PDF viewer that includes PDF information extractor and font analyzer |