Semantic search

Tool | Function | Purpose
GNU libextractor | Metadata Extraction | GNU libextractor is a library used to extract metadata from files of arbitrary type.
Geosetter | Metadata Extraction, Metadata Processing, Personal Archiving | A tool that sets coordinates and edits all kinds of embedded image metadata.
GetID3() | Metadata Extraction | Extracts technical and embedded descriptive metadata from common multimedia file formats.
Goobi | OCR, Planning, Quality Assurance, Workflow | Workflow Management Tool
Greens | Persistent Identification | ARK identifier minter and resolver
Greenstone | Access, Metadata Processing | A suite of software for building and distributing digital library collections
Gumshoe | Forensic, Metadata Processing | Search interface for metadata extracted from forensic disk images.
Gvfs-info | File Format Identification | gvfs-info prints information about files and directories.
ICA-AtoM | Metadata Processing | ICA-AtoM allows organisations to create standards-based descriptions of their archival holdings and subsequently publish them to the Web.
IMacros | Quality Assurance, Web Capture | iMacros makes it easy to test web-based applications.
IText | Metadata Extraction | PDF library for manipulation, content extraction and creation
ImageVerifier | Validation, Metadata Processing, Quality Assurance | ImageVerifier (IV for short) traverses a hierarchy of folders looking for image files to verify. It can verify TIFFs, JPEGs, PSDs, DNGs, and non-DNG raws (e.g., NEF, CR2).
InBoxer | Access, Metadata Extraction, Metadata Processing | InBoxer is a next-generation email archiving, IM archiving, e-discovery, and policy management system.
Index.dat Analyzer v2.5 | Forensic, Metadata Extraction | Index.dat Analyzer is a tool to view, examine and delete contents of index.dat files.
JHOVE (Harvard Object Validation Environment) | Validation, Encryption Detection, File Format Identification, Metadata Extraction | JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects.
JHOVE2 | Validation, Encryption Detection, File Format Identification, Metadata Extraction | JHOVE2 allows data curators to characterise the digital objects in their repositories.
JWAT | Validation, File Format Migration, Metadata Extraction | Java Web Archive Toolkit
JabRef | Annotation, Metadata Processing | Reference and bibliographic data manager
Jp2StructCheck | Validation, Metadata Extraction, Quality Assurance | Simple JP2 file structure checker
Jpylyzer | Validation, Metadata Extraction, Quality Assurance | JP2 validation and properties extraction
KOST-Simy | Validation, Quality Assurance | The KOST-Simy application is used to compare images.
KOST-Val | Validation, Quality Assurance | KOST-Val is an open source validator for different file formats (TIFF, SIARD, PDF/A, JP2, JPEG) and Submission Information Packages (SIPs).
Karen's Directory Printer | Metadata Processing | Karen's Directory Printer can print the name of every file on a drive, along with the file's size, date and time of last modification, and attributes (Read-Only, Hidden, System and Archive).
Keith Humphreys' PhraseRate | Metadata Extraction | PhraseRate is a program, developed by Keith Humphreys, for extracting a set of meaningful, attractive keywords and key phrases from a web page describing the content of that page.
KoLibRI (Kopal Library for Retrieval and Ingest) | Metadata Processing, Pre-ingest SIP Builder | The kopal Library for Retrieval and Ingest (koLibRI) represents a library of Java tools that have been developed for the interaction with the DIAS system of IBM within the kopal project.
Libmagic-dev | File Format Identification | This library can be used to classify files according to magic number tests.
Library (xklb) | File Management, Quality Assurance, Web Capture | Media indexing multi-tool with more than 70 CLI subcommands
Libsharedmime | File Format Identification | This is an implementation for libsharedmime.
Limb Processing | Metadata Processing, OCR | Software for processing, enhancing and converting cultural heritage into digital cultural heritage
Lingfo | File Format Migration, Metadata Extraction | Lingfo provides a library for developers to use to extract information from Microsoft Excel spreadsheet files.
MD5Checker | Fixity | Lightweight Windows Desktop application to create and check MD5 Digests for one or several files.
METS API | Metadata Processing | The METS API is a Java API designed to aid developers in the processing and assembly of METS Documents.
METS Navigator | Metadata Processing, Rendering | METS-based system for displaying and navigating sets of page images or other multi-part digital objects.
METS Reader Writer | Metadata Extraction, Metadata Processing | Python library for processing and outputting METS/PREMIS XML according to the Archivematica METS profile.
MP3::Tag | Metadata Extraction | MP3::Tag is a module for reading tags of MP3 audio files.
MP3val | Validation, Quality Assurance | MP3val is a small, high-speed, free software tool for checking MPEG audio files' integrity.
MailStore Home | Discovery, File Management, Metadata Extraction, Metadata Processing | Unifies your private emails into one searchable, platform-independent repository
Matchbox Tool | De-Duplication, Quality Assurance | Matchbox: Duplicate detection tool for digital document collections.
Md5deep and hashdeep | Fixity | md5deep is a set of programs to compute MD5, SHA-1, SHA-256, Tiger, or Whirlpool message digests on an arbitrary number of files. hashdeep is a program to compute, match, and audit hashsets.
Md5sum Unix command | Fixity | md5sum computes a 128-bit checksum (or fingerprint or message-digest) for each specified file.
Md5summer | Fixity | MD5summer is an application for Microsoft Windows 9x, NT, ME, 2000 and XP which generates and verifies md5 checksums.
Mdqc | Metadata Extraction, Metadata Processing, Quality Assurance | Tool for managing and comparing digital asset metadata
MediaConch | Validation, File Format Identification, Policy | MediaConch is file validation software.
MediaInfo | Metadata Extraction | Supplies technical and tag information about a video or audio file.
Metadata Extraction Tool | Metadata Extraction | Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
Metadata Interrogator | Metadata Extraction | The Metadata Interrogator is a standalone, offline GUI tool for extracting and analysing metadata from a wide variety of file formats.
Metadata transformer | Metadata Extraction | A simple tool for creating new CSV and HTML reports based on the metadata files generated by the Data Accessioner
Metadata++ | Metadata Extraction, Metadata Processing, Personal Archiving | Freeware tool to view, edit, modify, extract, copy metadata of various formats.
Metadata2Go | Metadata Extraction | Web-based EXIF data viewer
Mets-bag-checker | Fixity, Validation | METS Bag checker is a simple Python tool to check the validity of METS Information Packages (XML validity, completeness, Data Objects fixity, absence of unreferenced files).
Minimum Preservation Tool | Fixity, Preservation System | The Minimum Preservation Tool (MPT) can be used to create an interim preservation storage environment for files awaiting preservation in a longer term repository solution. It supports checksum generation, fixity checking, and replication across two or more storage nodes.
NARA File Analyzer and Metadata Harvester | Fixity, File Format Identification, Metadata Extraction | NARA File Analyzer and Metadata Harvester allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories.
NARA Video Frame Analyzer | Metadata Extraction, Quality Assurance | NARA Video Frame Analyzer analyzes technical properties of individual frames of a video file in order to detect quality issues within digitized video files.
NESSTAR | Metadata Processing, Service | The Nesstar suite is an online publishing platform for organisations wishing to share datasets both internally and with the wider web.
NOID (Ruby) | Persistent Identification | A version of NOID in Ruby
Namalysator | Validation, Metadata Processing, Quality Assurance | Tool for METS/ALTO validation and quality control
Nanite | File Format Identification, Metadata Extraction | A friendly swarm of format-identifying robots
Nice Opaque Identifiers (NOID) | Persistent Identification | Identifiers management tool to generate, bind and resolve different kinds of identifiers
Noids | Persistent Identification | Identifiers management tool
Nuclear Processor | Dependency Analysis | Process/module manager for Windows, with features such as Kill/Resume/Suspend thread of a process and unload DLL files
NumaHOP | OCR, Quality Assurance | Platform for digitization project management
ODF Validator | Validation, Metadata Extraction | ODF Validator is a tool that validates OpenDocument files and checks them for certain conformance criteria.
Officeparser.py | File Format Identification, Metadata Extraction | officeparser.py is a Python script that parses the format of OLE compound documents used by Microsoft Office applications.
Ohcount | File Format Identification | Analyses plain text files, looking for code (scripting languages etc.)
Omeka Identity plugin | Persistent Identification | Plugin for Omeka to assign ARK identifiers
OpenJPEG | File Format Migration, Metadata Extraction | The OpenJPEG library is an open-source JPEG 2000 codec written in C.
OpenRefine | Metadata Processing | For dealing with messy data, cleaning it and transforming it
OpenWMS (Workflow Management System for Digital Objects) | Metadata Processing | The OpenWMS is a platform-independent, open source, web-accessible system that can be used as a standalone application or integrated with other repository architectures by a wide range of organizations.
PAIRTREE Library | Metadata Processing | Software library that supports the mapping between identifiers and filepaths according to the Pairtree Curation Microservices Specification.
PDF Tools (by Didier Stevens) | Validation, Dependency Analysis, Metadata Extraction | Tools for parsing and analysing PDF documents
PDFTron PDF-A Manager | Validation, File Format Migration | PDF/A Manager is a PDF/A (ISO 19005) validation and conversion software.
PET (PERICLES Extraction Tool) | Dependency Analysis, Metadata Extraction | A tool to capture contextual information in a sheer curation scenario
PREMIS Utility | Metadata Processing | The PREMIS Utility is a graphical program used to generate PREMIS metadata records for use in digital preservation systems and digital asset management systems in JSON and XML format, and attempts to cover gaps not programmatically generated by system logs.
PRONOM Signature Development Utility | File Format Identification | Outputs DROID-compatible file format signature files using PRONOM syntax
Package Handler | Validation, File Management, Metadata Processing, Personal Archiving, Appraisal | View, create, edit, and validate Swiss archival packages
Pagelyzer | Metadata Extraction, Quality Assurance | Suite of tools for detecting changes in web pages and their rendering
PdfaPilot | Validation, File Format Migration, Metadata Extraction | pdfaPilot: Conversion of documents and emails into robust, searchable PDF or PDF/A files
Pdfcpu | Validation, Metadata Extraction | A Go library and command line tool for PDF processing incl. validation
Pdftk | Metadata Extraction, Repair | PDF manipulation tool
Peepdf | Metadata Extraction | peepdf is a Python tool to explore PDF files in order to find out if the file can be harmful or not.
PiM (PREMIS in METS) Toolbox | Validation, File Format Migration, Metadata Processing | PREMIS in METS Toolbox was developed to support the implementation of PREMIS in the METS container format.
Pre-Ingest Tool | Validation, Metadata Extraction, Metadata Processing | A tool for generating an OAIS SIP for digital preservation. It produces a METS document that contains metadata for digital preservation.
Premissh | Metadata Extraction, Metadata Processing | A simple prototype BASH script for automatically creating PREMIS XML from a file, using DROID, BASH and XSLT.
Puremagic | File Format Identification | Puremagic is a cross-platform pure Python module that will identify a file based on its magic numbers
Python DPX validator | Validation | A lightweight DPX file format validator.
Python XMP Toolkit | Metadata Extraction, Metadata Processing | Library for working with XMP metadata, as well as reading/writing XMP metadata stored in many different file formats
Python checkm package | Fixity | This is a Python implementation of the checkm specification.
QCTools | Quality Assurance | Digitized analog video analysis
Qpdf | Decryption, Metadata Extraction | QPDF is a command-line program that does structural, content-preserving transformations on PDF files
RATOM | Discovery, Metadata Extraction, Appraisal | Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks
RE (Rename Expert) | File Management, Metadata Processing | Controlled renaming of file collections
RODA-In | Fixity, Transfer, Appraisal | The tool creates SIPs from files and folders available on the local file system.
ReACT (Resource Audit and Comparison Tool) | File Management, Quality Assurance | A file audit and comparison tool using Microsoft Excel and VBA.
ReDBox | Managing Active Research Data, Metadata Processing | ReDBox and Mint are two complementary applications designed to create, store, and provide access to research metadata.
Rhash | Fixity | RHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files.
Riprap | Fixity | Riprap is a PREMIS-compliant fixity checking microservice.
Rosetta | Access, File Format Migration, Metadata Processing, Preservation System | Ex Libris Rosetta enables institutions to preserve and provide access to the collections in their care.
SAFE Archive Audit System | Fixity, Storage | Policy-based replication and auditing of LOCKSS networks.
SIARD-VAL | Validation, Quality Assurance | SIARD-Val is an open source validator for SIARD files.
SIARDexcerpt | Access, Quality Assurance | SIARDexcerpt is a Java-based application that searches and extracts individual records of SIARD files.