File Format Migration

Jump to navigation Jump to search
Function definition: Tools that support the transformation of data from one file format to another.
Lifecycle stage: Preservation Action

Tools for this function

AccessToSiardA collection of scripts to automatically convert MS Access files to the SIARD format.
Aid4MailFor migrating or normalizing email formats
AntiwordAntiword is a free MS Word reader for Linux and RISC OS.
Apache PDFBoxJAVA PDF library for creation, manipulation, validation and content extraction of PDF documents
Apache POI - the Java API for Microsoft DocumentsThe Apache POI Project's mission is to create and maintain Java APIs for manipulating various file formats based upon the Office Open XML standards (OOXML) and Microsoft's OLE 2 Compound Document format (OLE2).
ArchivematicaArchivematica is a digital preservation system that automates the process of preparing digital objects for ingest into a repository and an access system
Audio/Video to WAV ConverterThis tool converts audio and video files to WAV format.
CDS ConvertCDS Convert is a suite of tools that allow conversion of documents, presentations and images between different software formats.
CHRONOSDatabase Retirement, Partial and Ongoing Database Archiving, Application Retirement.
CSV export form for Microsoft AccessA Microsoft Access form to export all database tables to interoperable CSV files.
CSV2SIARDA tool to create SIARD containers from CSV files.
CalibreAn e-book management tool, including viewer, migration, and file conversion features among others.
Catdoc & xls2csvcatdoc is a program that reads one or more Microsoft Word files and outputs text to standard output.
CloudCompareCloudCompare is a tool for editing and processing 3D point clouds and triangular meshes.
ConverseenA GUI for ImageMagick supporting mass operations
DANS DBFDANS DBF Library is a Java library for reading and writing xBase database files.
DANS MIXEDMigration to Intermediate XML for Electronic Data.
DBPTK DeveloperDBPTK Developer - library and command-line tool for exection of database preservation actions
DBeaverMulti-platform database tool that supports migration and data management
DIMAGA software suite supporting archives with preservation of digital information for eternity
DMC (DBpoweramp Music Converter)dBpoweramp Music Converter (dMC) is an audio conversion tool.
DNSDNS - DA NRW Software Suite
Db-preservation-toolkitEnables conversion between database formats or dumping from live database systems for the purposes of preservation.
DeepArcIntended for preserving web sites from the back-end, this is a database-to-XML curation tool.
DiPS (Digital Preservation Solution)DiPS (OAIS compliant Digital Preservation Solution)
Digital Preservation RecorderDigital Preservation Recorder (DPR) is free and open source software developed by the National Archives of Australia to aid in the long term preservation of digital records.
DocMorph: Electronic Document ConversionThe U.S. National Library of Medicine's (NLM) document conversion tools make the exchange and use of biomedical library electronic information easier for librarians, library users, and the general public
Docuteam cosmosdocuteam cosmos is a comprehensive, modular software solution for the operation of digital long-term archives based on the OAIS standard (Open Archival Information System, ISO 14721:2012).
EXIF to DC XML normaliserExtract EXIF data and normalise it to DC XML.
Easy CD-DA ExtractorEasy CD-DA Extractor is CD Ripper, Music Converter, Audio Converter, Metadata Editor, and CD/DVD burning software.
EmailchemyConverts proprietary emails to standard portable formats
Exact Audio CopyExact Audio Copy is an audio grabber for audio CDs using standard CD and DVD-ROM drives on Windows only.
FFAStransTask automation engine, mostly used in audio and video visual content management.
FFmpegFFmpeg is a complete, cross-platform solution to record, convert and stream audio and video.
FilestarUniversal file converter for 900+ file types.
GrokJPEG 2000 SDK, includes encoder/decoder
HandBrakeConverts video data into widely accepted formats.
ImageMagickImageMagick® is a software suite to create, edit, compose, or convert bitmap images.
Ingestringestr is a command-line application that allows ingesting or copying data from any source into any destination database.
JJ2000Pure Java implementation of a JPEG2000 decoder
JWATJava Web Archive Toolkit
KakaduJPEG 2000 SDK, includes encoder/decoder
LegacyFileConverterConverts document formats to RTF
LibreofficeAn office suite with command line options for PDF/A conversions
LingfoLingfo provides a library for developers to use to extract information from Microsoft Excel spreadsheet files.
LuraDocument PDF CompressorLuraDocument PDF Compressor is a document conversion engine.
MDB/ACCDB ViewerMDB Viewer opens Microsoft Access 1997-2013 databases on your Macintosh, and views and exports all tables in Access databases.
MIXED (Migration to Intermediate XML for Electronic Data)MIXED (Migration to Intermediate XML for Electronic Data) is a web service that converts tabular data files such as spreadsheets and databases to the Standard Data Format for Preservation (SDFP), a supplier-independent XML format.
MPG321mpg321 is a command-line mp3 player. mpg321 is used for frontends, as an mp3 player and as an mp3 to wave file decoder.
MPP ViewerMPP Viewer is a viewer for Microsoft Project files
MSIL Disassembler (Ildasm.exe)The MSIL Disassembler is a companion tool to the MSIL Assembler (Ilasm.
Nitro ProA PDF handling tool including PDF/A
Open 3 is the leading open-source office software suite for word processing, spreadsheets, presentations, graphics, databases and more.
Open Video ConverterThis tool is for video conversion, splitting and editing.
OpenJPEGThe OpenJPEG library is an open-source JPEG 2000 codec written in C language.
OpenXML/ODF Translator Add-in for OfficeThe goal for this project is to provide translators to allow for interoperability between applications based on ODF (OpenDocument) 1.
Oracle Outside In TechnologyOutside In Technology is a suite of software development kits (SDKs) that provides developers with a comprehensive solution to access, transform and control the contents of over 500 unstructured file formats.
PDFTron PDF-A ManagerPDF/A Manager is a PDF/A (ISO 19005) validation and conversion software.
PandocA universal converter that converts files from one markup format into another
PdfaPilotpdfaPilot: Conversion of documents and emails into robust, searchable PDF or PDF/A files
PiM (PREMIS in METS) ToolboxPREMIS in METS Toolbox was developed to support the implementation of PREMIS in the METS container format.
QGISQGIS is a Free and Open Source GIS application that supports a wide range of raster and vector spatial data types.
RODA DBMLMigrates databases to an XML schema, DBML. Can then provide access by dumping DBML to MySQL and showing it in phpMyAdmin.
RosettaEx Libris Rosetta enables institutions to preserve and provide access to the collections in their care.
SIARD SuiteSIARD Suite is a freeware tool for the conversion of contents of relations databases into the SIARD format.
SPARQLing Unicorn QGIS PluginPlugin for QGIS. Fetches data from Wikidata and other Linked Data SPARQL endpoints and adds a new layer in a QGIS project. Just insert a SPARQL query for Geo-Items and get a new vector layer into QGIS.
Smithsonian CookProcessing of 3D model, mesh, and texture data including the option to define custom processing workflows, where a set of files is processed by multiple tools.
Ssconvertssconvert is a command line utility to convert spreadsheet files between various spreadsheet file formats.
Switch Audio File ConverterSwitch is a universal audio converter that supports a wide range of formats.
TOMES (Transforming Online Mail with Embedded Semantics)A package of open source tools for handling the preservation of government email records
UniConvertora Universal vector graphics format translator
WMDecodeWMDecode is used for extracting files from winmail.
WarcitWarcit is a command-line tool that converts directories (including nested directories), files (including HTML or other web assets and data files) and ZIP files to Web Archives (WARC).
WarctoolsCommand line tools and libraries for handling and manipulating WARC files (and HTTP contents)
XMediaRecodeA GUI platform for video conversion and cutting.
XenaDetecting the file formats of digital objects; converting digital objects into open formats for preservation.
XnConvertCross-platform batch image converter