Difference between revisions of "DROID (Digital Record Object Identification)"

(Added links to Function and/or Content Type)
Line 37: Line 37:
     http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml
 = User Experiences =
+* [https://www.dpconline.org/handbook/tool-demos/droid-demo Tool demo with videos] in the [https://www.dpconline.org/handbook DPC Handbook]
 * [http://www.jisc.ac.uk/media/documents/programmes/preservation/daat_file_format_tools_report.pdf Digital Asset Assessment Tool - Assessment of file format testing tools].
 * Comparing how [[Apache Tika]] and DROID perform HTML identification: [http://britishlibrary.typepad.co.uk/webarchive/2014/07/how-much-of-the-uk-html-is-valid.html How much of the UK's HTML is valid?]
Line 42: Line 43:
 * '''KOST-CECO:''' Used in [[KOST-Val]] for the file format identification. For performance and granularity reasons an own SignatureFile is used instead of the official PRONOM registry.
 * '''FITS (File Information Tool Set):''' Used in [[FITS (File Information Tool Set)|FITS]]
+
 
 = Development Activity =
 All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits

Revision as of 13:36, 9 June 2021



DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats.
Homepage: http://digital-preservation.github.io/droid/
License: BSD License
Platforms: Java 6 Standard Edition
Function: File Format Identification, Metadata Extraction
Appears in COW: Creating a SIP from content downloaded from OneDrive (or other Cloud based source), Digital archiving workflow (high-level), LAC Pre-Ingest Workflow, Workflow for ingesting digitized books into a digital archive



Description

DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies.

DROID uses the PRONOM signature files to perform format identification. Like PRONOM, it was developed by the National Archives of the UK. It is written in Java and XML.


PRONOM

The format information held in PRONOM is what powers DROID (Digital Record Object Identification). Both are maintained by the UK's National Archives.

DROID downloads the latest signature files from PRONOM, and those are used to drive the identification process. See the PRONOM release notes.
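
To make the idea of signature-driven identification concrete, here is a minimal sketch of matching a file's leading bytes against a table of byte signatures. It is not DROID's algorithm and does not read a PRONOM signature file: the table `TOY_SIGNATURES` and the functions `identify` and `identify_file` are hypothetical names invented for this illustration, and the three entries are well-known magic numbers chosen only as examples.

```python
from typing import Optional

# Toy signature table: (format label, byte offset, magic bytes).
# Illustrative only; these entries are NOT taken from a PRONOM signature file.
TOY_SIGNATURES = [
    ("PNG image", 0, b"\x89PNG\r\n\x1a\n"),
    ("PDF document", 0, b"%PDF-"),
    ("GIF image", 0, b"GIF8"),
]


def identify(data: bytes) -> Optional[str]:
    """Return the label of the first signature found at its offset, or None."""
    for label, offset, magic in TOY_SIGNATURES:
        if data[offset:offset + len(magic)] == magic:
            return label
    return None


def identify_file(path: str, read_bytes: int = 64) -> Optional[str]:
    """Identify a file on disk by reading only its first few bytes."""
    with open(path, "rb") as handle:
        return identify(handle.read(read_bytes))
```

Real signature files describe far richer patterns than a fixed prefix, so this sketch only shows the general shape of the lookup: the signature data is kept separate from the matching code, which is what allows the data to be updated without changing the tool.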

A number of other tools and registries have been based around the PRONOM data.

Although the information and website are made freely available under the Open Government Licence, the underlying software engine that powers PRONOM is proprietary.


The PRONOM Web API

The website is oriented towards manual browsing, but note that each PRONOM registry entry has a permalink, like this:

   http://apps.nationalarchives.gov.uk/pronom/fmt/579

Furthermore, appending '.xml' to the URL of any entry returns its data as XML:

   http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml
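
The URL pattern above can be sketched in a few lines of Python using only the standard library. This assumes the permalink scheme still behaves as described here; the names `PRONOM_BASE`, `pronom_xml_url` and `fetch_pronom_entry` are hypothetical helpers written for this example, not part of any PRONOM or DROID API.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Base of the permalinks shown above (assumed unchanged).
PRONOM_BASE = "http://apps.nationalarchives.gov.uk/pronom/"


def pronom_xml_url(entry_id: str) -> str:
    """Build the XML URL for a registry entry such as 'fmt/579'."""
    return PRONOM_BASE + entry_id.strip("/") + ".xml"


def fetch_pronom_entry(entry_id: str, timeout: float = 10.0) -> ET.Element:
    """Download one entry and return the root element of the parsed XML."""
    url = pronom_xml_url(entry_id)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return ET.fromstring(response.read())
```

Calling `fetch_pronom_entry("fmt/579")` requires network access. The sketch deliberately makes no assumption about the element names inside the returned document; inspect the root element before relying on any particular structure.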

User Experiences

Development Activity

All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits


Release Feed

The last 3 releases:

2022-01-07 15:49:54
[tag:github.com,2008:Repository/4737996/droid-6.5.2 droid-6.5.2]
by sparkhi
2020-05-01 11:04:35
[tag:github.com,2008:Repository/4737996/droid-6.5 6.5 Release]
by jcharlet
2020-03-27 17:04:19
[tag:github.com,2008:Repository/4737996/droid-6.5-RC3 6.5 Release Candidate 3]
by jcharlet


Activity Feed

The last 5 commits:

2022-06-09 09:38:15
[tag:github.com,2008:Grit::Commit/9cd83c80773821b1dee0a2a7d912dffbe8f26209 build(deps): bump mockito-core from 4.5.1 to 4.6.1 (#787)]
by dependabot https://github.com/dependabot
2022-06-09 09:38:02
[tag:github.com,2008:Grit::Commit/88649ff6773fc713a54ab62ba2551c88f9187712 build(deps): bump maven-scm-api from 1.12.2 to 1.13.0 (#788)]
by dependabot https://github.com/dependabot
2022-06-09 09:37:41
[tag:github.com,2008:Grit::Commit/f52a122972fdfbb65c40528ea931d4fa614cded2 build(deps): bump maven-surefire-report-plugin from 3.0.0-M6 to 3.0.0…]
by dependabot https://github.com/dependabot
2022-06-08 13:05:11
[tag:github.com,2008:Grit::Commit/7e1d2b5795fac45c6efb1f0773c57f7afa00ba39 build(deps): bump maven-project-info-reports-plugin from 3.1.2 to 3.3…]
by dependabot https://github.com/dependabot
2022-06-08 13:04:46
[tag:github.com,2008:Grit::Commit/7bc727105c8f2f97e1abf605ddfc9a4cb6c78e49 build(deps): bump maven-site-plugin from 3.10.0 to 3.12.0 (#768)]
by dependabot https://github.com/dependabot