Difference between revisions of "DROID (Digital Record Object Identification)"
(User Experiences: Used in KOST-Val) |
|||
(7 intermediate revisions by 5 users not shown) | |||
Line 1: | Line 1: | ||
− | {{ | + | {{Infobox tool |
|purpose=DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. | |purpose=DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. | ||
− | |||
|homepage=http://digital-preservation.github.io/droid/ | |homepage=http://digital-preservation.github.io/droid/ | ||
|license=BSD License | |license=BSD License | ||
|platforms=Java 6 Standard Edition | |platforms=Java 6 Standard Edition | ||
+ | |formats_in=PREMIS (Preservation Metadata Implementation Strategies) | ||
+ | |function=File Format Identification, Metadata Extraction | ||
+ | }} | ||
+ | {{Infobox tool details | ||
+ | |ohloh_id=droid | ||
}} | }} | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
= Description = | = Description = | ||
DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies. | DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies. | ||
DROID uses the PRONOM [http://www.nationalarchives.gov.uk/aboutapps/pronom/droid-signature-files.htm signature files] to perform format identification. Like PRONOM, it was [http://www.nationalarchives.gov.uk/information-management/manage-information/policy-process/digital-continuity/file-profiling-tool-droid/ developed by the National Archives of the UK]. Written in Java, XML. | DROID uses the PRONOM [http://www.nationalarchives.gov.uk/aboutapps/pronom/droid-signature-files.htm signature files] to perform format identification. Like PRONOM, it was [http://www.nationalarchives.gov.uk/information-management/manage-information/policy-process/digital-continuity/file-profiling-tool-droid/ developed by the National Archives of the UK]. Written in Java, XML. | ||
− | + | ||
− | == PRONOM == | + | |
+ | === PRONOM === | ||
The format information held in PRONOM is what powers [[DROID (Digital Record Object Identification)]]. Both are maintained by the [http://www.nationalarchives.gov.uk/ UK's National Archives]. | The format information held in PRONOM is what powers [[DROID (Digital Record Object Identification)]]. Both are maintained by the [http://www.nationalarchives.gov.uk/ UK's National Archives]. | ||
Line 25: | Line 24: | ||
* [[Nanite]] and [[Fido]] identification tools | * [[Nanite]] and [[Fido]] identification tools | ||
+ | * [[Siegfried]] identification tool | ||
Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary. | Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary. | ||
− | + | ||
− | === The PRONOM Web API === | + | |
+ | ===== The PRONOM Web API ===== | ||
The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this: | The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this: | ||
Line 36: | Line 37: | ||
http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml | http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml | ||
− | |||
− | |||
= User Experiences = | = User Experiences = | ||
+ | * [https://www.dpconline.org/handbook/tool-demos/droid-demo Tool demo with videos] in the [https://www.dpconline.org/handbook DPC Handbook] | ||
* [http://www.jisc.ac.uk/media/documents/programmes/preservation/daat_file_format_tools_report.pdf Digital Asset Assessment Tool - Assessment of file format testing tools]. | * [http://www.jisc.ac.uk/media/documents/programmes/preservation/daat_file_format_tools_report.pdf Digital Asset Assessment Tool - Assessment of file format testing tools]. | ||
* Comparing how [[Apache Tika]] and DROID perform HTML identification: [http://britishlibrary.typepad.co.uk/webarchive/2014/07/how-much-of-the-uk-html-is-valid.html How much of the UK's HTML is valid?] | * Comparing how [[Apache Tika]] and DROID perform HTML identification: [http://britishlibrary.typepad.co.uk/webarchive/2014/07/how-much-of-the-uk-html-is-valid.html How much of the UK's HTML is valid?] | ||
* [http://openplanetsfoundation.org/blogs/2014-06-03-analysis-engine-droid-csv-export An Analysis Engine for the DROID CSV Export] | * [http://openplanetsfoundation.org/blogs/2014-06-03-analysis-engine-droid-csv-export An Analysis Engine for the DROID CSV Export] | ||
− | * Used in [[KOST-Val]] for the file format identification. For performance and granularity reasons an own SignatureFile is used instead of the official PRONOM registry. | + | * '''KOST-CECO:''' Used in [[KOST-Val]] for the file format identification. For performance and granularity reasons an own SignatureFile is used instead of the official PRONOM registry. |
+ | * '''FITS (File Information Tool Set):''' Used in [[FITS (File Information Tool Set)|FITS]] | ||
= Development Activity = | = Development Activity = | ||
+ | All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits | ||
+ | |||
+ | |||
+ | === Release Feed === | ||
+ | Below the last 3 release feeds: | ||
+ | <rss max=3>https://github.com/digital-preservation/droid/releases.atom</rss> | ||
+ | |||
+ | |||
+ | === Activity Feed === | ||
+ | Below the last 5 commits: | ||
+ | <rss max=5>https://github.com/digital-preservation/droid/commits/master.atom</rss> | ||
+ | |||
− | + | === About Formats === | |
− | + | DROID do only read the format itself, not the spesification as such | |
− |
Latest revision as of 14:04, 17 August 2021
Description[edit]
DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies.
DROID uses the PRONOM signature files to perform format identification. Like PRONOM, it was developed by the National Archives of the UK. Written in Java, XML.
PRONOM[edit]
The format information held in PRONOM is what powers DROID (Digital Record Object Identification). Both are maintained by the UK's National Archives.
DROID downloads the latest signature files from PRONOM, and those are used to drive the identification process. See the PRONOM release notes.
A number of other tools and registries have been based around the PRONOM data. These include:
Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary.
The PRONOM Web API[edit]
The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this:
http://apps.nationalarchives.gov.uk/pronom/fmt/579
and furthermore, by appending '.xml' to the URL for any entry, the data can be recovered as XML:
http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml
User Experiences[edit]
- Tool demo with videos in the DPC Handbook
- Digital Asset Assessment Tool - Assessment of file format testing tools.
- Comparing how Apache Tika and DROID perform HTML identification: How much of the UK's HTML is valid?
- An Analysis Engine for the DROID CSV Export
- KOST-CECO: Used in KOST-Val for the file format identification. For performance and granularity reasons an own SignatureFile is used instead of the official PRONOM registry.
- FITS (File Information Tool Set): Used in FITS
Development Activity[edit]
All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits
Release Feed[edit]
Below the last 3 release feeds:
- 2024-06-26 14:04:16
- [tag:github.com,2008:Repository/4737996/droid-6.8.0 6.8.0]
- by sparkhi
- 2024-06-24 14:32:12
- [tag:github.com,2008:Repository/4737996/droid-6.8.0-RC4 6.8.0-RC4]
- by sparkhi
- 2024-06-16 20:03:44
- [tag:github.com,2008:Repository/4737996/droid-6.8.0-RC3 6.8.0-RC3]
- by sparkhi
Activity Feed[edit]
Below the last 5 commits:
- 2024-11-13 14:17:04
- [tag:github.com,2008:Grit::Commit/423227991e7033955471dddeb34509ff2ecc2acc dependency check plugin updated (#1137)]
- by sparkhi https://github.com/sparkhi
- 2024-10-07 14:04:47
- [tag:github.com,2008:Grit::Commit/f2ecabae684bb3854aa9f11a803cb54bc4b9bebb build(deps): bump com.github.junrar:junrar from 7.5.4 to 7.5.5 (#1061)]
- by dependabot https://github.com/dependabot
- 2024-06-26 15:15:20
- [tag:github.com,2008:Grit::Commit/7a0f5a20d4c4a93d19ea986a8ed0d71d867725de 6.8.x release (#1119)]
- by sparkhi https://github.com/sparkhi
- 2024-05-16 08:15:13
- [tag:github.com,2008:Grit::Commit/3a7f5bff633341e28e6e0ee2bb66100a8e083fef Java 17 JRE bundled for the Windows build (#1108)]
- by sparkhi https://github.com/sparkhi
- 2024-05-10 12:57:27
- [tag:github.com,2008:Grit::Commit/0c72b1dced0783b97537c628c58e9eaebfdb92f7 Include new pronom signature files (#1103)]
- by sparkhi https://github.com/sparkhi
About Formats[edit]
DROID do only read the format itself, not the spesification as such