Difference between revisions of "DROID (Digital Record Object Identification)"

From COPTR
Jump to navigation Jump to search
 
(3 intermediate revisions by 3 users not shown)
Line 1: Line 1:
{{Infobox_tool
+
{{Infobox tool
 
|purpose=DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats.
 
|purpose=DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats.
|image=
 
 
|homepage=http://digital-preservation.github.io/droid/
 
|homepage=http://digital-preservation.github.io/droid/
 
|license=BSD License
 
|license=BSD License
 
|platforms=Java 6 Standard Edition
 
|platforms=Java 6 Standard Edition
 +
|formats_in=PREMIS (Preservation Metadata Implementation Strategies)
 +
|function=File Format Identification, Metadata Extraction
 +
}}
 +
{{Infobox tool details
 +
|ohloh_id=droid
 
}}
 
}}
 
<!-- Delete the Categories that do not apply -->
 
[[Category:File Format Identification]]
 
[[Category:Metadata Extraction]]
 
 
 
 
= Description =
 
= Description =
 
DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies.  
 
DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies.  
  
 
DROID uses the PRONOM [http://www.nationalarchives.gov.uk/aboutapps/pronom/droid-signature-files.htm signature files] to perform format identification. Like PRONOM, it was [http://www.nationalarchives.gov.uk/information-management/manage-information/policy-process/digital-continuity/file-profiling-tool-droid/ developed by the National Archives of the UK]. Written in Java, XML.
 
DROID uses the PRONOM [http://www.nationalarchives.gov.uk/aboutapps/pronom/droid-signature-files.htm signature files] to perform format identification. Like PRONOM, it was [http://www.nationalarchives.gov.uk/information-management/manage-information/policy-process/digital-continuity/file-profiling-tool-droid/ developed by the National Archives of the UK]. Written in Java, XML.
 
+
 
== PRONOM ==
+
 
 +
=== PRONOM ===
 
The format information held in PRONOM is what powers [[DROID (Digital Record Object Identification)]]. Both are maintained by the [http://www.nationalarchives.gov.uk/ UK's National Archives].
 
The format information held in PRONOM is what powers [[DROID (Digital Record Object Identification)]]. Both are maintained by the [http://www.nationalarchives.gov.uk/ UK's National Archives].
  
Line 28: Line 27:
  
 
Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary.
 
Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary.
 
+
 
=== The PRONOM Web API ===
+
 
 +
===== The PRONOM Web API =====
 
The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this:  
 
The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this:  
  
Line 37: Line 37:
  
 
     http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml
 
     http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml
 
 
 
= User Experiences =
 
= User Experiences =
 +
* [https://www.dpconline.org/handbook/tool-demos/droid-demo Tool demo with videos] in the [https://www.dpconline.org/handbook DPC Handbook]
 
* [http://www.jisc.ac.uk/media/documents/programmes/preservation/daat_file_format_tools_report.pdf Digital Asset Assessment Tool - Assessment of file format testing tools].
 
* [http://www.jisc.ac.uk/media/documents/programmes/preservation/daat_file_format_tools_report.pdf Digital Asset Assessment Tool - Assessment of file format testing tools].
 
* Comparing how [[Apache Tika]] and DROID perform HTML identification: [http://britishlibrary.typepad.co.uk/webarchive/2014/07/how-much-of-the-uk-html-is-valid.html How much of the UK's HTML is valid?]
 
* Comparing how [[Apache Tika]] and DROID perform HTML identification: [http://britishlibrary.typepad.co.uk/webarchive/2014/07/how-much-of-the-uk-html-is-valid.html How much of the UK's HTML is valid?]
Line 47: Line 46:
  
 
= Development Activity =
 
= Development Activity =
 +
All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits
 +
 +
 +
=== Release Feed ===
 +
Below the last 3 release feeds:
 +
<rss max=3>https://github.com/digital-preservation/droid/releases.atom</rss>
 +
 
 +
 +
=== Activity Feed ===
 +
Below the last 5 commits:
 +
<rss max=5>https://github.com/digital-preservation/droid/commits/master.atom</rss>
  
<rss max=5>https://github.com/digital-preservation/droid/commits/master.atom</rss>
 
  
{{Infobox_tool_details
+
=== About Formats ===
|ohloh_id=droid
+
DROID do only read the format itself, not the spesification as such
}}
 

Latest revision as of 14:04, 17 August 2021


DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats.
Homepage:http://digital-preservation.github.io/droid/
License:BSD License
Platforms:Java 6 Standard Edition
Input Formats:PREMIS (Preservation Metadata Implementation Strategies)
Function:File Format Identification,Metadata Extraction
Appears in COW:Abrdn Digital Preservation Workflow, Creating a SIP from content downloaded from OneDrive (or other Cloud based source), Digital archiving workflow (high-level), Ingest and AIP creation, LAC Pre-Ingest Workflow, Workflow for ingesting digitized books into a digital archive


Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt66228f05a72d75_44721963


Description[edit]

DROID (Digital Record Object Identification) is a software tool developed to perform automated batch identification of file formats. DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies.

DROID uses the PRONOM signature files to perform format identification. Like PRONOM, it was developed by the National Archives of the UK. Written in Java, XML.


PRONOM[edit]

The format information held in PRONOM is what powers DROID (Digital Record Object Identification). Both are maintained by the UK's National Archives.

DROID downloads the latest signature files from PRONOM, and those are used to drive the identification process. See the PRONOM release notes.

A number of other tools and registries have been based around the PRONOM data. These include:

Although the information and website are made freely available under the Open Government License, the underlying software engine that powers PRONOM is proprietary.


The PRONOM Web API[edit]

The website is oriented towards manual browsing, but note that each PRONOM registry entry as a permalink, like this:

   http://apps.nationalarchives.gov.uk/pronom/fmt/579

and furthermore, by appending '.xml' to the URL for any entry, the data can be recovered as XML:

   http://apps.nationalarchives.gov.uk/pronom/fmt/579.xml

User Experiences[edit]

Development Activity[edit]

All development activity is visible on GitHub: http://github.com/digital-preservation/droid/commits


Release Feed[edit]

Below the last 3 release feeds:

2023-09-11 13:43:58
[tag:github.com,2008:Repository/4737996/droid-6.7.0 6.7.0 Release]
by sparkhi
2023-08-31 11:34:49
[tag:github.com,2008:Repository/4737996/droid-6.7.0-RC5 6.7.0 Release Candidate 5]
by sparkhi
2023-08-17 11:57:20
[tag:github.com,2008:Repository/4737996/droid-6.7.0-RC4 6.7.0 Release Candidate 4]
by sparkhi


Activity Feed[edit]

Below the last 5 commits:

2024-04-19 10:29:59
[tag:github.com,2008:Grit::Commit/c5127d390390b68b8c91b7ff9ad3be3b43a85fb8 Added tooltip text and removed the text prompt from config dialog (#1…]
by sparkhi https://github.com/sparkhi
2024-04-17 14:12:41
[tag:github.com,2008:Grit::Commit/8a4318c84c4525757e6ed3b59c91e7860bd673d0 Updated the version number in line with potential release (#1092)]
by sparkhi https://github.com/sparkhi
2024-04-17 07:36:43
[tag:github.com,2008:Grit::Commit/1fcae3c64cfb5189c1a667d581a7e9bf425b5f4b build(deps): bump com.sun.xml.ws:jaxws-ri from 4.0.1 to 4.0.2 (#1090)]
by dependabot https://github.com/dependabot
2024-04-16 07:59:12
[tag:github.com,2008:Grit::Commit/b9b5b08021cb15e7cf07c3a104b507dcf3e40788 build(deps): bump com.googlecode.maven-download-plugin:download-maven…]
by dependabot https://github.com/dependabot
2024-04-15 11:01:23
[tag:github.com,2008:Grit::Commit/f6e9b716fdca60d9cd1006f6f24919d7cecd9e26 build(deps): bump org.apache.maven.plugins:maven-javadoc-plugin (#1047)]
by dependabot https://github.com/dependabot


About Formats[edit]

DROID do only read the format itself, not the spesification as such