Difference between revisions of "Nanite"

From COPTR
Jump to navigation Jump to search
(updated homepage URL)
 
Line 1: Line 1:
<!-- Use the structure provided in this template, do not change it! -->
+
{{Infobox tool
 
 
{{Infobox_tool
 
 
|purpose=A friendly swarm of format-identifying robots
 
|purpose=A friendly swarm of format-identifying robots
|image={{PAGENAMEE}}.png
 
 
|homepage=https://github.com/openpreserve/nanite
 
|homepage=https://github.com/openpreserve/nanite
 
|license=Apache 2.0
 
|license=Apache 2.0
|platforms=
+
|function=File Format Identification, Metadata Extraction
 +
}}
 +
{{Infobox tool details
 +
|ohloh_id=nanite-id
 
}}
 
}}
<!-- Note that to use the image field, you should leave the value as {{PAGENAMEE}}.png (or similar) and upload a copy of the image. Hot-linking is not supported. If you don't want an image, just remove that line. -->
 
 
<!-- Add one or more categories to describe the function of the tool, such as:
 
[[Category:Metadata Extraction]] or [[Category:Preservation System]] or [[Category:Backup]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left) -->
 
[[Category:File Format Identification]]
 
[[Category:Metadata Extraction]]
 
 
<!-- Add relevant categories to describe the content type that the tool addresses, such as:
 
[[Category:Audio]] or [[Category:Document]] or [[Category:Research Data]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left). If the tool works on any content type, do not add a category. -->
 
 
 
 
== Description ==
 
== Description ==
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
Line 34: Line 21:
  
 
<!-- Add the Ohloh.com ID for the tool, if known. -->
 
<!-- Add the Ohloh.com ID for the tool, if known. -->
{{Infobox_tool_details
 
|ohloh_id=nanite-id
 
}}
 

Latest revision as of 16:09, 22 April 2021




A friendly swarm of format-identifying robots
Homepage:https://github.com/openpreserve/nanite
License:Apache 2.0
Function:File Format Identification,Metadata Extraction



Description[edit]

Nanite is simply a heavily cut-down and slightly modified version of DROID. It was built be extracting out the core of the DROID6 identification engine, stripping away all other complexities and external dependencies (GUI, databases, web services, etc.). Even the signature files are packaged within Nanite rather than being downloaded automatically, so that they can be released under separate versions of the tool and used directly without requiring internet access.

The main purpose of doing so was to ensure that DROID could be run at scale as a Hadoop Map-Reduce job, which is far easier when software is made available in self-contained packages. The Nanite codebase also include example Map-Reduce code that can be used to test and compare identification tools. Web Archive Discovery uses Nanite to help identify web archive content.

User Experiences[edit]

Development Activity[edit]