Difference between revisions of "Metadata Extraction Tool"

From COPTR
Jump to navigation Jump to search
(Trial import from script.)
 
(Changing link to the code from Sourceforge to Github)
 
(8 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{Infobox_tool
+
{{Infobox tool
|purpose=The Metadata Extraction tool automatically extracts preservation-related metadata from digital files and outputs that metadata in XML for use in preservation activities.
+
|purpose=Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
|image=
+
|homepage=https://github.com/DIA-NZ/Metadata-Extraction-Tool
|homepage=http://meta-extractor.sourceforge.net/
+
|sourcecode=https://github.com/DIA-NZ/Metadata-Extraction-Tool
|license=Apache Public License (version 2)
+
|license=[http://www.apache.org/licenses/LICENSE-2.0.html Apache Public License (version 2)]
 
|platforms=Must have Java installed and enabled
 
|platforms=Must have Java installed and enabled
 +
|function=Metadata Extraction
 +
}}
 +
{{Infobox tool details
 +
|ohloh_id=meta-extractor
 
}}
 
}}
 
<!-- Delete the Categories that do not apply -->
 
[[Category:Metadata Extraction]]
 
 
 
 
= Description =
 
= Description =
The Metadata Extraction tool automatically extracts preservation-related metadata from digital files and outputs that metadata in XML for use in preservation activities. Developed by National Library of New Zealand. Written in Java and XML.
+
The [http://natlib.govt.nz/librarians/digital-library-tools Metadata Extraction Tool] automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.
 +
 +
====Provider====
 +
The National Library of New Zealand (NLNZ)
 +
 +
====Platform and interoperability====
 +
The software uses Java and XML, and has been tested in Windows and Linux/Unix environments.
 +
 +
====Functional notes====
 +
The Metadata Extraction Tool uses a library of &lsquo;adapters&rsquo; to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata.
 +
The application opens all files as read-only, ensuring the integrity of original files.
 +
 +
====Documentation and user support====
 +
The Tool&rsquo;s [http://meta-extractor.sourceforge.net/ Sourceforge page] includes user and installation guides, as well as a developer guide.
 +
Users can report bugs through the Sourceforge site, which also lists a contact email.
 +
 +
====Usability====
 +
The tool has both a GUI and command line interface.
 +
 +
====Expertise required====
 +
Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata.
 +
 +
====Standards compliance====
 +
The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas.
 +
 +
====Influence and take-up====
 +
Sourceforge statistics show approximately 38,000 downloads since 2007.
  
 
= User Experiences =
 
= User Experiences =
 +
<!-- Add hotlinks to user experiences with the tool (eg. blog posts). These should illustrate the effectiveness (or otherwise) of the tool. Use a bullet list. -->
 +
* '''FITS (File Information Tool Set):''' Used in [[FITS (File Information Tool Set)|FITS]]
  
 +
= Development Activity =
 +
Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014.
  
= Development Activity =
+
The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.
 +
 
 +
All development activity is visible on http://sourceforge.net/projects/meta-extractor/

Latest revision as of 01:55, 20 August 2021



Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
Homepage:https://github.com/DIA-NZ/Metadata-Extraction-Tool
Source Code:https://github.com/DIA-NZ/Metadata-Extraction-Tool
License:Apache Public License (version 2)
Platforms:Must have Java installed and enabled
Function:Metadata Extraction


Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt661de498d9a379_47411967


Description[edit]

The Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.

Provider[edit]

The National Library of New Zealand (NLNZ)

Platform and interoperability[edit]

The software uses Java and XML, and has been tested in Windows and Linux/Unix environments.

Functional notes[edit]

The Metadata Extraction Tool uses a library of ‘adapters’ to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata. The application opens all files as read-only, ensuring the integrity of original files.

Documentation and user support[edit]

The Tool’s Sourceforge page includes user and installation guides, as well as a developer guide. Users can report bugs through the Sourceforge site, which also lists a contact email.

Usability[edit]

The tool has both a GUI and command line interface.

Expertise required[edit]

Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata.

Standards compliance[edit]

The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas.

Influence and take-up[edit]

Sourceforge statistics show approximately 38,000 downloads since 2007.

User Experiences[edit]

  • FITS (File Information Tool Set): Used in FITS

Development Activity[edit]

Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014.

The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.

All development activity is visible on http://sourceforge.net/projects/meta-extractor/