Difference between revisions of "Metadata Extraction Tool"

From COPTR
Jump to navigation Jump to search
(User Experiences: FITS)
(Changing link to the code from Sourceforge to Github)
 
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
{{Infobox_tool
+
{{Infobox tool
 
|purpose=Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
 
|purpose=Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
|image=
+
|homepage=https://github.com/DIA-NZ/Metadata-Extraction-Tool
|homepage=http://meta-extractor.sourceforge.net/
+
|sourcecode=https://github.com/DIA-NZ/Metadata-Extraction-Tool
|license=Apache Public License (version 2)
+
|license=[http://www.apache.org/licenses/LICENSE-2.0.html Apache Public License (version 2)]
 
|platforms=Must have Java installed and enabled
 
|platforms=Must have Java installed and enabled
 +
|function=Metadata Extraction
 +
}}
 +
{{Infobox tool details
 +
|ohloh_id=meta-extractor
 
}}
 
}}
 
<!-- Delete the Categories that do not apply -->
 
[[Category:Metadata Extraction]]
 
 
 
 
= Description =
 
= Description =
 
The [http://natlib.govt.nz/librarians/digital-library-tools Metadata Extraction Tool] automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.
 
The [http://natlib.govt.nz/librarians/digital-library-tools Metadata Extraction Tool] automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.
Line 16: Line 15:
 
====Provider====
 
====Provider====
 
The National Library of New Zealand (NLNZ)
 
The National Library of New Zealand (NLNZ)
 
====Licensing and cost====
 
[http://www.apache.org/licenses/LICENSE-2.0.html Apache Public License (version 2)] &ndash; free.
 
 
   
 
   
 
====Platform and interoperability====
 
====Platform and interoperability====
Line 48: Line 44:
  
 
= Development Activity =
 
= Development Activity =
Version 3.5GA was released in June 2010.
+
Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014.
 +
 
 
The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.
 
The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.
  
 
All development activity is visible on http://sourceforge.net/projects/meta-extractor/
 
All development activity is visible on http://sourceforge.net/projects/meta-extractor/
 
{{Infobox_tool_details
 
|ohloh_id=meta-extractor
 
}}
 

Latest revision as of 01:55, 20 August 2021



Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files.
Homepage:https://github.com/DIA-NZ/Metadata-Extraction-Tool
Source Code:https://github.com/DIA-NZ/Metadata-Extraction-Tool
License:Apache Public License (version 2)
Platforms:Must have Java installed and enabled
Function:Metadata Extraction


Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt673e9300eb8549_39205008


Description[edit]

The Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.

Provider[edit]

The National Library of New Zealand (NLNZ)

Platform and interoperability[edit]

The software uses Java and XML, and has been tested in Windows and Linux/Unix environments.

Functional notes[edit]

The Metadata Extraction Tool uses a library of ‘adapters’ to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata. The application opens all files as read-only, ensuring the integrity of original files.

Documentation and user support[edit]

The Tool’s Sourceforge page includes user and installation guides, as well as a developer guide. Users can report bugs through the Sourceforge site, which also lists a contact email.

Usability[edit]

The tool has both a GUI and command line interface.

Expertise required[edit]

Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata.

Standards compliance[edit]

The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas.

Influence and take-up[edit]

Sourceforge statistics show approximately 38,000 downloads since 2007.

User Experiences[edit]

  • FITS (File Information Tool Set): Used in FITS

Development Activity[edit]

Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014.

The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.

All development activity is visible on http://sourceforge.net/projects/meta-extractor/