Difference between revisions of "Metadata Extraction Tool"
(Trial import from script.) |
(Changing link to the code from Sourceforge to Github) |
||
(6 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
− | {{ | + | {{Infobox tool |
− | |purpose=Metadata Extraction Tool | + | |purpose=Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files. |
− | | | + | |homepage=https://github.com/DIA-NZ/Metadata-Extraction-Tool |
− | | | + | |sourcecode=https://github.com/DIA-NZ/Metadata-Extraction-Tool |
− | |license=Apache Public License (version 2) | + | |license=[http://www.apache.org/licenses/LICENSE-2.0.html Apache Public License (version 2)] |
|platforms=Must have Java installed and enabled | |platforms=Must have Java installed and enabled | ||
+ | |function=Metadata Extraction | ||
+ | }} | ||
+ | {{Infobox tool details | ||
+ | |ohloh_id=meta-extractor | ||
}} | }} | ||
− | |||
− | |||
− | |||
− | |||
− | |||
= Description = | = Description = | ||
− | The [http://natlib.govt.nz/librarians/digital-library-tools Metadata Extraction Tool ]automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository. | + | The [http://natlib.govt.nz/librarians/digital-library-tools Metadata Extraction Tool] automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository. |
+ | |||
====Provider==== | ====Provider==== | ||
The National Library of New Zealand (NLNZ) | The National Library of New Zealand (NLNZ) | ||
− | + | ||
− | |||
− | |||
− | |||
− | |||
====Platform and interoperability==== | ====Platform and interoperability==== | ||
The software uses Java and XML, and has been tested in Windows and Linux/Unix environments. | The software uses Java and XML, and has been tested in Windows and Linux/Unix environments. | ||
+ | |||
====Functional notes==== | ====Functional notes==== | ||
The Metadata Extraction Tool uses a library of ‘adapters’ to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata. | The Metadata Extraction Tool uses a library of ‘adapters’ to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata. | ||
The application opens all files as read-only, ensuring the integrity of original files. | The application opens all files as read-only, ensuring the integrity of original files. | ||
+ | |||
====Documentation and user support==== | ====Documentation and user support==== | ||
The Tool’s [http://meta-extractor.sourceforge.net/ Sourceforge page] includes user and installation guides, as well as a developer guide. | The Tool’s [http://meta-extractor.sourceforge.net/ Sourceforge page] includes user and installation guides, as well as a developer guide. | ||
Users can report bugs through the Sourceforge site, which also lists a contact email. | Users can report bugs through the Sourceforge site, which also lists a contact email. | ||
+ | |||
====Usability==== | ====Usability==== | ||
The tool has both a GUI and command line interface. | The tool has both a GUI and command line interface. | ||
+ | |||
====Expertise required==== | ====Expertise required==== | ||
Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata. | Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata. | ||
+ | |||
====Standards compliance==== | ====Standards compliance==== | ||
The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas. | The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas. | ||
+ | |||
====Influence and take-up==== | ====Influence and take-up==== | ||
Sourceforge statistics show approximately 38,000 downloads since 2007. | Sourceforge statistics show approximately 38,000 downloads since 2007. | ||
= User Experiences = | = User Experiences = | ||
+ | <!-- Add hotlinks to user experiences with the tool (eg. blog posts). These should illustrate the effectiveness (or otherwise) of the tool. Use a bullet list. --> | ||
+ | * '''FITS (File Information Tool Set):''' Used in [[FITS (File Information Tool Set)|FITS]] | ||
+ | = Development Activity = | ||
+ | Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014. | ||
− | + | The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development. | |
− | + | All development activity is visible on http://sourceforge.net/projects/meta-extractor/ | |
− | |||
− |
Latest revision as of 01:55, 20 August 2021
Description[edit]
The Metadata Extraction Tool automatically extracts a limited set of metadata from the headers of digital files; it has the capability to process both individual files and larger batches. The Tool outputs this information as XML, with the goal of facilitating transfer into a preservation metadata repository.
Provider[edit]
The National Library of New Zealand (NLNZ)
Platform and interoperability[edit]
The software uses Java and XML, and has been tested in Windows and Linux/Unix environments.
Functional notes[edit]
The Metadata Extraction Tool uses a library of ‘adapters’ to extract metadata for specific file types. Adapters have been created for the following formats: BMP, GIF, JPEG and TIFF; MS Word, Word Perfect, Open Office, MS Works, MS Excel, MS PowerPoint, and PDF; WAV, MP3, BFW, and FLAC; HTML and XML; and ARC. If the file type is unknown the Tool applies a generic adapter, which extracts a limited amount baseline metadata. The application opens all files as read-only, ensuring the integrity of original files.
Documentation and user support[edit]
The Tool’s Sourceforge page includes user and installation guides, as well as a developer guide. Users can report bugs through the Sourceforge site, which also lists a contact email.
Usability[edit]
The tool has both a GUI and command line interface.
Expertise required[edit]
Installation and configuration require solid knowledge of application design and technologies. Users should have comprehensive knowledge of metadata standards and formats, particularly regarding preservation metadata.
Standards compliance[edit]
The Metadata Extraction Tool currently outputs its XML files using the NLNZ preservation metadata schema; however, the software can be configured to support other schemas.
Influence and take-up[edit]
Sourceforge statistics show approximately 38,000 downloads since 2007.
User Experiences[edit]
- FITS (File Information Tool Set): Used in FITS
Development Activity[edit]
Version 3.5GA was released in June 2010. Latest release 3.6GA is from 2014.
The initial version of the tool was released in 2003; redevelopment for version 3 began in 2007. Contact information on the NLNZ site implies ongoing support; no information is available about ongoing development.
All development activity is visible on http://sourceforge.net/projects/meta-extractor/