Difference between revisions of "PET (PERICLES Extraction Tool)"
Jump to navigation
Jump to search
Prwheatley (talk | contribs) |
Prwheatley (talk | contribs) |
||
(7 intermediate revisions by 5 users not shown) | |||
Line 1: | Line 1: | ||
− | + | {{Infobox tool | |
− | |||
− | {{ | ||
|purpose=A tool to capture contextual information in a sheer curation scenario | |purpose=A tool to capture contextual information in a sheer curation scenario | ||
|homepage=https://github.com/pericles-project/pet | |homepage=https://github.com/pericles-project/pet | ||
Line 7: | Line 5: | ||
|platforms=Cross-platform | |platforms=Cross-platform | ||
|language=Java | |language=Java | ||
+ | |function=Metadata Extraction, Dependency Analysis | ||
}} | }} | ||
− | + | {{Infobox tool details | |
− | + | |releases_rss=https://github.com/pericles-project/pet/releases.atom | |
− | + | }} | |
− | + | == Description == | |
− | + | The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses [http://coptr.digipres.org/Tika Apache TIKA] and some other moduls which are: | |
− | [ | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
+ | * CPU specification snapshot | ||
+ | * CPU usage monitoring | ||
+ | * Calculate file checksum | ||
+ | * Create custom executable command (file dependent) | ||
+ | * Create custom executable command (file independent) | ||
+ | * Directory Monitor Module | ||
+ | * FQDN | ||
+ | * File identification | ||
+ | * File store information (java.nio.file) | ||
+ | * File store information (sigar) | ||
+ | * File system information snapshot | ||
+ | * Google chrome opened tabs monitoring | ||
+ | * Graphic System properties snapshot | ||
+ | * Graphic card information module | ||
+ | * Installed software snapshot | ||
+ | * Java installation information snapshot | ||
+ | * LSOF use monitor | ||
+ | * List of network interfaces | ||
+ | * Log expression grep | ||
+ | * [[MediaInfo]] | ||
+ | * Memory monitoring | ||
+ | * Network information | ||
+ | * OS X Spotlight Command module | ||
+ | * Office document dependencies | ||
+ | * Operating System properties snapshot | ||
+ | * PDF Font dependencies | ||
+ | * Posix file information monitoring | ||
+ | * Process parameter | ||
+ | * Process statistics monitoring | ||
+ | * Regex text search | ||
+ | * Screenshot module | ||
+ | * System resources snapshot | ||
+ | * System swap monitoring | ||
+ | * TCP statistics monitoring | ||
+ | * Uptime | ||
+ | * Who (user, host, device, time) | ||
+ | * Windows Handle monitoring daemon | ||
+ | * XML xPath expression | ||
− | + | The Tool was developed entirely for the PERICLES EU project http://www.pericles-project.eu/ by Fabio Corubolo, University of Liverpool, and Anna Eggers, Göttingen State and University Library. | |
− | |||
A more detailed description can be found in this [http://pericles-project.eu/blog/post/metadata%20extraction,%20environment%20information blog post]. | A more detailed description can be found in this [http://pericles-project.eu/blog/post/metadata%20extraction,%20environment%20information blog post]. | ||
== User Experiences == | == User Experiences == | ||
− | + | <!-- Add hotlinks to user experiences with the tool (eg. blog posts). These should illustrate the effectiveness (or otherwise) of the tool. Use a bullet list. --> | |
+ | *'''User Wittmann:''' PET is a easy to use and easy to install tool, which does all thing which are promised. The user interface is self explanatory and the programm runs fast. But the neccessary informations for long time storage are given by Apache TIKA. If the other moduls of PET not expressly needed, then it is better and faster to use Apache TIKA directly instead of the PERICLES Extraction Tool. | ||
== Development Activity == | == Development Activity == | ||
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. --> | <!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. --> | ||
<!-- Add the OpenHub.com ID for the tool, if known. --> | <!-- Add the OpenHub.com ID for the tool, if known. --> | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− |
Latest revision as of 15:53, 21 April 2021
Release Feed
- 2016-01-21 15:01:34
- [tag:github.com,2008:Repository/22834235/1.2.1 Version 1.2.1 - Updated libraries]
- by corubolo
- 2016-01-21 10:45:52
- [tag:github.com,2008:Repository/22834235/1.2 1.2]
- by anna-g-arbeiter
- 2015-01-29 16:11:29
- [tag:github.com,2008:Repository/22834235/1.1.1 Version 1.1.1]
- by corubolo
- 2015-01-26 16:29:01
- [tag:github.com,2008:Repository/22834235/1.1 Bugfix release]
- by corubolo
- 2014-10-30 09:37:26
- [tag:github.com,2008:Repository/22834235/1.0 First binary release of PET tool]
- by corubolo
Description[edit]
The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses Apache TIKA and some other moduls which are:
- CPU specification snapshot
- CPU usage monitoring
- Calculate file checksum
- Create custom executable command (file dependent)
- Create custom executable command (file independent)
- Directory Monitor Module
- FQDN
- File identification
- File store information (java.nio.file)
- File store information (sigar)
- File system information snapshot
- Google chrome opened tabs monitoring
- Graphic System properties snapshot
- Graphic card information module
- Installed software snapshot
- Java installation information snapshot
- LSOF use monitor
- List of network interfaces
- Log expression grep
- MediaInfo
- Memory monitoring
- Network information
- OS X Spotlight Command module
- Office document dependencies
- Operating System properties snapshot
- PDF Font dependencies
- Posix file information monitoring
- Process parameter
- Process statistics monitoring
- Regex text search
- Screenshot module
- System resources snapshot
- System swap monitoring
- TCP statistics monitoring
- Uptime
- Who (user, host, device, time)
- Windows Handle monitoring daemon
- XML xPath expression
The Tool was developed entirely for the PERICLES EU project http://www.pericles-project.eu/ by Fabio Corubolo, University of Liverpool, and Anna Eggers, Göttingen State and University Library.
A more detailed description can be found in this blog post.
User Experiences[edit]
- User Wittmann: PET is a easy to use and easy to install tool, which does all thing which are promised. The user interface is self explanatory and the programm runs fast. But the neccessary informations for long time storage are given by Apache TIKA. If the other moduls of PET not expressly needed, then it is better and faster to use Apache TIKA directly instead of the PERICLES Extraction Tool.