Difference between revisions of "PET (PERICLES Extraction Tool)"

From COPTR
Jump to navigation Jump to search
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
<!-- Use the structure provided in this template, do not change it! -->
+
{{Infobox tool
 
 
{{Infobox_tool
 
 
|purpose=A tool to capture contextual information in a sheer curation scenario
 
|purpose=A tool to capture contextual information in a sheer curation scenario
 
|homepage=https://github.com/pericles-project/pet
 
|homepage=https://github.com/pericles-project/pet
Line 7: Line 5:
 
|platforms=Cross-platform
 
|platforms=Cross-platform
 
|language=Java
 
|language=Java
 +
|function=Metadata Extraction, Dependency Analysis
 +
}}
 +
{{Infobox tool details
 +
|releases_rss=https://github.com/pericles-project/pet/releases.atom
 
}}
 
}}
<!-- Note that to use the image field, you should leave the value as {{PAGENAMEE}}.png (or similar) and upload a copy of the image. Hot-linking is not supported. If you don't want an image, just remove that line. -->
 
 
<!-- Add one or more categories to describe the function of the tool, such as:
 
[[Category:Metadata Extraction]] or [[Category:Preservation System]] or [[Category:Backup]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left) -->
 
[[Category:Dependency Analysis]]
 
[[Category:Metadata Extraction]]
 
 
<!-- Add relevant categories to describe the content type that the tool addresses, such as:
 
[[Category:Audio]] or [[Category:Document]] or [[Category:Research Data]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left). If the tool works on any content type, do not add a category. -->
 
 
 
 
== Description ==
 
== Description ==
 
The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses [http://coptr.digipres.org/Tika Apache TIKA] and some other moduls which are:
 
The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses [http://coptr.digipres.org/Tika Apache TIKA] and some other moduls which are:
Line 74: Line 63:
 
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. -->
 
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. -->
 
<!-- Add the OpenHub.com ID for the tool, if known. -->
 
<!-- Add the OpenHub.com ID for the tool, if known. -->
{{Infobox_tool_details
 
|releases_rss=https://github.com/pericles-project/pet/releases.atom
 
|issues_rss=
 
|mailing_lists=
 
|ohloh_id=
 
}}
 

Latest revision as of 15:53, 21 April 2021


A tool to capture contextual information in a sheer curation scenario
Homepage:https://github.com/pericles-project/pet
License:Apache 2
Platforms:Cross-platform
Language:Java
Function:Metadata Extraction,Dependency Analysis



Release Feed

2016-01-21 15:01:34
[tag:github.com,2008:Repository/22834235/1.2.1 Version 1.2.1 - Updated libraries]
by corubolo
2016-01-21 10:45:52
[tag:github.com,2008:Repository/22834235/1.2 1.2]
by anna-g-arbeiter
2015-01-29 16:11:29
[tag:github.com,2008:Repository/22834235/1.1.1 Version 1.1.1]
by corubolo
2015-01-26 16:29:01
[tag:github.com,2008:Repository/22834235/1.1 Bugfix release]
by corubolo
2014-10-30 09:37:26
[tag:github.com,2008:Repository/22834235/1.0 First binary release of PET tool]
by corubolo

Description[edit]

The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses Apache TIKA and some other moduls which are:

  • CPU specification snapshot
  • CPU usage monitoring
  • Calculate file checksum
  • Create custom executable command (file dependent)
  • Create custom executable command (file independent)
  • Directory Monitor Module
  • FQDN
  • File identification
  • File store information (java.nio.file)
  • File store information (sigar)
  • File system information snapshot
  • Google chrome opened tabs monitoring
  • Graphic System properties snapshot
  • Graphic card information module
  • Installed software snapshot
  • Java installation information snapshot
  • LSOF use monitor
  • List of network interfaces
  • Log expression grep
  • MediaInfo
  • Memory monitoring
  • Network information
  • OS X Spotlight Command module
  • Office document dependencies
  • Operating System properties snapshot
  • PDF Font dependencies
  • Posix file information monitoring
  • Process parameter
  • Process statistics monitoring
  • Regex text search
  • Screenshot module
  • System resources snapshot
  • System swap monitoring
  • TCP statistics monitoring
  • Uptime
  • Who (user, host, device, time)
  • Windows Handle monitoring daemon
  • XML xPath expression

The Tool was developed entirely for the PERICLES EU project http://www.pericles-project.eu/ by Fabio Corubolo, University of Liverpool, and Anna Eggers, Göttingen State and University Library.

A more detailed description can be found in this blog post.

User Experiences[edit]

  • User Wittmann: PET is a easy to use and easy to install tool, which does all thing which are promised. The user interface is self explanatory and the programm runs fast. But the neccessary informations for long time storage are given by Apache TIKA. If the other moduls of PET not expressly needed, then it is better and faster to use Apache TIKA directly instead of the PERICLES Extraction Tool.

Development Activity[edit]