Difference between revisions of "PET (PERICLES Extraction Tool)"

Latest revision as of 15:53, 21 April 2021


A tool to capture contextual information in a sheer curation scenario
Homepage: https://github.com/pericles-project/pet
License: Apache 2
Platforms: Cross-platform
Language: Java
Function: Metadata Extraction, Dependency Analysis



Release Feed

  • 1.2.1 (2016-01-21 15:01:34, by corubolo): Version 1.2.1 - Updated libraries
  • 1.2 (2016-01-21 10:45:52, by anna-g-arbeiter)
  • 1.1.1 (2015-01-29 16:11:29, by corubolo): Version 1.1.1
  • 1.1 (2015-01-26 16:29:01, by corubolo): Bugfix release
  • 1.0 (2014-10-30 09:37:26, by corubolo): First binary release of PET tool

Description

The PERICLES Extraction Tool (PET) is open-source (Apache 2 licensed) Java software that extracts significant information from the environment in which digital objects are created and modified. This information supports the use and reuse of those objects, for example for better long-term preservation of data. For most of its metadata extraction PET relies on Apache Tika, complemented by the following modules:

  • CPU specification snapshot
  • CPU usage monitoring
  • Calculate file checksum
  • Create custom executable command (file dependent)
  • Create custom executable command (file independent)
  • Directory Monitor Module
  • FQDN
  • File identification
  • File store information (java.nio.file)
  • File store information (sigar)
  • File system information snapshot
  • Google chrome opened tabs monitoring
  • Graphic System properties snapshot
  • Graphic card information module
  • Installed software snapshot
  • Java installation information snapshot
  • LSOF use monitor
  • List of network interfaces
  • Log expression grep
  • MediaInfo
  • Memory monitoring
  • Network information
  • OS X Spotlight Command module
  • Office document dependencies
  • Operating System properties snapshot
  • PDF Font dependencies
  • Posix file information monitoring
  • Process parameter
  • Process statistics monitoring
  • Regex text search
  • Screenshot module
  • System resources snapshot
  • System swap monitoring
  • TCP statistics monitoring
  • Uptime
  • Who (user, host, device, time)
  • Windows Handle monitoring daemon
  • XML xPath expression
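
To give a sense of the kind of information a few of these modules gather (file checksum, file store information via java.nio.file, and FQDN), here is a minimal sketch that uses only the Java standard library. It is not taken from the PET source code: the class name `EnvironmentSnapshot`, the method names, and the choice of SHA-256 are assumptions made purely for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.nio.file.FileStore;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; this is not PET's own code.
public class EnvironmentSnapshot {

    // Streams the file through a SHA-256 digest and returns it as lowercase hex.
    // (The algorithm is an assumption; the source does not say which one PET uses.)
    static String sha256(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest digest = MessageDigest.getInstance("SHA-256");
        try (InputStream in = Files.newInputStream(file)) {
            byte[] buffer = new byte[8192];
            int read;
            while ((read = in.read(buffer)) != -1) {
                digest.update(buffer, 0, read);
            }
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    // Collects basic facts about the file store (volume) that holds the file.
    static Map<String, String> fileStoreInfo(Path file) throws IOException {
        FileStore store = Files.getFileStore(file);
        Map<String, String> info = new LinkedHashMap<>();
        info.put("name", store.name());
        info.put("type", store.type());
        info.put("totalBytes", Long.toString(store.getTotalSpace()));
        info.put("usableBytes", Long.toString(store.getUsableSpace()));
        return info;
    }

    // Returns the canonical host name of the local machine, or "unknown"
    // if the local host name cannot be resolved.
    static String fqdn() {
        try {
            return InetAddress.getLocalHost().getCanonicalHostName();
        } catch (UnknownHostException e) {
            return "unknown";
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 0) {
            System.err.println("usage: java EnvironmentSnapshot <file>");
            return;
        }
        Path file = Paths.get(args[0]);
        System.out.println("host: " + fqdn());
        System.out.println("file store: " + fileStoreInfo(file));
        if (Files.isRegularFile(file)) {
            System.out.println("sha256: " + sha256(file));
        }
    }
}
```

Running `java EnvironmentSnapshot.java somefile.txt` (Java 11 or later) prints the host name, the file store details, and the checksum for that file. PET itself combines many more sources and can monitor changes over time, which this sketch does not attempt.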

The tool was developed entirely for the PERICLES EU project (http://www.pericles-project.eu/) by Fabio Corubolo (University of Liverpool) and Anna Eggers (Göttingen State and University Library).

A more detailed description can be found in this blog post: http://pericles-project.eu/blog/post/metadata%20extraction,%20environment%20information

User Experiences

  • User Wittmann: PET is an easy-to-use and easy-to-install tool that does everything it promises. The user interface is self-explanatory and the program runs fast. However, the information necessary for long-term storage is provided by Apache Tika. If the other modules of PET are not expressly needed, it is better and faster to use Apache Tika directly instead of the PERICLES Extraction Tool.

Development Activity