Difference between revisions of "Officeparser.py"

From COPTR
Jump to navigation Jump to search
m (Andy Jackson moved page OfficeParser to Officeparser.py)
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
<!-- Use the structure provided in this template, do not change it! -->
+
{{Infobox tool
 
 
{{Infobox_tool
 
 
|purpose=officerparser.py is a python script that parses the format of OLE compound documents used by Microsoft Office applications.
 
|purpose=officerparser.py is a python script that parses the format of OLE compound documents used by Microsoft Office applications.
|image={{PAGENAMEE}}.png
 
 
|homepage=https://github.com/unixfreak0037/officeparser
 
|homepage=https://github.com/unixfreak0037/officeparser
 
|license=MIT License
 
|license=MIT License
 
|platforms=Cross-platform (Python)
 
|platforms=Cross-platform (Python)
 +
|function=Metadata Extraction, File Format Identification
 +
|content=Document, Spreadsheet
 
}}
 
}}
<!-- Note that to use the image field, you should leave the value as {{PAGENAMEE}}.png (or similar) and upload a copy of the image. Hot-linking is not supported. If you don't want an image, just remove that line. -->
+
{{Infobox tool details}}
 
 
<!-- Add one or more categories to describe the function of the tool, such as:
 
[[Category:Metadata Extraction]] or [[Category:Preservation System]] or [[Category:Backup]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left) -->
 
[[Category:Metadata Extraction]]
 
[[Category:File Format Identification]]
 
 
 
<!-- Add relevant categories to describe the content type that the tool addresses, such as:
 
[[Category:Audio]] or [[Category:Document]] or [[Category:Research Data]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left). If the tool works on any content type, do not add a category. -->
 
[[Category:Document]]
 
 
 
 
== Description ==
 
== Description ==
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
Line 40: Line 27:
  
 
<!-- Add the OpenHub.com ID for the tool, if known. -->
 
<!-- Add the OpenHub.com ID for the tool, if known. -->
{{Infobox_tool_details
 
|ohloh_id=
 
}}
 

Latest revision as of 14:33, 26 April 2021



officerparser.py is a python script that parses the format of OLE compound documents used by Microsoft Office applications.
Homepage:https://github.com/unixfreak0037/officeparser
License:MIT License
Platforms:Cross-platform (Python)
Function:Metadata Extraction,File Format Identification
Content type:Document,Spreadsheet




Description[edit]

It can perform:

  • macro extraction
  • embedded file extraction
  • format analysis


User Experiences[edit]

Development Activity[edit]