PDF Tools (by Didier Stevens)

From COPTR

Latest revision as of 21:29, 25 May 2021


Tools for parsing and analysing PDF documents
Homepage: http://blog.didierstevens.com/programs/pdf-tools/
License: Not specified, public domain
Input Formats: PDF
Function: Dependency Analysis, Metadata Extraction, Validation
Content type: Document



Description

This is a set of Python scripts for analysing PDF documents. The main ones are:

pdf-parser.py

This tool parses a PDF document and identifies the fundamental elements used in the analysed file. A command-line option searches for specific text strings within indirect objects.
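
To illustrate what searching within indirect objects means, here is a minimal Python sketch. It is not the code of pdf-parser.py: the function name `find_objects_containing`, the regular expression, and the sample bytes are all assumptions made for this example, and the approach ignores object streams, compressed streams, and incremental updates.

```python
import re

# Simplified pattern for "N G obj ... endobj" (assumption: objects are
# stored uncompressed and "endobj" does not occur inside the body).
INDIRECT_OBJECT = re.compile(rb"(\d+)\s+(\d+)\s+obj\b(.*?)endobj", re.DOTALL)


def find_objects_containing(pdf_bytes, needle):
    """Return (object number, generation) pairs whose body contains needle.

    The comparison is case-insensitive. This helper is hypothetical and
    only illustrates the idea of searching inside indirect objects.
    """
    hits = []
    for match in INDIRECT_OBJECT.finditer(pdf_bytes):
        body = match.group(3)
        if needle.lower() in body.lower():
            hits.append((int(match.group(1)), int(match.group(2))))
    return hits


# Hand-written fragment, not a complete or valid PDF file.
sample = (
    b"%PDF-1.4\n"
    b"1 0 obj\n<< /Type /Catalog /OpenAction 2 0 R >>\nendobj\n"
    b"2 0 obj\n<< /S /JavaScript /JS (app.alert(1)) >>\nendobj\n"
)

print(find_objects_containing(sample, b"/JavaScript"))
```

Each hit is reported as an object number and generation number, which is how indirect objects are identified in a PDF file.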

pdfid.py

This tool scans a file for certain PDF keywords, allowing you to identify PDF documents that (for example) contain JavaScript or execute an action when opened.
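
The keyword-scanning idea can be sketched in a few lines of Python. This is an illustration only, not the implementation of pdfid.py: the keyword list, the function name `count_keywords`, and the sample bytes are assumptions, and the sketch does not handle names obfuscated with hex escapes or keywords hidden inside compressed streams.

```python
import re

# Assumed keyword list for this example: names associated with JavaScript
# and with actions that run automatically.
KEYWORDS = [b"/JS", b"/JavaScript", b"/OpenAction", b"/AA"]


def count_keywords(pdf_bytes):
    """Count how often each keyword appears in the raw bytes of a file."""
    counts = {}
    for keyword in KEYWORDS:
        # The lookahead rejects longer names that merely start with the
        # keyword, for example "/AAPL" when counting "/AA".
        pattern = re.escape(keyword) + rb"(?![A-Za-z0-9])"
        counts[keyword.decode("ascii")] = len(re.findall(pattern, pdf_bytes))
    return counts


# Hand-written fragment, not a complete or valid PDF file.
sample = (
    b"%PDF-1.4\n"
    b"1 0 obj\n<< /Type /Catalog /OpenAction 2 0 R >>\nendobj\n"
    b"2 0 obj\n<< /S /JavaScript /JS (app.alert(1)) >>\nendobj\n"
)

for name, count in count_keywords(sample).items():
    print(name, count)
```

A non-zero count for a name such as /JavaScript or /OpenAction does not prove that a document is malicious; it only flags the file for closer inspection.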

User Experiences

Development Activity