Difference between revisions of "Xpdf"
Revision as of 21:27, 13 November 2013
Description
Xpdf is an open-source PDF viewer that also includes command-line utilities for extracting document information and analyzing fonts. The following utilities are particularly relevant to digital preservation:
- pdfinfo: prints the contents of the ‘Info’ dictionary (plus some other useful information) from a Portable Document Format (PDF) file. In addition, the following information is printed:
tagged (yes/no)
form (AcroForm / XFA / none)
page count
encrypted flag (yes/no)
print and copy permissions (if encrypted)
page size
file size
linearized (yes/no)
PDF version
metadata (only if requested)
- pdffonts: lists the fonts used in a Portable Document Format (PDF) file along with various information for each font
The following information is listed for each font:
name: the font name, exactly as given in the PDF file (potentially including a subset prefix)
type: the font type
emb: “yes” if the font is embedded in the PDF file
sub: “yes” if the font is a subset
uni: “yes” if there is an explicit “ToUnicode” map in the PDF file (the absence of a ToUnicode map doesn’t necessarily mean that the text can’t be converted to Unicode)
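For preservation workflows it can be handy to read the output of both utilities from a script. The following Python code is a minimal sketch, not part of Xpdf itself. It assumes that pdfinfo prints one "Key: value" pair per line and that pdffonts prints a header row, a row of dashes marking the column widths, and one row per font. The function names (parse_pdfinfo, parse_pdffonts, run_tool) are hypothetical, and the exact keys and column layout may differ between Xpdf versions.

```python
import re
import subprocess


def parse_pdfinfo(text):
    """Turn 'Key:   value' lines into a dict (assumed pdfinfo layout)."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as dates
        # that contain colons are kept whole.
        key, sep, value = line.partition(":")
        if sep and key.strip():
            info[key.strip()] = value.strip()
    return info


def parse_pdffonts(text):
    """Turn the pdffonts table into a list of dicts, one per font.

    Column boundaries are taken from the row of dashes under the
    header (assumed layout). Names longer than their column would
    break this simple slicing.
    """
    lines = text.splitlines()
    sep_idx = None
    for i, line in enumerate(lines):
        stripped = line.strip()
        if stripped and set(stripped) <= {"-", " "}:
            sep_idx = i
            break
    if sep_idx is None or sep_idx == 0:
        return []  # no recognizable table

    spans = [m.span() for m in re.finditer(r"-+", lines[sep_idx])]
    header = lines[sep_idx - 1]
    columns = [header[start:end].strip() for start, end in spans]

    fonts = []
    for line in lines[sep_idx + 1:]:
        if not line.strip():
            continue
        cells = []
        for n, (start, end) in enumerate(spans):
            # Let the last column run to the end of the line.
            stop = None if n == len(spans) - 1 else end
            cells.append(line[start:stop].strip())
        fonts.append(dict(zip(columns, cells)))
    return fonts


def run_tool(tool, pdf_path):
    """Run 'pdfinfo' or 'pdffonts' (must be on PATH) and return its stdout."""
    result = subprocess.run(
        [tool, pdf_path], capture_output=True, text=True, check=True
    )
    return result.stdout
```

With the utilities installed, parse_pdfinfo(run_tool("pdfinfo", "example.pdf")) would give a dictionary of the reported fields, and parse_pdffonts(run_tool("pdffonts", "example.pdf")) a list of per-font records whose emb and uni values can be checked for fonts that are not embedded or lack a ToUnicode map. Here "example.pdf" is a placeholder file name.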
User Experiences
Development Activity