Difference between revisions of "Xpdf"
(Trial import from script.) |
Prwheatley (talk | contribs) |
||
(3 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
− | {{ | + | {{Infobox tool |
|purpose=Open source PDF viewer that includes PDF information extractor and font analyzer | |purpose=Open source PDF viewer that includes PDF information extractor and font analyzer | ||
− | + | |homepage=http://www.xpdfreader.com/ | |
− | |homepage=http://www. | ||
|license=GPL | |license=GPL | ||
− | | | + | |function=Metadata Extraction, Rendering |
+ | |content=Document | ||
+ | }} | ||
+ | {{Infobox tool details | ||
+ | |ohloh_id=Xpdf | ||
}} | }} | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
= Description = | = Description = | ||
Xpdf is an open source PDF viewer that includes command-line information extractor and font analyzer utilities. The following utilities are particularly relevant to digital preservation: | Xpdf is an open source PDF viewer that includes command-line information extractor and font analyzer utilities. The following utilities are particularly relevant to digital preservation: | ||
Line 18: | Line 14: | ||
* ''pdfinfo'': prints the contents of the ‘Info’ dictionary (plus some other useful information) from a Portable Document Format (PDF) file. In addition, the following information is printed: | * ''pdfinfo'': prints the contents of the ‘Info’ dictionary (plus some other useful information) from a Portable Document Format (PDF) file. In addition, the following information is printed: | ||
− | tagged (yes/no) | + | tagged (yes/no) |
− | form (AcroForm / XFA / none) | + | form (AcroForm / XFA / none) |
− | page count | + | page count |
− | encrypted flag (yes/no) | + | encrypted flag (yes/no) |
− | print and copy permissions (if encrypted) | + | print and copy permissions (if encrypted) |
− | page size | + | page size |
− | file size | + | file size |
− | linearized (yes/no) | + | linearized (yes/no) |
− | PDF version | + | PDF version |
metadata (only if requested) | metadata (only if requested) | ||
Line 42: | Line 38: | ||
The following information is listed for each font: | The following information is listed for each font: | ||
− | name the font name, exactly as given in the PDF file (potentially | + | name the font name, exactly as given in the PDF file (potentially |
including a subset prefix) | including a subset prefix) | ||
Line 52: | Line 48: | ||
sub “yes†if the font is a subset | sub “yes†if the font is a subset | ||
− | uni “yes†if there is an explicit “ToUnicode†map in the PDF file | + | uni “yes†if there is an explicit “ToUnicode†map in the PDF file |
− | (the absence of a ToUnicode map doesn’t necessarily mean that | + | (the absence of a ToUnicode map doesn’t necessarily mean that |
the text can’t be converted to Unicode) | the text can’t be converted to Unicode) | ||
Line 62: | Line 58: | ||
= Development Activity = | = Development Activity = | ||
− | |||
− | |||
− |
Latest revision as of 15:18, 21 April 2021
Description[edit]
Xpdf is an open source PDF viewer that includes command-line information extractor and font analyzer utilities. The following utilities are particularly relevant to digital preservation:
- pdfinfo: prints the contents of the ‘Info’ dictionary (plus some other useful information) from a Portable Document Format (PDF) file. In addition, the following information is printed:
tagged (yes/no)
form (AcroForm / XFA / none)
page count
encrypted flag (yes/no)
print and copy permissions (if encrypted)
page size
file size
linearized (yes/no)
PDF version
metadata (only if requested)
- pdffonts: lists the fonts used in a Portable Document Format (PDF) file along with various information for each font
The following information is listed for each font:
name the font name, exactly as given in the PDF file (potentially
including a subset prefix)
type the font type
emb “yes†if the font is embedded in the PDF file
sub “yes†if the font is a subset
uni “yes†if there is an explicit “ToUnicode†map in the PDF file
(the absence of a ToUnicode map doesn’t necessarily mean that
the text can’t be converted to Unicode)