Jump to navigation Jump to search

PDF/A validation tool
License:GPL v3+, MPL v2+
Platforms:Windows, Mac, Linux
Input Formats:PDF
Content type:Document
Appears in COW:PDF/A validation and metadata extraction

Release Feed

Failed to load RSS feed from Error parsing XML for RSS


veraPDF is an open source PDF/A validation tool for Windows, Mac and Linux. It validates all parts and conformance levels of ISO 19005 (PDF/A). Results are exported using either the HTML or XML format.

About veraPDF[edit]

Designed to meet the needs of digital preservationists, and supported by leading members of the PDF software developer community, veraPDF is a purpose-built, open source, permissively licensed file-format validator covering all PDF/A parts and conformance levels.

The veraPDF consortium

Led by the Open Preservation Foundation (OPF) and the PDF Association, the Consortium’s mission is to develop the definitive, open-source validator for PDF/A, and to build a community to maintain the project in the long term.

Funded by the European Commission’s PREFORMA Project

veraPDF was funded by the PREFORMA project. PREFORMA – PREservation FORMAts for culture information/e-archives, a Pre-Commercial Procurement (PCP) project co-funded by the European Commission under its FP7-ICT Programme. The project’s main aim was to address the challenge of implementing standardised file formats for preserving digital objects in the long term, giving memory institutions full control over the acceptance and management of preservation files into digital repositories. OPF, PDF Association and Dual Labs continue to maintain veraPDF, and funding for significant new developments is welcomed and sought.

PDF/A validation[edit]

veraPDF logoThe specification for PDF/A is a set of restrictions and requirements applied to the “base” PDF standards (PDF 1.4 for PDF/A-1 and ISO 32000 for PDF/A-2 and PDF/A-3) plus a specific set of 3rd party standards. The veraPDF subsystems include:

veraPDF Implementation Checker

The Implementation Checker parses and analyzes PDF documents. It outputs two types of report: a report describing the PDF document and its metadata and a Validation Report describing conformance to PDF/A flavours.

veraPDF Metadata Fixer

The Metadata Fixer makes a limited set of fixes to metadata within PDF documents, such as removal of the PDF/A flag in the case of a non-conforming document, or the repair of broken XMP metadata, if bad XMP is the only error preventing a legitimate PDF/A flag.. The Metadata Fixer produces a fixed version of the original document and a Metadata Fixing Report, which describes the fixes attempted, and their success or failure.

veraPDF Policy Checker

The Policy Checker parses and analyzes a PDF Features Report and generates a Policy Report stating whether the PDF document complies with institutional policy as expressed in a Policy Profile. Note that the Policy Checker can be used to check for almost any quality in a PDF; for example, the use of annotations, irrespective of PDF/A.

veraPDF Reporter

The Reporter transforms verPDF’s machine-readable reports as generated by the Implementation Checker, Policy Checker, and Metadata Fixer, into other forms for downstream use.

veraPDF Shell

The Shell manages veraPDF’s other components and ensures interaction in a coordinated sequences of actions. Users interact with the Shell through the Command Line Interface (CLI), Desktop Graphical User Interface, or Web Graphical User Interface.

Open Licensing[edit]

veraPDF is open source software dual licensed for sustainability and reuse in accordance with PREFORMA’s requirements. Other project outputs such as test corpora and documentation are issued under a Creative Commons license.

MPL v2+

The Mozilla Public License v2+ allows covered source code to be mixed with other files under a different, even proprietary license. Code licensed under the MPL must remain under the MPL, and freely available in source form.

GPL v3+

The GNU General Public License v3 guarantees users the freedom to run, study, share (copy), and modify the software. The copyleft quality of the GPLv3 requires those rights to be retained.


The Creative Commons is a public copyright license that enable the free distribution of an otherwise copyrighted work.


User Experiences[edit]

Development Activity[edit]

All development activity is visible on GitHub: