Difference between revisions of "Docworks"

From COPTR
Jump to navigation Jump to search
m
m
Line 21: Line 21:
 
[[Category:Image]]
 
[[Category:Image]]
 
[[Category:METS]]
 
[[Category:METS]]
 +
[[Category:ALTO format]]
  
 
== Description ==
 
== Description ==

Revision as of 06:13, 3 July 2020


Document digitization workflow software
Homepage:https://content-conversion.com/#docworks-2
License:Commercial
Platforms:Windows

Description

docWorks helps archives and content owners to convert their print holdings into professional digital libraries. This process consists of two steps: the digitization, i.e. the scanning of the printed page, and the conversion, i.e. the recognition of all contained text, image, layout and structural information.

docWorks is a conversion software that covers all conversion steps in a single workflow. It provides layout analysis and offers multiple OCR engines to handle any type of publication, language or writing system.

Import formats are TIF, JPG, JP2, GIF and PDF and you can export METS and ALTO XML, image files, PDF, PDF/A-1, full-text XML, RTF and EPUB. Metadata schemes are MIX, MARC, MODS, DC, METS physical structural maps and METS logical structural maps.

User Experiences

Development Activity