Difference between revisions of "Docworks"

From COPTR
Jump to navigation Jump to search
m
(Added links to Function and/or Content Type)
Line 1: Line 1:
<!-- Use the structure provided in this template, do not change it! -->
+
{{Infobox tool
 
+
|purpose=Document digitization workflow software
{{Infobox_tool
 
|purpose=Document digitization workflow software  
 
 
|homepage=https://content-conversion.com/#docworks-2
 
|homepage=https://content-conversion.com/#docworks-2
 
|license=Commercial
 
|license=Commercial
 
|platforms=Windows
 
|platforms=Windows
 +
|function=OCR, Workflow, Quality Assurance
 +
|content=Image, METS, ALTO format
 
}}
 
}}
<!-- Note that to use the image field, you should leave the value as {{PAGENAMEE}}.png (or similar) and upload a copy of the image. Hot-linking is not supported. If you don't want an image, just remove that line. -->
+
{{Infobox tool details}}
 
 
<!-- Add one or more categories to describe the function of the tool, such as:
 
[[Category:Metadata Extraction]] or [[Category:Preservation System]] or [[Category:Backup]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left) -->
 
[[Category:OCR]]
 
[[Category:Workflow]]
 
[[Category:Quality Assurance]]
 
 
 
<!-- Add relevant categories to describe the content type that the tool addresses, such as:
 
[[Category:Audio]] or [[Category:Document]] or [[Category:Research Data]]
 
Choose carefully, and view the list of existing categories first (see the Navigation sidebar on the left). If the tool works on any content type, do not add a category. -->
 
[[Category:Image]]
 
[[Category:METS]]
 
[[Category:ALTO format]]
 
 
 
 
== Description ==
 
== Description ==
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
Line 38: Line 23:
 
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. -->
 
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. -->
 
<!-- Add the OpenHub.com ID for the tool, if known.  
 
<!-- Add the OpenHub.com ID for the tool, if known.  
{{Infobox_tool_details
+
-->
|releases_rss=
 
|issues_rss=
 
|mailing_lists=
 
|ohloh_id=
 
}} -->
 

Revision as of 19:51, 20 April 2021



Document digitization workflow software
Homepage:https://content-conversion.com/#docworks-2
License:Commercial
Platforms:Windows
Function:OCR,Workflow,Quality Assurance
Content type:Image,METS,ALTO format




Description

docWorks helps archives and content owners to convert their print holdings into professional digital libraries. This process consists of two steps: the digitization, i.e. the scanning of the printed page, and the conversion, i.e. the recognition of all contained text, image, layout and structural information.

docWorks is a conversion software that covers all conversion steps in a single workflow. It provides layout analysis and offers multiple OCR engines to handle any type of publication, language or writing system.

Import formats are TIF, JPG, JP2, GIF and PDF and you can export METS and ALTO XML, image files, PDF, PDF/A-1, full-text XML, RTF and EPUB. Metadata schemes are MIX, MARC, MODS, DC, METS physical structural maps and METS logical structural maps.

User Experiences

Development Activity