Difference between revisions of "Kraken"

From COPTR
Jump to navigation Jump to search
m
Line 21: Line 21:
 
== Description ==
 
== Description ==
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.
+
kraken is a turn-key OCR system forked from [https://github.com/tmbarchive/ocropy ocropus]. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.
  
 
'''main features:'''
 
'''main features:'''
Line 33: Line 33:
  
 
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.
 
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.
 
  
 
== User Experiences ==
 
== User Experiences ==

Revision as of 08:19, 3 July 2020


Open Source turn-key OCR system forked from ocropus
Homepage:http://kraken.re/
License:Apache 2.0 License
Platforms:Linux

Description

kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.

main features:

  • Script detection and multi-script recognition support
  • Right-to-Left, BiDi, and Top-to-Bottom script support
  • ALTO, abbyXML, and hOCR output
  • Word bounding boxes and character cuts
  • Public repository of model files
  • Lightweight model files
  • Variable recognition network architectures

All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.

User Experiences

Development Activity

Commits : https://github.com/mittagessen/kraken/commits

Issues : https://github.com/mittagessen/kraken/issues