Difference between revisions of "Kraken"
Jump to navigation
Jump to search
m |
m (→Description) |
||
Line 21: | Line 21: | ||
== Description == | == Description == | ||
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. --> | <!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. --> | ||
− | kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence. | + | kraken is a turn-key OCR system forked from [https://github.com/tmbarchive/ocropy ocropus]. It is intended to rectify a number of issues while preserving (mostly) functional equivalence. |
'''main features:''' | '''main features:''' | ||
Line 33: | Line 33: | ||
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc. | All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc. | ||
− | |||
== User Experiences == | == User Experiences == |
Revision as of 08:19, 3 July 2020
Description
kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.
main features:
- Script detection and multi-script recognition support
- Right-to-Left, BiDi, and Top-to-Bottom script support
- ALTO, abbyXML, and hOCR output
- Word bounding boxes and character cuts
- Public repository of model files
- Lightweight model files
- Variable recognition network architectures
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.