Difference between revisions of "Kraken"
Jump to navigation
Jump to search
Prwheatley (talk | contribs) |
|||
Line 4: | Line 4: | ||
|license=Apache 2.0 License | |license=Apache 2.0 License | ||
|platforms=Linux | |platforms=Linux | ||
+ | |formats_out=ALTO (Analyzed Layout and Text Object) | ||
|function=OCR | |function=OCR | ||
− | |content= | + | |content=Image, Document |
}} | }} | ||
{{Infobox tool details}} | {{Infobox tool details}} |
Latest revision as of 14:54, 8 June 2021
Description[edit]
kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.
main features:
- Script detection and multi-script recognition support
- Right-to-Left, BiDi, and Top-to-Bottom script support
- ALTO, abbyXML, and hOCR output
- Word bounding boxes and character cuts
- Public repository of model files
- Lightweight model files
- Variable recognition network architectures
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.