Difference between revisions of "Kraken"
Jump to navigation
Jump to search
m (→Description) |
|||
| Line 1: | Line 1: | ||
| − | + | {{Infobox tool | |
| − | |||
| − | {{ | ||
|purpose=Open Source turn-key OCR system forked from ocropus | |purpose=Open Source turn-key OCR system forked from ocropus | ||
|homepage=http://kraken.re/ | |homepage=http://kraken.re/ | ||
|license=Apache 2.0 License | |license=Apache 2.0 License | ||
|platforms=Linux | |platforms=Linux | ||
| + | |function=OCR | ||
| + | |content=ALTO format | ||
}} | }} | ||
| − | <!-- | + | {{Infobox tool details}} |
| + | <!-- Use the structure provided in this template, do not change it! --> | ||
| − | |||
| − | |||
| − | |||
| − | |||
| − | <!-- | + | <!-- Note that to use the image field, you should leave the value as {{PAGENAMEE}}.png (or similar) and upload a copy of the image. Hot-linking is not supported. If you don't want an image, just remove that line. --> |
| − | |||
| − | |||
| − | |||
== Description == | == Description == | ||
Revision as of 09:42, 23 April 2021
Description
kraken is a turn-key OCR system forked from ocropus. It is intended to rectify a number of issues while preserving (mostly) functional equivalence.
main features:
- Script detection and multi-script recognition support
- Right-to-Left, BiDi, and Top-to-Bottom script support
- ALTO, abbyXML, and hOCR output
- Word bounding boxes and character cuts
- Public repository of model files
- Lightweight model files
- Variable recognition network architectures
All functionality not pertaining to OCR and prerequisite steps has been removed, i.e. no more error rate measuring, etc.