OCR Classification

Initial training of receipts

Initially, an OCR project is created for the document classes and extraction fields of the customer's receipts. Within this project, every document class and document field is configured through phrases, rules, or both, according to the customer's requirements. To simplify training, business transactions are presorted into separate stacks and then introduced to the system. From the content (sentences, phrases, document structure, layout, and so on), the system autonomously learns the corresponding business transactions or document classes.
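
As a rough illustration of what phrase-based configuration of document classes can look like, the following Python sketch scores a text against a list of phrases per class. It is not the product's implementation: the `DocumentClass` type, the `score` and `classify` functions, the class names, and the phrases are all hypothetical, and the scoring rule (the share of configured phrases found in the text) is an assumption chosen for simplicity.

```python
from dataclasses import dataclass


@dataclass
class DocumentClass:
    """Hypothetical document class defined by a list of typical phrases."""
    name: str
    phrases: list[str]


def score(text: str, doc_class: DocumentClass) -> float:
    """Return the share of the class's phrases that occur in the text."""
    if not doc_class.phrases:
        return 0.0
    lowered = text.lower()
    hits = sum(1 for phrase in doc_class.phrases if phrase.lower() in lowered)
    return hits / len(doc_class.phrases)


def classify(text: str, classes: list[DocumentClass]) -> tuple[str, float]:
    """Pick the class with the highest phrase score and return (name, score)."""
    best = max(classes, key=lambda c: score(text, c))
    return best.name, score(text, best)


# Illustrative configuration only; real phrases come from the customer's receipts.
CLASSES = [
    DocumentClass("invoice", ["invoice number", "total amount", "due date"]),
    DocumentClass("delivery_note", ["delivery note", "shipped to", "quantity delivered"]),
]

name, confidence = classify("Invoice number 4711, total amount 99.00 EUR", CLASSES)
print(name, round(confidence, 2))
```

A learning system would adjust such phrase lists from the presorted stacks rather than rely on hand-written ones; the sketch only shows the configuration side.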

Documents whose recognition rate falls below a defined threshold are sent to the training workspace. In the OCR Training Client, the system learns these unrecognized documents when the user assigns the unknown values with a simple click. Training is straightforward even for users without a technical background.

Interface for receipt training
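
The threshold rule described above, under which documents with a low recognition rate go to the training workspace, can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the threshold value of 0.8, the queue names, and the functions `route` and `split_batch` are hypothetical and do not come from the product.

```python
# Hypothetical threshold; the actual value is defined per project.
TRAINING_THRESHOLD = 0.8


def route(recognition_rate: float, threshold: float = TRAINING_THRESHOLD) -> str:
    """Send a document to training if its recognition rate is below the threshold."""
    if recognition_rate < threshold:
        return "training_workspace"
    return "automatic_processing"


def split_batch(
    batch: dict[str, float], threshold: float = TRAINING_THRESHOLD
) -> dict[str, list[str]]:
    """Sort document IDs into queues according to their recognition rates."""
    queues: dict[str, list[str]] = {
        "training_workspace": [],
        "automatic_processing": [],
    }
    for doc_id, rate in batch.items():
        queues[route(rate, threshold)].append(doc_id)
    return queues


# Example batch: document ID mapped to its recognition rate.
queues = split_batch({"receipt-001": 0.95, "receipt-002": 0.42, "receipt-003": 0.80})
print(queues)
```

In this sketch a document exactly at the threshold is processed automatically, because only rates strictly lower than the threshold are routed to training.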