Today, there are numerous optical character recognition (OCR) software tools that enable the optical identification and digitization of a wide variety of characters and alphanumeric typefaces. These tools differ in operating mode and recognition quality, so their results can differ significantly. Problems arise most often when recognizing the character sets of more complex and unusual typefaces, special characters (e.g. those representing postalveolar consonants), and text set out in tables.
The theoretical part of the thesis presents the history of OCR programs, their operating mode, the implementation of digitization, and improvements in recognition, together with the features of optical readers and OCR software. It also defines the essential differences between free and paid OCR programs and gives guidelines for selecting a suitable OCR program for personal use.
In the experimental part, a study was conducted on how well free and paid OCR software tools recognize individual selected letters and typefaces. For this purpose, a graphic design template was created, containing variously defined and shaped elements of the selected alphanumeric typefaces. To compare conversion quality, a spreadsheet with data and a photograph containing text were added to the template alongside the individual characters. Based on the results obtained, a detailed analysis was performed, which made it possible to recommend the best solution for using a particular free OCR software tool.