OCR and word reviewing

开发者 https://www.devze.com 2022-12-21 23:48 Source: Internet

I'm using Tesseract for my letter recognition project, and currently the recognition is quite good. The image processing part was done using the OpenCV libraries. The letters are handwritten. However, there are problems when it has to tell the letter "O" from the number "0". These characters appear in data fields such as names, and a name cannot contain any digits. A data field such as date of birth, on the other hand, contains only digits. So I would like to restrict the recognition system so that each data field is recognised as only digits or only letters.

I would also like to check the recognised letters against a list of possible words, so that the accuracy of the data can be improved. I would like to use the OpenCV libraries for this task, but I don't know which libraries would help with it or what functionality they offer. Could someone please help me? Thank you.

Regards, Thilanka.


I've never used Tesseract. However, in the FAQ it says

How do I recognise only digits?

TessBaseAPI::SetVariable("tessedit_char_whitelist", "0123456789");

Presumably you could use the pattern of the FAQ entry to set it up so it only recognises letters or just digits appropriately.
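
As a rough sketch of that idea, assuming each field is cropped and recognised separately, something like the following could pick the whitelist per field. The names `FieldKind`, `whitelist_for` and `restrict_to_field` are made up for illustration and are not part of Tesseract; only `SetVariable` and `tessedit_char_whitelist` come from the FAQ entry. The helper is a template so that it accepts a `tesseract::TessBaseAPI` object or anything else with the same `SetVariable(const char*, const char*)` member.

```cpp
#include <string>

// Hypothetical field categories for the form described in the question.
enum class FieldKind { Name, DateOfBirth };

// Characters the recogniser is allowed to output for each kind of field.
inline std::string whitelist_for(FieldKind kind) {
    switch (kind) {
        case FieldKind::Name:
            return "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz";
        case FieldKind::DateOfBirth:
            return "0123456789";
    }
    return "";  // not reached for valid enum values; avoids compiler warnings
}

// Api is expected to be tesseract::TessBaseAPI, or any type exposing a
// compatible SetVariable(const char*, const char*) member.
// Assumption: this is called after the API has been initialised and before
// recognition is run on the cropped image of the field.
template <typename Api>
bool restrict_to_field(Api& api, FieldKind kind) {
    const std::string allowed = whitelist_for(kind);
    return api.SetVariable("tessedit_char_whitelist", allowed.c_str());
}
```

With a real API object this would be called as `restrict_to_field(api, FieldKind::DateOfBirth);` before recognising a date field, and with `FieldKind::Name` before a name field. The name whitelist above has no space, hyphen or apostrophe, so it would need extending if names on the form can contain those.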

If you have already tried this, can you give more details of why it doesn't work?
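
On the second part of the question, checking recognised text against possible words: this is a string-matching step rather than an image-processing one, so it may not need OpenCV at all. A minimal sketch, assuming you have your own list of expected words (for example known names), is to replace each recognised word with the closest entry by edit distance. The function names and the threshold of 2 below are illustrative choices, not something taken from Tesseract or OpenCV.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <vector>

// Levenshtein distance: the minimum number of single-character insertions,
// deletions and substitutions needed to turn a into b.
inline std::size_t edit_distance(const std::string& a, const std::string& b) {
    std::vector<std::size_t> prev(b.size() + 1), cur(b.size() + 1);
    for (std::size_t j = 0; j <= b.size(); ++j) prev[j] = j;
    for (std::size_t i = 1; i <= a.size(); ++i) {
        cur[0] = i;
        for (std::size_t j = 1; j <= b.size(); ++j) {
            const std::size_t subst =
                prev[j - 1] + (a[i - 1] == b[j - 1] ? 0 : 1);
            cur[j] = std::min({subst, prev[j] + 1, cur[j - 1] + 1});
        }
        std::swap(prev, cur);
    }
    return prev[b.size()];
}

// Returns the entry of known_words closest to the recognised text, provided
// it is within max_distance edits; otherwise returns the text unchanged.
// On ties, the earliest entry in known_words wins.
inline std::string closest_known_word(const std::string& recognised,
                                      const std::vector<std::string>& known_words,
                                      std::size_t max_distance = 2) {
    std::string best = recognised;
    std::size_t best_distance = max_distance + 1;
    for (const std::string& candidate : known_words) {
        const std::size_t d = edit_distance(recognised, candidate);
        if (d < best_distance) {
            best = candidate;
            best_distance = d;
        }
    }
    return best;
}
```

For instance, with a list containing "Thilanka", a recognised "Thi1anka" is one substitution away and would be corrected, while text that is far from every entry is left as it is. Whether a threshold of 2 is sensible depends on how long the words are and how similar the entries in the list are to each other.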
