Best way for tesstwo to recognize decimals and several other characters? #105

Open
uzbekbear opened this issue Jun 20, 2018 · 2 comments

@uzbekbear

Hello,
Could someone please advise what the best available training file is for tess-two to recognize integers and decimals, e.g. 100.00, 25.50, 1,250.1?
That is the top priority for my case, but I would also like it to recognize a small set of additional characters, such as '$', '€', etc.
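
Whichever training file ends up being used, one possible complement is to post-process the recognized text and keep only the tokens that look like amounts. Below is a minimal sketch in Python, assuming the OCR output is already available as a plain string. `extract_amounts`, `CURRENCY_SYMBOLS` and the pattern are hypothetical names made up for illustration, and the pattern assumes ',' as the thousands separator and '.' as the decimal point.

```python
import re
from decimal import Decimal

# Illustrative assumption: the set of currency symbols of interest.
CURRENCY_SYMBOLS = "$€£¥₴฿"

# An optional currency symbol, optional whitespace, then either a number
# with ',' thousands groups (1,250.1) or a plain number (100.00, 25).
AMOUNT_RE = re.compile(
    r"(?P<symbol>[" + re.escape(CURRENCY_SYMBOLS) + r"])?\s*"
    r"(?P<number>\d{1,3}(?:,\d{3})+(?:\.\d+)?|\d+(?:\.\d+)?)"
)


def extract_amounts(ocr_text):
    """Return (symbol, Decimal) pairs found in the OCR text.

    symbol is None when no currency symbol precedes the number.
    """
    results = []
    for match in AMOUNT_RE.finditer(ocr_text):
        # Drop thousands separators before converting to Decimal.
        number = Decimal(match.group("number").replace(",", ""))
        results.append((match.group("symbol"), number))
    return results


if __name__ == "__main__":
    print(extract_amounts("Total: $1,250.1 and 25.50 or € 100.00"))
```

This does not improve recognition itself, it only filters what the engine returns. Tesseract also has a `tessedit_char_whitelist` variable that restricts output to a given character set; whether it helps here depends on the engine version and the training data, so treat it as something to try rather than a fix.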

@jiangytcn

Same here. I would really appreciate any comments from the community.

@mbhoshen

I need a general tool for recognising international currency symbols,
for instance the Ukrainian hryvnia or the Thai baht. As far as I can tell, even the native-language models do not recognise these symbols.
For example, I saved the Thai Wikipedia page on the baht as a PDF, converted it to images with pdf2img, and ran Tesseract on it with the Thai language. The baht symbol comes out as the digit 8:

Out[274]: '18.11.2019 บาท (สกุลเงิน) - วิกิพีเดีย\n\nบาอธาว14\n\nบาท (สกุลเงิน)\n\nจากวิกิพีเดีย สารานุกรมเสรี\n\nเงินบาท (ตัวละดิน: 8ลไห : สัญลักษณ์: 8 : รหัสสากลตาม 150 4217: 7118) เป็นสกุลเงินตราประจํา -\nชาติของประเทศไทย เดิมคําว่า "บาท" เป็นหนึ่งในคําใช้เรียกหน่วยการชั่งน้าหนักของไทย ปัจจุบันยัง
I am working in Python, using Spyder from Anaconda.
Obviously, this output is unusable.
Are there any smart ideas? Perhaps somebody has trained a currency symbol model?
Thanks
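
To see which symbols a given language model loses, one option is to compare the currency symbols in a known reference text with those in the OCR output. This is only a diagnostic sketch, not a fix. `currency_symbols` and `missing_currency_symbols` are hypothetical helper names, and the reference string in the example is an assumption about what the page contains. The check relies on the Unicode general category `Sc` (currency symbol), which covers both ฿ and ₴.

```python
import unicodedata


def currency_symbols(text):
    """Return the set of characters in text with Unicode category 'Sc'."""
    return {ch for ch in text if unicodedata.category(ch) == "Sc"}


def missing_currency_symbols(expected_text, ocr_text):
    """Return currency symbols present in the reference but absent from the OCR output."""
    return currency_symbols(expected_text) - currency_symbols(ocr_text)


if __name__ == "__main__":
    # Assumed reference text versus what the OCR step returned.
    expected = "price: ฿100 or ₴50"
    recognized = "price: 8100 or ₴50"
    print(missing_currency_symbols(expected, recognized))
```

Running this over a handful of pages per language would show whether a symbol is missed consistently, which is a reasonable hint that the model was not trained on it.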
