Chinese OCR doesn't work #11729

Open
719q opened this issue Apr 29, 2024 · 1 comment

Comments

719q commented Apr 29, 2024

Does your feature request involve difficulty completing a task? Please describe.
It is difficult to manually select more than one character to look up in the dictionary; two-character words are the most problematic. Japanese had a similar issue in #4091, which was solved in #8270.

Describe the solution you'd like
Full Chinese language support similar to the Japanese one would be very welcome.

Describe alternatives you've considered
StarDict dictionary

Additional context
KOReader version: v2024.03.1
Device: Kindle PW4

719q commented May 1, 2024

Chinese support appears to work very well with EPUB files after installing a CC-CEDICT dictionary: multiple characters are selected automatically. The problem with PDF files persists, though. I updated KOReader to v2024.04, which should fix #11715, but I still can't select characters with the document language set to Chinese. I tried turning forced OCR on and toggling reflow on and off, but nothing fixes it. Should I open a new issue? If so, please close this one. I use the Tesseract 3.04 chi_sim trained data.

719q changed the title from "FR: Chinese language support" to "Chinese OCR doesn't work" on May 1, 2024