New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Tesseract 4.0_Train Katakana Characters for Japanese Language #91

Open

GunjiVijay opened this issue Mar 23, 2018 · 1 comment

GunjiVijay commented Mar 23, 2018 •

edited

We are using Tesseract-OCR 4.0 trained data. We would like to train a data file for only Katakana characters alone for Japanese language.

Kindly let us know, the steps to follow for train the Katakana characters alone using Tesseract 4.0 OCR extraction.

Appreciate your help on this earliest.

Contributor

Shreeshrii commented Mar 23, 2018

Related issue tesseract-ocr/langdata#81
Add Half-width Katakana for Japanese

@GunjiVijay Please see https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment