Tesseract 4.0_Train Katakana Characters for Japanese Language #91

Open
GunjiVijay opened this issue Mar 23, 2018 · 1 comment

GunjiVijay commented Mar 23, 2018

We are using the Tesseract-OCR 4.0 trained data. We would like to train a data file for Katakana characters only, for the Japanese language.

Kindly let us know the steps to follow to train Tesseract 4.0 on Katakana characters alone.

We would appreciate your help on this at the earliest.

@Shreeshrii
Contributor

Related issue: tesseract-ocr/langdata#81 (Add Half-width Katakana for Japanese)

@GunjiVijay Please see https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00
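
As a possible starting point for preparing Katakana-only training text, here is a minimal sketch in Python. It is not part of the Tesseract tooling; the function names `katakana_chars` and `filter_katakana_lines` are hypothetical helpers made up for illustration. It assumes you only want the letters from the Unicode Katakana block (U+30A1 to U+30FA) plus the prolonged sound mark (U+30FC), with the half-width forms (U+FF66 to U+FF9F) as an option, and that your source text is available as a list of lines.

```python
def katakana_chars(include_halfwidth=False):
    """Return Katakana characters taken from the Unicode code charts.

    Hypothetical helper for illustration; not a Tesseract API.
    """
    # Full-width letters U+30A1 (ァ) through U+30FA (ヺ).
    chars = [chr(cp) for cp in range(0x30A1, 0x30FB)]
    # Prolonged sound mark ー (U+30FC), common in Katakana words.
    chars.append(chr(0x30FC))
    if include_halfwidth:
        # Half-width forms U+FF66 (ｦ) through U+FF9F (ﾟ).
        chars.extend(chr(cp) for cp in range(0xFF66, 0xFFA0))
    return chars


def filter_katakana_lines(lines, include_halfwidth=False):
    """Keep only non-empty lines made of Katakana and whitespace."""
    allowed = set(katakana_chars(include_halfwidth))
    kept = []
    for line in lines:
        text = line.strip()
        if text and all(ch in allowed or ch.isspace() for ch in text):
            kept.append(text)
    return kept


if __name__ == "__main__":
    sample = ["カタカナ", "ひらがな", "テスト ケース"]
    for line in filter_katakana_lines(sample):
        print(line)
```

The filtered lines could then serve as candidate training text for the process described in the wiki page above. Whether punctuation, digits, or the middle dot should also be allowed depends on your documents, so treat the character set here as an assumption to adjust.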
