True_inference_with_LayoutLMv2ForTokenClassification, add support for custom OCR #128

Open
wants to merge 1 commit into
base: master
from

Conversation

amtam0

@amtam0 amtam0 commented Jun 19, 2022

Hi @NielsRogge,
Thanks for these great tutorials.
This PR adds support for custom OCR alternatives to the built-in one used in LayoutLMv2Processor.
In real-world use cases, image quality is often poor and Tesseract is not always the best tool. Alternatives with better text detectors can improve inference performance (and can also be used for fine-tuning). I kept the code modular so that Tesseract can still be used for the recognition part if needed.
Popular alternative OCR tools added in this notebook under the Inference chapter:

Let me know if any changes are needed.
Thanks
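
For context, here is a minimal sketch of how words and boxes from an external OCR engine could be passed to the processor instead of relying on its built-in Tesseract step. It is not the code from this PR. The helper names `normalize_box` and `encode_with_custom_ocr` are hypothetical, and the sketch assumes the OCR engine returns pixel-coordinate boxes as `(x0, y0, x1, y1)` and that the `no_ocr` revision of the `microsoft/layoutlmv2-base-uncased` checkpoint is used so the processor skips its own OCR.

```python
from typing import List, Sequence, Tuple

PixelBox = Tuple[float, float, float, float]  # (x0, y0, x1, y1) in pixels


def normalize_box(box: PixelBox, width: int, height: int) -> List[int]:
    """Scale a pixel box to the 0-1000 range that LayoutLMv2 expects."""
    x0, y0, x1, y1 = box
    scaled = [
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    ]
    # Clamp in case the OCR engine reports boxes slightly outside the image.
    return [min(max(v, 0), 1000) for v in scaled]


def encode_with_custom_ocr(image, words: Sequence[str], pixel_boxes: Sequence[PixelBox]):
    """Hypothetical helper: encode a PIL image plus externally produced OCR results."""
    # Imported lazily so the box helper above can be used without transformers.
    from transformers import LayoutLMv2Processor

    # Assumption: the "no_ocr" revision disables the built-in Tesseract step.
    processor = LayoutLMv2Processor.from_pretrained(
        "microsoft/layoutlmv2-base-uncased", revision="no_ocr"
    )
    width, height = image.size  # PIL images report (width, height)
    boxes = [normalize_box(b, width, height) for b in pixel_boxes]
    return processor(
        image, list(words), boxes=boxes, truncation=True, return_tensors="pt"
    )
```

With this shape, any OCR engine can be swapped in as long as its output is converted to a list of words and a matching list of pixel boxes before calling `encode_with_custom_ocr`.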

@review-notebook-app

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


