jidel/Searchable-PDF-Creator

Searchable PDF Creator

A quick proof of concept that performs OCR (optical character recognition) on images. It is a portable application: no installation or admin privileges are required. The application is a simple GUI wrapper around Tesseract and uses the compiled Tesseract binaries from UB Mannheim, version v5.0.0-alpha.20200328.

Screenshot of Searchable PDF Creator

Supported Languages

Currently, German and English are the only supported OCR languages.
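
For orientation, here is a minimal sketch of how a wrapper of this kind could hand an image to the Tesseract command line and request a searchable PDF. It is not the code from this repository: the function names, the `LANGUAGE_CODES` mapping, and the assumption that a `tesseract` executable is reachable are all illustrative. `deu` and `eng` are the standard Tesseract language codes for German and English, and the trailing `pdf` argument selects Tesseract's PDF output config.

```python
import subprocess
from pathlib import Path

# Assumed mapping from the languages named above to Tesseract language codes.
LANGUAGE_CODES = {"German": "deu", "English": "eng"}


def build_command(image, output_base, languages=("German", "English"),
                  tesseract_exe="tesseract"):
    """Return the argument list for a Tesseract run that writes <output_base>.pdf.

    tesseract_exe is hypothetical: a portable build would point it at the
    bundled binary instead of relying on PATH.
    """
    # Tesseract accepts several languages joined with "+", e.g. "deu+eng".
    lang_arg = "+".join(LANGUAGE_CODES[name] for name in languages)
    return [str(tesseract_exe), str(image), str(output_base), "-l", lang_arg, "pdf"]


def create_searchable_pdf(image, output_base, **kwargs):
    """Run Tesseract and return the path of the PDF it is expected to write."""
    # check=True raises CalledProcessError if Tesseract exits with an error.
    subprocess.run(build_command(image, output_base, **kwargs), check=True)
    return Path(f"{output_base}.pdf")
```

Calling `create_searchable_pdf("scan.png", "scan")` would, under these assumptions, run `tesseract scan.png scan -l deu+eng pdf` and produce `scan.pdf` with a text layer over the original image.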

License

The code in this repository is licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

NOTE: This software depends on other packages that may be licensed under different open source licenses.

Tesseract uses the Leptonica library, which is distributed under what is essentially a BSD 2-clause license.