An API in .NET Core to extract text from documents via OCR, using several Linux libraries
Updated Sep 5, 2018 - C#
PdfReg is a web tool that extracts text from selected regions of a PDF document.
✔️ A Python Flask API to manage PDF files.
Simple and useful automation tools built with Python modules published on PyPI.
Newspaper mining and analysis of the results using Python. The text is cleaned using OCR.
Data Center Advanced Walkthrough. Insert data from a PDF file into a MySQL database.
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
Web application for information extraction and named entity recognition for PDF files (work-in-progress).
We present Ypdf, a PDF document processing application that combines the best features of existing solutions and provides the most popular and requested functionality for free to its users.
Python library and web service based on the Poppler pdftotext utility and Tesseract OCR for extracting text from PDF documents
A collection of scripts to "help" you with your programming exams and assignments.
The code base of the front-end of nocodefunctions.com
io for nocodefunctions: csv, txt, pdf, and xlsx so far
A lightweight Python-based Software Package for daily use