textract

Here are 77 public repositories matching this topic...

aws-samples / mask-words-in-image

A tool that can mask words that match regular expression, keywords or PII (Personally Identifiable Information) in an image file.

textract rekognition comprehend

Updated May 23, 2024
Python

srcecde / aws-tutorial-code

Star

AWS tutorial code.

aws lambda tutorial cloudformation aws-lambda api-gateway dynamodb glue s3-website s3-bucket ecs amazon-web-services textract aws-lambda-python comprehend

Updated May 20, 2024
Python

aeksco / aws-pdf-textract-pipeline

Sponsor

Star

🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript

pdf aws lambda cloudformation typescript serverless jest dynamodb s3 sns webscraping textract data-pipeline cdk puppeteer aws-cdk aws-textract

Updated May 30, 2024
TypeScript

slub / textract2page

Star

Convert AWS Textract JSON to PRImA PAGE XML

python ocr textract page-xml

Updated Apr 26, 2024
Python

rhabed / aws-ai-bedrock-textract

Star

Demo for AWS Textextract and Bedrock

aws-lambda bedrock textract step-functions aws-textract aws-ai-services aws-bedrock

Updated Apr 26, 2024
Python

SanjuJssmr / OCR

Star

To extract information from an damaged image using AWS textract and Azure form recognizer (OCR) ✨💥

aws ocr azure textract ocr-recognition formrecognizer

Updated Mar 30, 2024
JavaScript

aws-samples / winform-amazon-bedrock-document-bot

Star

A conversational document bot Windows Forms desktop application that allows users to upload PDF or Word files and ask questions about their content, with the bot keeping track of the conversation history and providing contextual responses based on the whole conversation.

dotnet bedrock dotnet-core windows-desktop windows-forms textract

Updated Mar 14, 2024
C#

OzgenOzan / word-counter-py

Star

An algorithm developed for counting words from documents in Python using pandas and textract. REGex pattern is tweaked to identify Latin characters all together (such as enzyme, protein names)

python excel pandas regex-pattern textract wordcounter

Updated Mar 13, 2024
Python

este6an13 / checks-ocr

Star

Software that applies OCR + RAG to extract bank checks information

python docker ocr openai textract rag llm langchain

Updated Mar 12, 2024
Python

Mkranj / PapersCited

Star

List all unique citations in your document

python excel word python3 citations references articles textract academic-paper xlsxwriter scientific-writing chicago-style apa-style paperscited

Updated Feb 17, 2024
Python

tomnotthomas / Docu.ai

Star

Docu.ai: Document Analysis POC for Fintech Company 📈📊

react javascript aws express cloud mongodb ml full-stack textract

Updated Feb 10, 2024
JavaScript

Achanandhi-M / Amazon-Textract-flask

Star

flask aws machine-learning textract

Updated Feb 7, 2024
Python

NItesh1724 / Resume_classification_project

Star

This is my NLP project on Resume Classification in this i have performed EDA , data cleaning, Model building on various models, model evaluation and model deployment

python machine-learning numpy pandas wordcloud matplotlib textract nlp-machine-learning