Extract text from screenshot images into an Excel file

jeturgavli/ImageToText

Text Extraction and Excel Parsing from Images

This Python script extracts text from images using Tesseract OCR and organizes it into an Excel file.

Features

  • Automated Installation: Checks for required Python modules (pytesseract, openpyxl, pandas) and installs them if missing.
  • Text Extraction: Utilizes Tesseract OCR to extract text from images.
  • Data Parsing: Parses the extracted text for contact names and the times they were seen, then writes them to an Excel file.
  • Logging: Logs informative messages, warnings, and errors for better tracking and debugging.
  • User Interaction: Prompts the user for image and output folder paths, allowing for interactive usage.
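
As an illustration of the data-parsing feature, here is a minimal sketch of how contact names and times seen could be pulled out of OCR text. The repository does not document the layout of the extracted text, so this assumes each contact name sits on its own line and is followed by a line containing a clock time such as "10:45 AM". The function name parse_contacts, the TIME_RE pattern, and the column names are hypothetical and not taken from main.py.

```python
import re

# Assumption: a "time seen" line contains a clock time such as 10:45 or 9:05 PM.
TIME_RE = re.compile(r"\b\d{1,2}:\d{2}(?:\s?[AaPp][Mm])?\b")


def parse_contacts(ocr_text):
    """Pair each name line with the time line that follows it.

    Returns a list of dicts with hypothetical column names
    "Contact Name" and "Time Seen".
    """
    rows = []
    pending_name = None
    for raw_line in ocr_text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines produced by OCR
        if TIME_RE.search(line):
            # A time line closes the most recent name, if there is one.
            if pending_name is not None:
                rows.append({"Contact Name": pending_name, "Time Seen": line})
                pending_name = None
        else:
            # Any other non-empty line is treated as a candidate name.
            pending_name = line
    return rows
```

A list of rows in this shape can be passed straight to pandas.DataFrame(rows) before writing to Excel. Real screenshots will likely need a pattern tuned to their actual layout.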

Usage

  1. Ensure Python is installed.
  2. Install Tesseract OCR and make sure the tesseract executable is on your PATH.
  3. Clone or download the repository.
  4. Place images to be processed in the images folder.
  5. Run the script (main.py).
  6. Follow the prompts to input image and output folder paths.
  7. View the generated Excel files in the output folder.
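
The steps above can be sketched roughly as follows. This is not the code in main.py. It is a simplified outline, assuming one Excel file is written per image and that every non-empty OCR line becomes one row. The names list_images, image_to_excel, main, and the "Text" column are hypothetical. The OCR and Excel libraries are imported inside image_to_excel so that the folder-listing helper works even before they are installed.

```python
import logging
from pathlib import Path

logging.basicConfig(level=logging.INFO,
                    format="%(asctime)s %(levelname)s %(message)s")
log = logging.getLogger("image_to_text")

# Assumption: only these image types are processed.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def list_images(folder):
    """Return the image files directly inside `folder`, sorted by path."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )


def image_to_excel(image_path, output_folder):
    """OCR one image and write its non-empty text lines to an .xlsx file."""
    # Imported lazily: these require the dependencies listed in this README.
    import pandas as pd
    import pytesseract
    from PIL import Image

    text = pytesseract.image_to_string(Image.open(image_path))
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    out_path = Path(output_folder) / (Path(image_path).stem + ".xlsx")
    pd.DataFrame({"Text": lines}).to_excel(out_path, index=False)
    return out_path


def main():
    """Prompt for both folders, then process every image found."""
    image_folder = input("Image folder path: ").strip()
    output_folder = input("Output folder path: ").strip()
    Path(output_folder).mkdir(parents=True, exist_ok=True)
    for image_path in list_images(image_folder):
        try:
            log.info("Wrote %s", image_to_excel(image_path, output_folder))
        except Exception:
            # Log the full traceback and continue with the next image.
            log.exception("Failed to process %s", image_path)
```

Calling main() starts the interactive prompts. A failure on one image is logged and does not stop the rest of the batch.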

Dependencies

  • Python 3.x
  • Tesseract OCR
  • Required Python modules: pytesseract, openpyxl, pandas
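
A quick way to check these dependencies before running the script is sketched below. It is an illustration only and may differ from how main.py performs its automated installation. The helper names missing_modules, install_modules, and tesseract_available are hypothetical. Installing through pip in a subprocess is an assumption about the approach.

```python
import importlib.util
import shutil
import subprocess
import sys

REQUIRED_MODULES = ["pytesseract", "openpyxl", "pandas"]


def missing_modules(names):
    """Return the module names that cannot be found by the import system."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def install_modules(names):
    """Install packages with pip, using the interpreter that runs this script."""
    if names:
        subprocess.check_call([sys.executable, "-m", "pip", "install", *names])


def tesseract_available():
    """Report whether a `tesseract` executable is on PATH."""
    return shutil.which("tesseract") is not None
```

For example, install_modules(missing_modules(REQUIRED_MODULES)) installs only what is absent. Tesseract itself is a separate program and cannot be installed through pip, which is why it gets its own check.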

Author

LAKSHMI

Contribution

JETUR GAVLI

License

This project is licensed under the MIT License.
