Skip to content

This project is a Streamlit application that uses the Gemini Pro Vision model from Google's Generative AI to extract information from invoice images. It allows users to upload an invoice image and enter a prompt, and the application will generate a response based on the information present in the image.

Notifications You must be signed in to change notification settings

shrimantasatpati/invoice_extraction_using_gemini_streamlit_app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Invoice Extraction Using Gemini Streamlit App

This project is a Streamlit application that uses the Gemini Pro Vision model from Google's Generative AI to extract information from invoice images. It allows users to upload an invoice image and enter a prompt, and the application will generate a response based on the information present in the image.

Prerequisites

Before running the application, make sure you have the following prerequisites installed:

  • Python 3.11
  • Pip (Python package installer)

Installation

  1. Clone the repository:
git clone https://github.com/your-username/invoice_extraction_using_gemini_streamlit_app.git
  1. Navigate to the project directory:
cd invoice_extraction_using_gemini_streamlit_app
  1. Create a virtual environment (recommended):
python -m venv venv
  1. Activate the virtual environment (on windows):
venv\Scripts\activate
  1. Install the required dependencies:
pip install -r requirements.txt

Usage:

Run the Streamlit application:

streamlit run app.py
  • The application will open in your default web browser.
  • Upload an invoice image by clicking the "Choose an image" button.
  • Enter a prompt in the text input field related to the information you want to extract from the invoice.
  • Click the "Submit" button to generate the response based on the uploaded image and the provided prompt.
  • The generated response will be displayed in the application.

Contributing

Contributions to this project are welcome. If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.

License

This project is licensed under the MIT License.

About

This project is a Streamlit application that uses the Gemini Pro Vision model from Google's Generative AI to extract information from invoice images. It allows users to upload an invoice image and enter a prompt, and the application will generate a response based on the information present in the image.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages