Universal-NER PDF Analysis

Description

This Streamlit-based web application is designed to process PDF documents for Named Entity Recognition (NER) tasks. It allows users to upload PDF files, from which the application extracts text, images, and tables. The extracted text is then used to identify entities based on a user-specified entity type (e.g., 'Person', 'Location').

Screenshots

Features

PDF Upload: Users can upload PDF documents to be processed.
Entity Type Specification: Users can specify the type of entity they're looking for.
Text Extraction: The application extracts and displays the text from the uploaded PDF.
Image Extraction: Any images in the PDF are saved and can be displayed or further processed.
Table Extraction: The application is capable of extracting tables from the PDF.
Entity Recognition: The extracted text is processed to identify entities of interest.

Installation

To set up the project, you need to have Python installed on your system. Follow these steps:

Clone the repository to your local machine.
Navigate to the project directory and install the required dependencies:
```
pip install -r requirements.txt
```
```
   streamlit run src/app.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
scraped_data		scraped_data
src		src
.gitignore		.gitignore
Paper Review_UniversalNER_BoutainaELYAZIJI.pdf		Paper Review_UniversalNER_BoutainaELYAZIJI.pdf
README.md		README.md
Universal-NER PDF Analysis.png		Universal-NER PDF Analysis.png
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scraped_data

scraped_data

src

src

.gitignore

.gitignore

Paper Review_UniversalNER_BoutainaELYAZIJI.pdf

Paper Review_UniversalNER_BoutainaELYAZIJI.pdf

README.md

README.md

Universal-NER PDF Analysis.png

Universal-NER PDF Analysis.png

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

Universal-NER PDF Analysis

Description

Screenshots

Features

Installation

About

Releases

Packages

Languages

BoutainaELYAZIJI/Universal-NER

Folders and files

Latest commit

History

Repository files navigation

Universal-NER PDF Analysis

Description

Screenshots

Features

Installation

About

Topics

Resources

Stars

Watchers

Forks

Languages