Skip to content

Compares PDF documents and visualizes similarity using graph. Documents are represented as TF-IDF vector and their similarity is based on cosinus similarity. Visualization is done using Python's library Dash.

License

jaskier07/DocumentComparator

Repository files navigation

Needed modules:
pip install pdfminer
pip install nltk
pip install numpy
pip install sklearn

About

Compares PDF documents and visualizes similarity using graph. Documents are represented as TF-IDF vector and their similarity is based on cosinus similarity. Visualization is done using Python's library Dash.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published