Skip to content
/ pcu_pdf Public

PDF parser component (Apache Tika) for PCU project

License

Notifications You must be signed in to change notification settings

zevio/pcu_pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pcu_pdf (Apache Tika parser for PCU project)

PDF parser component (Apache Tika) for PCU project. From the path of a PDF file, get its textual content.

Based on Apache Tika.

pdf


Check PCU project.

Usage in another project

If you wish to import this module in another Python project, please install it :

pip install pcu-pdf

Then, add this import line at the beginning of your Python file :

from pcu_pdf import pcu_pdf

You can now use pcu_pdf's functions, for example :

pcu_pdf.PDFParser("path/to/pdf/file")

Test

To test your installation, go to pcu_pdf/ directory and execute the Makefile with the following command line :

make test