
After running parsr.get_text(), where are the text files saved? #664

Open
maxloosmu opened this issue Apr 14, 2023 · 0 comments


maxloosmu commented Apr 14, 2023

There are only 3 files in my directory for testing Parsr:

test.py
defaultConfig.json
sample.pdf 

This is my test.py code:

import os
from parsr_client import ParsrClient
parsr = ParsrClient('localhost:3001')
input_file = 'sample.pdf'
parsr.send_document(
    file_path=input_file,
    config_path='defaultConfig.json',
    document_name='sample',
    save_request_id=True)
parsr.get_text()

I copied defaultConfig.json from https://github.com/axa-group/Parsr/blob/master/server/defaultConfig.json.

Most of my test.py code is copied from https://pypi.org/project/parsr-client/.

First, I started Docker in one terminal window: docker run -p 3001:3001 axarev/parsr
Then I ran this command in another terminal window: python3 test.py

The Docker terminal window printed the output that I've copied into the attached file: docker output.txt

However, I do not know where to find the output text file created after PDF text extraction on my Mac. Could someone enlighten me?

docker output.txt
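
A possible explanation, offered as a sketch rather than a confirmed answer: the test.py above never writes anything to disk, and as far as I can tell parsr.get_text() returns the extracted text to the caller instead of saving a file. If that assumption holds, the return value has to be captured and written out explicitly. The helper names save_extracted_text and extract_and_save below are hypothetical, as is the output name sample.txt. The server address, file names, and send_document arguments are taken from the script above.

```python
from pathlib import Path


def save_extracted_text(text, out_path="sample.txt"):
    """Write text to out_path and return the absolute path of the file."""
    if not isinstance(text, str):
        # Assumption: a non-string result means the server did not return text
        # (for example, the document is still being processed).
        raise TypeError(f"expected extracted text as str, got {type(text).__name__}")
    path = Path(out_path)
    path.write_text(text, encoding="utf-8")
    return path.resolve()


def extract_and_save(pdf_path="sample.pdf",
                     config_path="defaultConfig.json",
                     out_path="sample.txt"):
    """Send a PDF to a local Parsr server and save the returned text.

    Requires the parsr-client package and a server on localhost:3001.
    """
    # Imported here so the helper above stays usable without parsr-client.
    from parsr_client import ParsrClient

    parsr = ParsrClient("localhost:3001")
    parsr.send_document(
        file_path=pdf_path,
        config_path=config_path,
        document_name="sample",
        save_request_id=True,
    )
    # Assumption: get_text() returns the extracted text as a string.
    text = parsr.get_text()
    return save_extracted_text(text, out_path)
```

Calling print(extract_and_save()) from the directory that holds sample.pdf and defaultConfig.json should then print the full path of the written file. A relative out_path is resolved against the directory the script is run from, so the file would end up next to test.py in that case.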
