[Bug]: OCRmyPDF Docker Hot Folder Option OCR_ON_SUCCESS_ARCHIVE OCR_ON_SUCCESS_DELETE doesnt work #1263

Open
mazi19 opened this issue Feb 27, 2024 · 0 comments

Describe the bug

After a PDF document is OCRed, the original file is neither moved to the archive nor deleted, even when the OCR_ON_SUCCESS_ARCHIVE or OCR_ON_SUCCESS_DELETE option is enabled. The problem occurs only in version v16.1.1.

Steps to reproduce

1. Run OCRmyPDF v16.1.1 as a Docker container using Docker Compose, with the image jbarlow83/ocrmypdf:v16.1.1 and the option OCR_ON_SUCCESS_ARCHIVE=1
2. Copy some files to the input location
3. The PDFs are OCRed and placed in the output location
4. The original files are not archived or deleted
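
For reference, the behavior I expected after a successful OCR run can be sketched roughly as below. This is only an illustration of the expectation, not the code in watcher.py: the function name `handle_success` and its parameters are hypothetical, and giving archiving priority over deletion when both are set is an assumption.

```python
import shutil
from pathlib import Path


def handle_success(input_file: Path, archive_dir: Path,
                   archive: bool, delete: bool) -> str:
    """Hypothetical post-OCR step: archive or delete the original input file."""
    if archive:
        # Make sure the archive folder exists before moving into it.
        archive_dir.mkdir(parents=True, exist_ok=True)
        # Move the original out of the input folder into the archive folder.
        shutil.move(str(input_file), str(archive_dir / input_file.name))
        return "archived"
    if delete:
        # Remove the original from the input folder.
        input_file.unlink()
        return "deleted"
    # Neither option enabled: leave the original where it is.
    return "kept"
```

With OCR_ON_SUCCESS_ARCHIVE=1, step 4 should therefore end with the original file in the archive folder and no longer in the input folder.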

Files

version: "3.3"
services:
  ocrmypdf:
    restart: always
    container_name: ocrmypdf
    image: jbarlow83/ocrmypdf:v16.1.1
    volumes:
      - "./input/:/input"
      - "./output/:/output"
      - "./backup/:/processed"
    environment:
      - OCR_OUTPUT_DIRECTORY_YEAR_MONTH=0
      - OCR_ON_SUCCESS_ARCHIVE=1
      - OCR_ON_SUCCESS_DELETE=0
      - OCR_USE_POLLING=1
      - 'OCR_JSON_SETTINGS={"force-ocr": true, "clean": true, "deskew": true}'
    user: "root:root"
    entrypoint: python3
    command:
      - watcher.py
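
To rule out a misread value, the flags from the environment section can be checked inside the container with something like the sketch below. The helper `env_flag` is hypothetical, and treating `1`, `true`, `yes`, and `on` as enabled is an assumption about how such flags are commonly interpreted, not a description of what watcher.py does.

```python
import os
from typing import Mapping


def env_flag(name: str, default: bool = False,
             environ: Mapping[str, str] = os.environ) -> bool:
    """Hypothetical helper: interpret an environment variable as a boolean."""
    value = environ.get(name)
    if value is None:
        # Variable not set at all: fall back to the default.
        return default
    # Assumed convention: these spellings mean "enabled", anything else "disabled".
    return value.strip().lower() in {"1", "true", "yes", "on"}


if __name__ == "__main__":
    for flag in ("OCR_ON_SUCCESS_ARCHIVE", "OCR_ON_SUCCESS_DELETE"):
        print(flag, "=", env_flag(flag))
```

Under that assumption, the configuration above should report the archive flag as enabled and the delete flag as disabled.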

How did you download and install the software?

No response

OCRmyPDF version

16.1.2.dev0+g59d5b0d1.d20240216

Relevant log output

New file: /input/z000000000000B51D6E5.pdf. Waiting until fully written...
Attempting to OCRmyPDF to: /output/z000000000000B51D6E5.pdf
Scanning contents ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 1/1 0:00:00
OCR ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 1/1 0:00:00
Postprocessing...
Some input metadata could not be copied because it is not permitted in PDF/A. You may wish to examine the output PDF's XMP metadata.
Recompressing JPEGs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0% 0/0 -:--:--
Deflating JPEGs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 1/1 0:00:00
JBIG2 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0% 0/0 -:--:--
Image optimization ratio: 1.29 savings: 22.4%
Total file size ratio: 1.29 savings: 22.4%
Output file is a PDF/A-2B (as expected)
OCR is done

New file: /input/z000000000000B51D6FF.pdf. Waiting until fully written...
Attempting to OCRmyPDF to: /output/z000000000000B51D6FF.pdf
Scanning contents ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 2/2 0:00:00
Start processing 2 pages concurrently
OCR ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 2/2 0:00:00
Postprocessing...
Some input metadata could not be copied because it is not permitted in PDF/A. You may wish to examine the output PDF's XMP metadata.
Recompressing JPEGs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0% 0/0 -:--:--
Deflating JPEGs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 2/2 0:00:00
JBIG2 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 0% 0/0 -:--:--
Image optimization ratio: 1.07 savings: 6.4%
Total file size ratio: 1.06 savings: 5.7%
Output file is a PDF/A-2B (as expected)
OCR is done
