You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
The .txt file only shows arrows without any text presence.
To Reproduce
Steps to reproduce the behavior:
Use this code:
import glob
import textract
file=glob.glob(r'path/to/retrieve/file.extension')
for file_path in file:
text=textract.process(file_path)
with open(f'{file_path[:-4]}.txt', 'w') as file:
file.write(text)
Expected behavior
The text from file should be showing up.
Screenshots
Desktop (please complete the following information):
OS: Windows 10
Textract version 1.6.5
Python version 3.10
Virtual environment No
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered:
Takip31
changed the title
Text can't be extracted from scanned PDF
Text can't be extracted from scanned PDF, jpg and png.
Nov 3, 2022
Describe the bug
The .txt file only shows arrows without any text presence.
To Reproduce
Steps to reproduce the behavior:
Use this code:
import glob
import textract
file=glob.glob(r'path/to/retrieve/file.extension')
for file_path in file:
text=textract.process(file_path)
with open(f'{file_path[:-4]}.txt', 'w') as file:
file.write(text)
Expected behavior
The text from file should be showing up.
Screenshots
Desktop (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: