-
Notifications
You must be signed in to change notification settings - Fork 550
Issues: Unstructured-IO/unstructured
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
bug/<ocr-agent> call PartitionPdf error: no ocr_agent found
bug
Something isn't working
#3202
opened Jun 13, 2024 by
LiangZeFenglzf
feat/merge_tables_on_different pages
enhancement
New feature or request
pdf
#3198
opened Jun 12, 2024 by
tanzeel291994
feat/less strict Python version
enhancement
New feature or request
#3193
opened Jun 12, 2024 by
egeres
bug/unstructured.paddleocr is not compatible with GPU version of PaddleOCR
bug
Something isn't working
#3191
opened Jun 12, 2024 by
peixin-lin
Suggestion: include consolidated bounding box coordinates in chunk metadata when using "by_title" chunking strategy
chunking
Related to element chunking.
enhancement
New feature or request
#3194
opened Jun 11, 2024 by
NikitaKemarskiy
bug/pdf extraction error when strategy not set
awaiting-response
bug
Something isn't working
#3187
opened Jun 11, 2024 by
pk-lit
feat/table element coordinates
enhancement
New feature or request
pdf
#3175
opened Jun 10, 2024 by
naunidh-tetrix
Bump to Issues related to the Ingest CLI or unstructured.ingest modules
deltalake>=0.18.x
ingest
#3173
opened Jun 10, 2024 by
MthwRobinson
Add ability to pass pipeline param to Elasticsearch connector
enhancement
New feature or request
needs follow up
#3166
opened Jun 7, 2024 by
aag6z
feat/skip ocr for certain element types
enhancement
New feature or request
ocr
Related to optical character recognition (OCR).
pdf
#3163
opened Jun 7, 2024 by
beez2022
bug/language specification does not work for PaddleOCR agent
bug
Something isn't working
ocr
Related to optical character recognition (OCR).
pdf
#3159
opened Jun 6, 2024 by
peixin-lin
LangChain + Unstructured: Failed to load file ${filePath} using unstructured loader.
#3158
opened Jun 6, 2024 by
ajaykrupalk
Salesforce/ source connector - Not able to ingest salesforce files
enhancement
New feature or request
ingest
Issues related to the Ingest CLI or unstructured.ingest modules
#3153
opened Jun 5, 2024 by
mogith-pn
feat/Excluding Specific Types
enhancement
New feature or request
#3149
opened Jun 4, 2024 by
tevfikcagridural
bug/HTMLTitle doesn't have Something isn't working
html
type
attribute
bug
#3144
opened Jun 3, 2024 by
FanaHOVA
bug/combineUnderNChars not working properly
bug
Something isn't working
#3138
opened Jun 3, 2024 by
leSullivan
feat/Allow max-pages/max-total-characters that should be parsed
enhancement
New feature or request
#3137
opened Jun 2, 2024 by
abdullahbaa5
bug/docker images at quay.io not up to date
awaiting-response
bug
Something isn't working
docker
Issues related to unstructured docker images
#3123
opened May 30, 2024 by
jpabbuehl
bug/partition_html ouputs different results with different args
awaiting-response
bug
Something isn't working
html
#3116
opened May 30, 2024 by
KMayank29
partition_doc
fails the first time it is run in the AMD64 container
bug
#3105
opened May 28, 2024 by
MthwRobinson
DOCX doesn't recognize listitems within textbox
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#3103
opened May 28, 2024 by
veredmm
bug/PIL.UnidentifiedImageError: cannot identify image file
bug
Something isn't working
needs follow up
pdf
#3102
opened May 26, 2024 by
udit-pandey-1
unstructured-ingest s3 command causes Fsspec.Downloader.download_config.download_dir to be None
bug
Something isn't working
ingest
Issues related to the Ingest CLI or unstructured.ingest modules
#3101
opened May 26, 2024 by
tuvalusoftware
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.