update unstructured-client and lxml requirements #2941

Closed
wants to merge 4 commits into from
16 changes: 16 additions & 0 deletions CHANGELOG.md
@@ -1,3 +1,19 @@
+## 0.13.5-dev0
+
+### Enhancements
+
+### Features
+
+### Fixes
+
+## 0.13.4
+
+### Enhancements
+
+### Features
+
+### Fixes
+
 ## 0.13.4
 
 ### Enhancements
21 changes: 12 additions & 9 deletions requirements/base.txt
@@ -22,8 +22,10 @@ charset-normalizer==3.3.2
 click==8.1.7
 # via nltk
 dataclasses-json==0.6.4
-# via -r ./base.in
-dataclasses-json-speakeasy==0.5.11
+# via
+# -r ./base.in
+# unstructured-client
+deepdiff==7.0.1
 # via unstructured-client
 emoji==2.11.1
 # via -r ./base.in
@@ -39,14 +41,11 @@ jsonpath-python==1.0.6
 # via unstructured-client
 langdetect==1.0.9
 # via -r ./base.in
-lxml==4.9.4
-# via
-# -c ././deps/constraints.txt
-# -r ./base.in
+lxml==5.2.1
+# via -r ./base.in
 marshmallow==3.21.1
 # via
 # dataclasses-json
-# dataclasses-json-speakeasy
 # unstructured-client
 mypy-extensions==1.0.0
 # via
@@ -56,11 +55,15 @@ nltk==3.8.1
 # via -r ./base.in
 numpy==1.26.4
 # via -r ./base.in
+ordered-set==4.1.0
+# via deepdiff
 packaging==23.2
 # via
 # -c ././deps/constraints.txt
 # marshmallow
 # unstructured-client
+pypdf==4.2.0
+# via unstructured-client
 python-dateutil==2.9.0.post0
 # via unstructured-client
 python-iso639==2024.2.7
@@ -89,14 +92,14 @@ tqdm==4.66.2
 typing-extensions==4.11.0
 # via
 # -r ./base.in
+# pypdf
 # typing-inspect
 # unstructured-client
 typing-inspect==0.9.0
 # via
 # dataclasses-json
-# dataclasses-json-speakeasy
 # unstructured-client
-unstructured-client==0.18.0
+unstructured-client==0.22.0
 # via
 # -c ././deps/constraints.txt
 # -r ./base.in
7 changes: 2 additions & 5 deletions requirements/deps/constraints.txt
@@ -48,8 +48,5 @@ urllib3<1.27
 # TODO: Constriant due to aiobotocore, remove when that gets updates:
 botocore<1.34.52
 
-# TODO: constraint due to current release of pikepdf (v8.14.0), remove once next version releases since fix is on main
-lxml<5
-
-# NOTE(jennings): pinned due to later versions not supporting api_key_auth in UnstructuredClient
-unstructured-client<=0.18.0
+# NOTE(jennings): pinned due to older versions not supporting api_key_auth in UnstructuredClient
+unstructured-client>=0.15.0
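
The effect of the constraint changes above can be sanity-checked against the new pins (`lxml==5.2.1`, `unstructured-client==0.22.0`). The sketch below is a simplified stand-in for pip's specifier matching, not pip's actual implementation: it assumes purely numeric dotted versions and a single comparison operator per constraint, and the helper names `parse_version` and `satisfies` are hypothetical.

```python
import operator

# Operators supported by this simplified sketch. Two-character operators come
# first so that "<=" is matched before "<".
_OPS = {
    "<=": operator.le,
    ">=": operator.ge,
    "==": operator.eq,
    "<": operator.lt,
    ">": operator.gt,
}


def parse_version(text):
    """Turn a purely numeric dotted version such as '5.2.1' into (5, 2, 1)."""
    return tuple(int(part) for part in text.strip().split("."))


def satisfies(version, spec):
    """Return True if `version` meets a single constraint such as '<5'."""
    for symbol, compare in _OPS.items():
        if spec.startswith(symbol):
            bound = parse_version(spec[len(symbol):])
            candidate = parse_version(version)
            # Pad with zeros so that (5,) and (5, 0, 0) compare as equal.
            width = max(len(bound), len(candidate))
            bound += (0,) * (width - len(bound))
            candidate += (0,) * (width - len(candidate))
            return compare(candidate, bound)
    raise ValueError(f"unsupported constraint: {spec!r}")


# Old constraints versus the versions pinned in this change.
print(satisfies("5.2.1", "<5"))          # removed lxml constraint
print(satisfies("0.22.0", "<=0.18.0"))   # removed unstructured-client constraint
print(satisfies("0.22.0", ">=0.15.0"))   # new unstructured-client constraint
```

Under these assumptions, the removed constraints reject the new pins while the new lower bound accepts both the old `0.18.0` and the new `0.22.0` release, which is why the `-c ././deps/constraints.txt` annotation disappears from the `lxml` entries in the compiled files.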
2 changes: 1 addition & 1 deletion requirements/dev.txt
@@ -181,7 +181,7 @@ jupyter-server==2.14.0
 # notebook-shim
 jupyter-server-terminals==0.5.3
 # via jupyter-server
-jupyterlab==4.1.6
+jupyterlab==4.1.7
 # via notebook
 jupyterlab-pygments==0.3.0
 # via nbconvert
3 changes: 1 addition & 2 deletions requirements/extra-docx.txt
@@ -4,9 +4,8 @@
 #
 # pip-compile ./extra-docx.in
 #
-lxml==4.9.4
+lxml==5.2.1
 # via
-# -c ././deps/constraints.txt
 # -c ./base.txt
 # python-docx
 python-docx==1.1.0
3 changes: 1 addition & 2 deletions requirements/extra-odt.txt
@@ -4,9 +4,8 @@
 #
 # pip-compile ./extra-odt.in
 #
-lxml==4.9.4
+lxml==5.2.1
 # via
-# -c ././deps/constraints.txt
 # -c ./base.txt
 # python-docx
 pypandoc==1.13
3 changes: 1 addition & 2 deletions requirements/extra-paddleocr.txt
@@ -77,9 +77,8 @@ lazy-loader==0.4
 # via scikit-image
 lmdb==1.4.1
 # via unstructured-paddleocr
-lxml==4.9.4
+lxml==5.2.1
 # via
-# -c ././deps/constraints.txt
 # -c ./base.txt
 # premailer
 # unstructured-paddleocr
7 changes: 4 additions & 3 deletions requirements/extra-pdf-image.txt
@@ -85,9 +85,8 @@ kiwisolver==1.4.5
 # via matplotlib
 layoutparser[layoutmodels,tesseract]==0.3.4
 # via unstructured-inference
-lxml==4.9.4
+lxml==5.2.1
 # via
-# -c ././deps/constraints.txt
 # -c ./base.txt
 # pikepdf
 markupsafe==2.1.5
@@ -198,7 +197,9 @@ pyparsing==3.0.9
 # -c ././deps/constraints.txt
 # matplotlib
 pypdf==4.2.0
-# via -r ./extra-pdf-image.in
+# via
+# -c ./base.txt
+# -r ./extra-pdf-image.in
 pypdfium2==4.29.0
 # via pdfplumber
 pytesseract==0.3.10
6 changes: 2 additions & 4 deletions requirements/extra-pptx.txt
@@ -4,10 +4,8 @@
 #
 # pip-compile ./extra-pptx.in
 #
-lxml==4.9.4
-# via
-# -c ././deps/constraints.txt
-# python-pptx
+lxml==5.2.1
+# via python-pptx
 pillow==10.3.0
 # via python-pptx
 python-pptx==0.6.23
2 changes: 1 addition & 1 deletion requirements/ingest/embed-vertexai.txt
@@ -115,7 +115,7 @@ langchain-core==0.1.46
 # langchain-community
 # langchain-google-vertexai
 # langchain-text-splitters
-langchain-google-vertexai==1.0.1
+langchain-google-vertexai==1.0.2
 # via -r ./ingest/embed-vertexai.in
 langchain-text-splitters==0.0.1
 # via langchain
3 changes: 1 addition & 2 deletions requirements/ingest/salesforce.txt
@@ -25,10 +25,9 @@ idna==3.7
 # requests
 isodate==0.6.1
 # via zeep
-lxml==4.9.4
+lxml==5.2.1
 # via
 # -c ./ingest/../base.txt
-# -c ./ingest/../deps/constraints.txt
 # zeep
 more-itertools==10.2.0
 # via simple-salesforce
5 changes: 2 additions & 3 deletions requirements/test.txt
@@ -52,9 +52,8 @@ label-studio-sdk==0.0.32
 # via -r ./test.in
 label-studio-tools==0.0.3
 # via label-studio-sdk
-lxml==4.9.4
+lxml==5.2.1
 # via
-# -c ././deps/constraints.txt
 # -c ./base.txt
 # label-studio-sdk
 # label-studio-tools
@@ -97,7 +96,7 @@ pyflakes==3.2.0
 # via
 # autoflake
 # flake8
-pytest==8.1.1
+pytest==8.1.2
 # via
 # pytest-cov
 # pytest-mock
2 changes: 1 addition & 1 deletion unstructured/__version__.py
@@ -1 +1 @@
-__version__ = "0.13.4" # pragma: no cover
+__version__ = "0.13.5-dev0" # pragma: no cover
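
The new version string carries a `-dev0` suffix, which PEP 440 treats as a development release (its normalized spelling is `0.13.5.dev0`). As a minimal sketch of telling a dev version from a final one, the code below assumes only the two shapes seen in this diff: a numeric release with an optional dev marker. The helper names `split_version` and `is_dev_release` are hypothetical, and the regex is far narrower than full PEP 440.

```python
import re

# Simplified pattern: a numeric dotted release, optionally followed by a dev
# marker written with "-", "." or "_" as separator, or with no separator.
_VERSION_RE = re.compile(
    r"^(?P<release>\d+(?:\.\d+)*)(?:[-._]?dev(?P<dev>\d+))?$"
)


def split_version(version):
    """Return (release_tuple, dev_number_or_None) for a version string."""
    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    release = tuple(int(part) for part in match["release"].split("."))
    dev = match["dev"]
    return release, (int(dev) if dev is not None else None)


def is_dev_release(version):
    """True when the version carries a dev marker such as '-dev0'."""
    return split_version(version)[1] is not None


for candidate in ("0.13.4", "0.13.5-dev0"):
    print(candidate, split_version(candidate), is_dev_release(candidate))
```

A check along these lines could guard a release script against publishing a `-dev` version, though that use is an illustration rather than something this change set does.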