Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implementation of BlobLoader with S3 #18

Open
eyurtsev opened this issue Apr 17, 2024 · 0 comments
Open

Implementation of BlobLoader with S3 #18

eyurtsev opened this issue Apr 17, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@eyurtsev
Copy link

eyurtsev commented Apr 17, 2024

An implementation of BlobLoader with S3 would be fantastic. It'll allow users to easily hook their s3 data to langchain parsers and indexing code.

Here's an explanation about the abstractions:
https://python.langchain.com/docs/modules/data_connection/document_loaders/custom/#blob-loaders

The minimal feature set for this to be useful:

  1. filters by keys matching a given prefix
  2. filter for last_modified_date
  3. filter for max file size
  4. filter for file extension
  5. filter for min file size

Additional features:
7. if available filter for file mimetype

@eyurtsev eyurtsev added the enhancement New feature or request label Apr 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant