Clarify orig_elements
documentation
#2929
Comments
I have figured it out.
@kaleyroy you could also try this solution from an earlier conversation: As far as I understand data pipelines, I don't see much reason to dump the objects into JSON files. I would much rather recommend processing the data while everything is loaded into memory and only dumping the final output into a JSON file.
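To illustrate the suggestion above, here is a minimal sketch that keeps everything in memory and writes JSON only once, at the end. It uses plain dicts and the standard library only; `process_in_memory`, `run_pipeline`, the record fields, and the output file name are hypothetical and are not part of unstructured.

```python
import json
import tempfile
from pathlib import Path


def process_in_memory(records):
    """Hypothetical processing step: drop blank records and collapse whitespace."""
    return [
        {"type": r["type"], "text": " ".join(r["text"].split())}
        for r in records
        if r["text"].strip()
    ]


def run_pipeline(records, output_path):
    """Do all work on in-memory objects, then dump only the final result."""
    processed = process_in_memory(records)
    output_path.write_text(json.dumps(processed, indent=2), encoding="utf-8")
    return processed


# Illustrative input; in a real pipeline these would be the partitioned elements.
records = [
    {"type": "Title", "text": "  Quarterly   report "},
    {"type": "NarrativeText", "text": "   "},
    {"type": "NarrativeText", "text": "Revenue grew\nthis quarter."},
]

with tempfile.TemporaryDirectory() as tmp_dir:
    output_path = Path(tmp_dir) / "final_output.json"
    processed = run_pipeline(records, output_path)
    # Read the file back once to confirm the final output is ordinary JSON.
    reloaded = json.loads(output_path.read_text(encoding="utf-8"))
```

The intermediate results never touch disk, so there is nothing to rehydrate between steps.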
Good call, we'll clarify this in the documentation. In the meantime, if you want to "rehydrate" elements in JSON form into in-memory objects, all you need to do is:

from unstructured.documents.elements import Element
from unstructured.staging.base import elements_from_json

elements: list[Element] = elements_from_json(path_to_json_file)
The current docs do not specify that the elements are not dumped as JSON objects into the JSON file.
It would be clearer if you gave an example of the serialization behavior.
Thanks in advance!
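For readers landing here, a rough sketch of the kind of serialization behavior being asked about. It assumes that `orig_elements` is written as a single compressed, base64-encoded string instead of nested JSON objects; that is an assumption, not something confirmed in this thread. The helpers `pack_elements` and `unpack_elements` are hypothetical stand-ins built on the standard library, not the library's own functions.

```python
import base64
import json
import zlib


def pack_elements(element_dicts):
    """Hypothetical helper: JSON -> zlib-compressed bytes -> base64 text."""
    json_bytes = json.dumps(element_dicts, sort_keys=True).encode("utf-8")
    return base64.b64encode(zlib.compress(json_bytes)).decode("utf-8")


def unpack_elements(packed):
    """Hypothetical helper: reverse of pack_elements."""
    json_bytes = zlib.decompress(base64.b64decode(packed))
    return json.loads(json_bytes.decode("utf-8"))


# Illustrative original elements that a chunk was assembled from.
orig_elements = [
    {"type": "Title", "text": "Intro"},
    {"type": "NarrativeText", "text": "First paragraph."},
]

# Assumption: the chunk's metadata holds the originals as one opaque string.
chunk = {
    "type": "CompositeElement",
    "text": "Intro\n\nFirst paragraph.",
    "metadata": {"orig_elements": pack_elements(orig_elements)},
}

serialized = json.dumps(chunk)
packed_field = json.loads(serialized)["metadata"]["orig_elements"]
restored = unpack_elements(packed_field)
```

Under this assumption, anyone reading the JSON file directly would see a string in that field and would need a decoding step to get the original elements back.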