Question regarding Image Parser #11794

jaybfn · 2024-03-09T22:26:25Z

jaybfn
Mar 9, 2024

Hello All, I am using this code "https://github.com/run-llama/llama_parse/blob/main/examples/demo_json.ipynb" to recreate the results on my document, I am facing a problem with this code when I pass my PDF doc, it extract only some of the images and the rest it leaves it as blank json, can It be due to images being vector image (.svg) format?

@jaybfn · 2024-03-09T22:26:44Z

dosubot[bot]
bot Mar 9, 2024

Hey there, @jaybfn! Great to see you back diving into more adventures with code. How's everything going?

Yes, your assumption is correct. The issue you're facing could indeed be due to the images being in vector image (.svg) format. The code you're using utilizes the PIL (Python Imaging Library) library, specifically the Image.open() method, to open and process images. PIL, or its successor Pillow, primarily supports raster file formats such as JPEG, PNG, BMP, GIF, and others. It does not natively support vector graphics formats like SVG.

When the code attempts to open an image using Image.open(img_path), if the image is in a format that PIL/Pillow does not support (such as SVG), it will not be able to process the image correctly. This could result in the image being skipped or an error being thrown, depending on how the code is structured to handle such cases. This limitation is consistent with the behavior you're observing, where only some images are extracted, and the rest result in blank JSON entries, likely because those images are in a format that the code's image processing library cannot handle.

As for why the LlamaIndex codebase uses PIL/Pillow for image processing instead of a library that supports both raster and vector graphics, it's likely due to its simplicity and direct support for the primary image manipulation tasks required by the codebase, such as converting images to and from base64 strings. The primary concern was handling raster images in common formats like JPEG, for which PIL/Pillow is well-suited due to its ease of use and comprehensive support for various image file formats and operations. The choice might not be related to the need for vector graphics support, which could explain the absence of a library that handles both raster and vector graphics.

To resolve your issue, you might need to convert your SVG images to a raster format that PIL/Pillow can handle, such as PNG or JPEG, before running the code. Alternatively, you could modify the code to use a library that supports SVG, such as CairoSVG or svglib, but this would require a good understanding of the code and the library you're integrating.

I hope this helps! If you have any more questions, feel free to ask.

Sources:

Sources

About Dosu

This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

If you want to continue the conversation, start your reply with @dosu-bot.

1 reply

jaybfn Mar 11, 2024
Author

@dosu-bot Thank you so much for the response, I will work on and it get back if I have anymore issue!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question regarding Image Parser #11794

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

About Dosu

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Question regarding Image Parser #11794

jaybfn Mar 9, 2024

Replies: 1 comment · 1 reply

dosubot[bot] bot Mar 9, 2024

Sources

About Dosu

jaybfn Mar 11, 2024 Author

jaybfn
Mar 9, 2024

Replies: 1 comment 1 reply

dosubot[bot]
bot Mar 9, 2024

jaybfn Mar 11, 2024
Author