Image/graph Extraction Pipeline #280

Dipankar1997161 · 2023-11-06T19:56:56Z

Dipankar1997161
Nov 6, 2023

Enhancement 🚀
The feature is simply an image extraction pipeline.

Motivation 💪
I believe any documents at times may include images just like they have tables and other data. so a clean extraction of it would be nice.
Currently I used the ImageLyaoutService which does crop the image but it works for tables. Can we possibly add a feature to extract figures and graphs?

JaMe76 · 2023-11-08T07:19:23Z

JaMe76
Nov 8, 2023
Maintainer

You are looking for an extraction pipeline that detects images (with/without text content), crops and returns all images per page, right?

I haven't worked through this use-case but I think this can be configured by changing only the .yaml.

Let me think about this, I'll try to work this out.

0 replies

Dipankar1997161 · 2023-11-18T02:17:33Z

Dipankar1997161
Nov 18, 2023
Author

You are looking for an extraction pipeline that detects images (with/without text content), crops and returns all images per page, right?

I haven't worked through this use-case but I think this can be configured by changing only the .yaml.

Let me think about this, I'll try to work this out.

@JaMe76, hey, any luck on solving this use-case of extracting images?? Do update me if in case something comes out. Thanks again for considering this request

0 replies

JaMe76 · 2023-12-30T13:05:19Z

JaMe76
Dec 30, 2023
Maintainer

Sorry for late reply.

Converting the issue into a discussion as this solution might be interesting to others as well.

I've added some functions to make it easy to extract figures without diving deep into the extraction process. Everything depends, however, on the accuracy of the layout detection model.

import deepdoctection as dd

path = "/path/to/dir/2312.13560.pdf"  # you can find the sample here: https://github.com/deepdoctection/notebooks/blob/main/sample/2312.13560.pdf

analyzer = dd.get_dd_analyzer()

df = analyzer.analyze(path=path)
df.reset_state()

for dp in df:
    figures = dp.get_annotation(category_names=dd.LayoutType.figure)
    for fig in figures:
        fig.viz(interactive=True)  # vizualize the figure with an interactive window
        np_array = fig.viz() # get the numpy array of the figure region
        dd.viz_handler.write_image(f"/path/to/dir/{fig.annotation_id}.png", np_array) # save the numpy array as .png

1 reply

tapegoji May 8, 2024

Hi There,
This is a wonderful work. Thank you very much. I have been looking for this for months and when I came across it I was really happy.
Great great job!
Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image/graph Extraction Pipeline #280

{{title}}

Replies: 3 comments 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

Select a reply

Image/graph Extraction Pipeline #280

Dipankar1997161 Nov 6, 2023

Replies: 3 comments · 1 reply

JaMe76 Nov 8, 2023 Maintainer

Dipankar1997161 Nov 18, 2023 Author

JaMe76 Dec 30, 2023 Maintainer

tapegoji May 8, 2024

Dipankar1997161
Nov 6, 2023

Replies: 3 comments 1 reply

JaMe76
Nov 8, 2023
Maintainer

Dipankar1997161
Nov 18, 2023
Author

JaMe76
Dec 30, 2023
Maintainer