Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DO NOT MERGE] Clone/mixibo improved html links #2883

Closed
wants to merge 41 commits into from

Commits on Feb 9, 2024

  1. Refactor threshold to annotation_threshold and make it an optional pa…

    …rameter
    Michael Niestroj committed Feb 9, 2024
    Configuration menu
    Copy the full SHA
    724cdb5 View commit details
    Browse the repository at this point in the history

Commits on Feb 13, 2024

  1. Merge branch 'main' into main

    MiXiBo committed Feb 13, 2024
    Configuration menu
    Copy the full SHA
    643e67e View commit details
    Browse the repository at this point in the history

Commits on Feb 26, 2024

  1. Merge branch 'main' into main

    MiXiBo committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    34c78de View commit details
    Browse the repository at this point in the history

Commits on Feb 29, 2024

  1. Configuration menu
    Copy the full SHA
    6c34d9f View commit details
    Browse the repository at this point in the history
  2. add support for start_index to html link extraction

    Michael Niestroj committed Feb 29, 2024
    Configuration menu
    Copy the full SHA
    f4a18b5 View commit details
    Browse the repository at this point in the history

Commits on Mar 5, 2024

  1. Configuration menu
    Copy the full SHA
    ff2f3bf View commit details
    Browse the repository at this point in the history

Commits on Mar 6, 2024

  1. Revert "Refactor threshold to annotation_threshold and make it an opt…

    …ional parameter"
    
    This reverts commit 724cdb5.
    Michael Niestroj committed Mar 6, 2024
    Configuration menu
    Copy the full SHA
    4f99d67 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0c1e2c1 View commit details
    Browse the repository at this point in the history

Commits on Mar 7, 2024

  1. Configuration menu
    Copy the full SHA
    8c12ca7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    99f3545 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e75b43d View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3be9452 View commit details
    Browse the repository at this point in the history
  5. test: add unit test

    christinestraub committed Mar 7, 2024
    Configuration menu
    Copy the full SHA
    9c13c98 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    70f2a87 View commit details
    Browse the repository at this point in the history

Commits on Mar 8, 2024

  1. [DO NOT MERGE] feat: add support for start_index in html links extrac…

    …tion <- Ingest test fixtures update (#2622)
    
    This pull request includes updated ingest test fixtures.
    Please review and merge if appropriate.
    
    Co-authored-by: christinestraub <christinestraub@users.noreply.github.com>
    ryannikolaidis and christinestraub committed Mar 8, 2024
    Configuration menu
    Copy the full SHA
    8737981 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f7623e1 View commit details
    Browse the repository at this point in the history
  3. test: fix lint error

    christinestraub committed Mar 8, 2024
    Configuration menu
    Copy the full SHA
    62f15a2 View commit details
    Browse the repository at this point in the history
  4. Merge branch 'main' into mixibo/improved_pdf_html_links_support

    # Conflicts:
    #	CHANGELOG.md
    #	unstructured/__version__.py
    christinestraub committed Mar 8, 2024
    Configuration menu
    Copy the full SHA
    2079213 View commit details
    Browse the repository at this point in the history

Commits on Mar 14, 2024

  1. Merge branch 'main' into mixibo/improved_pdf_html_links_support

    # Conflicts:
    #	CHANGELOG.md
    #	unstructured/__version__.py
    christinestraub committed Mar 14, 2024
    Configuration menu
    Copy the full SHA
    83b5118 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d8e8ac1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9aa5651 View commit details
    Browse the repository at this point in the history

Commits on Mar 15, 2024

  1. Merge branch 'main' into mixibo/improved_pdf_html_links_support

    # Conflicts:
    #	CHANGELOG.md
    #	unstructured/__version__.py
    #	unstructured/documents/elements.py
    christinestraub committed Mar 15, 2024
    Configuration menu
    Copy the full SHA
    0359f3a View commit details
    Browse the repository at this point in the history

Commits on Mar 18, 2024

  1. Configuration menu
    Copy the full SHA
    c22e3f6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a5d245c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5e05202 View commit details
    Browse the repository at this point in the history
  4. chore: bump version

    christinestraub committed Mar 18, 2024
    Configuration menu
    Copy the full SHA
    9483427 View commit details
    Browse the repository at this point in the history

Commits on Mar 27, 2024

  1. reviewed start_index handling

    MiXiBo committed Mar 27, 2024
    Configuration menu
    Copy the full SHA
    fcee08a View commit details
    Browse the repository at this point in the history

Commits on Mar 28, 2024

  1. Configuration menu
    Copy the full SHA
    de90ce3 View commit details
    Browse the repository at this point in the history

Commits on Apr 8, 2024

  1. Merge branch 'main' into mixibo/improved_pdf_html_links_support

    # Conflicts:
    #	CHANGELOG.md
    #	unstructured/__version__.py
    christinestraub committed Apr 8, 2024
    Configuration menu
    Copy the full SHA
    e8ff218 View commit details
    Browse the repository at this point in the history

Commits on Apr 9, 2024

  1. Configuration menu
    Copy the full SHA
    86a95f1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    076492e View commit details
    Browse the repository at this point in the history
  3. test: fix lint error

    christinestraub committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    e300a70 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    391cac0 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    9b06533 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    8ff23af View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2024

  1. Configuration menu
    Copy the full SHA
    a11c968 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    5b9dee3 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d7b6aff View commit details
    Browse the repository at this point in the history
  4. [DO NOT MERGE] Feat: add support for start_index in html links ex…

    …traction <- Ingest test fixtures update (#2882)
    
    This pull request includes updated ingest test fixtures.
    Please review and merge if appropriate.
    
    ---------
    
    Co-authored-by: christinestraub <christinestraub@users.noreply.github.com>
    Co-authored-by: christinestraub <christinemstraub@gmail.com>
    3 people committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    8895e5a View commit details
    Browse the repository at this point in the history
  5. Merge branch 'main' into feat/2625-html-support-link-start-index

    # Conflicts:
    #	CHANGELOG.md
    christinestraub committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    b4ec676 View commit details
    Browse the repository at this point in the history
  6. chore: update version

    christinestraub committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    1765496 View commit details
    Browse the repository at this point in the history