Skip to content

[BEE] Table model missing the last part of the table in the Japanese document #3402

@divekarsc

Description

@divekarsc

Bug

The bottom row of the table is missing from the output.

Observation:

These are selectable texts in the table
The token appears in the token list but not considered in table structure.
As per bounding box , token is well within the page

{
"id": "TOKEN_000102",
"confidence": 1.0,
"bbox": { "x": 169, "y": 606, "width": 110, "height": 10, "page_number": 1 },
"text": "ESGワーキング・グループ"
}

Steps to reproduce

Run IBM_2021_ESG_Report 10.pdf through the latest docling.
The document is kept here -> https://ibm.ent.box.com/folder/380274980502

Docling version

Latest

Python version

3.12

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions