Grounding/Grasping Issues on OCID-VLG

We evaluated GraspMAS on the OCID-VLG dataset and observed performance issues regarding object grounding and grasp generation.

### 1. Grounding Failure (GroundingDINO)
The GroundingDINO model frequently fails to ground the target object when allowing up to 5 rounds.

* **Red Rectangle:** Model Prediction
* **Green Rectangle:** Ground Truth

<img width="640" height="480" alt="Image" src="https://github.com/user-attachments/assets/9b60a813-3bed-48d6-9bb7-6cf186a3dbc4" />

### 2. Overly Large Grasp Rectangles
The generated grasp rectangles are often significantly larger than the target object and do not align with the object’s geometry.

* **Red Rectangle:** Model Prediction
* **Green Rectangle:** Ground Truth

<img width="640" height="480" alt="Image" src="https://github.com/user-attachments/assets/adc0620c-9b8f-4f7f-b897-ceef7b61aa14" />

### Overall Performance
When running the full pipeline on OCID-VLG, we observed an overall success rate of ~17%, which is lower than expected.

### Reproducibility / Verification Request

We have compiled five specific [failure cases](https://drive.google.com/file/d/1ntCHihvHhgjnKk89Wsm8l-wYUICGY4bt/view?usp=sharing) including their respective prompts. We would appreciate it if the maintainers (or other users) could run these cases and confirm whether the same grounding and grasping behavior is observed.

We would appreciate any insight into whether these results are expected under the current implementation or if additional configuration is required.

Thank you for your assistance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Grounding/Grasping Issues on OCID-VLG #3

1. Grounding Failure (GroundingDINO)

2. Overly Large Grasp Rectangles

Overall Performance

Reproducibility / Verification Request

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Grounding/Grasping Issues on OCID-VLG #3

Description

1. Grounding Failure (GroundingDINO)

2. Overly Large Grasp Rectangles

Overall Performance

Reproducibility / Verification Request

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions