IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks Paper • 2506.16402 • Published Jun 19
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Paper • 2403.19622 • Published Mar 28, 2024
Geometrically-Constrained Agent for Spatial Reasoning Paper • 2511.22659 • Published 13 days ago
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE Paper • 2311.02684 • Published Nov 5, 2023
A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs Paper • 2411.17265 • Published Nov 26, 2024
Use Property-Based Testing to Bridge LLM Code Generation and Validation Paper • 2506.18315 • Published Jun 23