FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations Paper • 2507.07644 • Published Jul 10, 2025 • 5
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management Paper • 2508.21433 • Published Aug 29, 2025 • 8
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published Mar 27 • 144
NearID: Identity Representation Learning via Near-identity Distractors Paper • 2604.01973 • Published Apr 2 • 31
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation Paper • 2509.21989 • Published Sep 26, 2025 • 23