MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 24 days ago • 158
Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5 • 53
CoDA: Agentic Systems for Collaborative Data Visualization Paper • 2510.03194 • Published Oct 3 • 28
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent Paper • 2508.06600 • Published Aug 8 • 41
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents Paper • 2508.13186 • Published Aug 14 • 18