SERA: Soft-Verified Efficient Repository Agents • Paper • 2601.20789 • Published about 17 hours ago • 3
Linear representations in language models can change dramatically over a conversation • Paper • 2601.20834 • Published about 16 hours ago • 5
OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution • Paper • 2601.20380 • Published 1 day ago • 3
SketchDynamics: Exploring Free-Form Sketches for Dynamic Intent Expression in Animation Generation • Paper • 2601.20622 • Published about 21 hours ago
UI Remix: Supporting UI Design Through Interactive Example Retrieval and Remixing • Paper • 2601.18759 • Published 3 days ago • 2
SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback • Paper • 2601.18202 • Published 3 days ago • 6
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents • Paper • 2601.16344 • Published 7 days ago • 9
Endless Terminals: Scaling RL Environments for Terminal Agents • Paper • 2601.16443 • Published 6 days ago • 15
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents • Paper • 2601.16746 • Published 6 days ago • 77
Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory • Paper • 2601.16296 • Published 7 days ago • 26
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning • Paper • 2601.16163 • Published 7 days ago • 13
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces • Paper • 2601.11868 • Published 12 days ago • 32
Rethinking Video Generation Model for the Embodied World • Paper • 2601.15282 • Published 8 days ago • 42