Gradio-Blocks-Party

company

AI & ML interests

None defined yet.

Recent Activity

taesiri submitted a paper about 4 hours ago

Agentic AI and the next intelligence explosion

taesiri submitted a paper about 4 hours ago

PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost

taesiri submitted a paper about 4 hours ago

Effective Strategies for Asynchronous Software Engineering Agents

View all activity

submitted 5 papers to Daily Papers about 4 hours ago

Agentic AI and the next intelligence explosion

Paper • 2603.20639 • Published 3 days ago • 3

PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost

Paper • 2603.21383 • Published 1 day ago • 6

Effective Strategies for Asynchronous Software Engineering Agents

Paper • 2603.21489 • Published 1 day ago • 1

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published about 16 hours ago • 7

WorldCache: Content-Aware Caching for Accelerated Video World Models

Paper • 2603.22286 • Published about 14 hours ago • 3

submitted 5 papers to Daily Papers 1 day ago

Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

Paper • 2603.20155 • Published 4 days ago • 7

WorldAgents: Can Foundation Image Models be Agents for 3D World Models?

Paper • 2603.19708 • Published 4 days ago • 10

Hyperagents

Paper • 2603.19461 • Published 4 days ago • 24

Teaching an Agent to Sketch One Part at a Time

Paper • 2603.19500 • Published 4 days ago • 4

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

Paper • 2603.19685 • Published 4 days ago • 14

submitted 4 papers to Daily Papers 4 days ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 5 days ago • 10

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published 5 days ago • 3

Matryoshka Gaussian Splatting

Paper • 2603.19234 • Published 5 days ago • 8

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 5 days ago • 54

submitted 5 papers to Daily Papers 5 days ago

AI Scientist via Synthetic Task Scaling

Paper • 2603.17216 • Published 6 days ago • 3

Efficient Exploration at Scale

Paper • 2603.17378 • Published 6 days ago • 12

PRISM: Demystifying Retention and Interaction in Mid-Training

Paper • 2603.17074 • Published 7 days ago • 1

LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition

Paper • 2603.17965 • Published 6 days ago • 5

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Paper • 2603.18004 • Published 6 days ago • 12

submitted a paper to Daily Papers 6 days ago

Efficient Reasoning on the Edge

Paper • 2603.16867 • Published 7 days ago • 17