25 77 256

Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper 30 minutes ago

Nested Browser-Use Learning for Agentic Information Seeking

upvoted a paper 1 day ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

upvoted a paper 1 day ago

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

View all activity

Organizations

upvoted a paper 30 minutes ago

Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published about 18 hours ago • 6

upvoted 2 papers 1 day ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

Paper • 2512.21919 • Published 4 days ago • 7

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published 4 days ago • 24

upvoted 3 papers 5 days ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Paper • 2512.18470 • Published 10 days ago • 9

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 6 days ago • 27

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 7 days ago • 28

liked a dataset 6 days ago

nebius/SWE-agent-trajectories

Viewer • Updated Dec 23, 2024 • 80k • 1.15k • 67

upvoted a paper 8 days ago

SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

Paper • 2512.17419 • Published 11 days ago • 9

liked 2 datasets 12 days ago

nvidia/Nemotron-Cascade-RL-SWE

Viewer • Updated 14 days ago • 110k • 459 • 22

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 16.3k • 131

upvoted an article 15 days ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

15 days ago

•

101

liked 2 datasets 22 days ago

TuringEnterprises/Turing-Open-Reasoning

Viewer • Updated 24 days ago • 50 • 20.3k • 180

Anthropic/AnthropicInterviewer

Viewer • Updated 22 days ago • 1.25k • 12.2k • 342

upvoted 2 papers 24 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26 • 145

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published 27 days ago • 46

upvoted a paper 27 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 28 days ago • 241

upvoted a paper 28 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 29 days ago • 93

liked a dataset 28 days ago

nvidia/ToolScale

Viewer • Updated 13 days ago • 4.06k • 3.03k • 163

liked a model 28 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 29 days ago • 114k • • 1.05k

upvoted a collection about 1 month ago

Olmo 3 Post-training

Collection

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 7 days ago • 46

Yinxu Pan

AI & ML interests

Recent Activity

Organizations

cppowboy's activity

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models