23 210 9

Chengsong Huang

ChengsongHuang

https://chengsong-huang.github.io/

hcscctv

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

upvoted a paper 1 day ago

Language Models Need Sleep

upvoted a paper 3 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

View all activity

Organizations

upvoted a paper about 8 hours ago

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Paper • 2605.27030 • Published 2 days ago • 24

upvoted a paper 1 day ago

Language Models Need Sleep

Paper • 2605.26099 • Published 3 days ago • 8

upvoted a paper 3 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 6 days ago • 184

upvoted 2 papers 7 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published 8 days ago • 48

The Unlearnability Phenomenon in RLVR for Language Models

Paper • 2605.16787 • Published 12 days ago • 6

upvoted a paper 8 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 13 days ago • 52

upvoted a paper 16 days ago

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Paper • 2605.09959 • Published 17 days ago • 17

upvoted a paper 17 days ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published 20 days ago • 68

upvoted a paper 18 days ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published 21 days ago • 45

upvoted a paper 19 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published 25 days ago • 115

upvoted a paper 20 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 21 days ago • 37

upvoted a paper 29 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published about 1 month ago • 273

upvoted 2 papers about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 163

upvoted 2 papers about 2 months ago

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Paper • 2604.05333 • Published Apr 7 • 22

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published Apr 8 • 38

upvoted a paper 2 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 140

upvoted 3 papers 3 months ago

Chengsong Huang

AI & ML interests

Recent Activity

Organizations

ChengsongHuang's activity