Guanhua Huang
Carlanlarkk
AI & ML interests
None yet
Recent Activity
authored
a paper
2 months ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
upvoted
a
paper
2 months ago
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and
Planning
upvoted
a
paper
2 months ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
Organizations
None yet