Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiafei Lyu's picture
1

Jiafei Lyu

dmux
BryantMcGill's profile picture 21world's profile picture
ยท
https://dmksjfl.github.io/
  • dmksjfl

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper 20 days ago
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
authored a paper 8 months ago
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
authored a paper over 1 year ago
SEABO: A Simple Search-Based Method for Offline Imitation Learning
View all activity

Organizations

Tsinghua University's profile picture

dmux 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs