arxiv:2510.00492
Jiongdao Jin
jiongdao
AI & ML interests
None yet
Recent Activity
upvoted a paper 18 days ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients
updated a model 22 days ago
jiongdao/grpo-outputs
updated a dataset 22 days ago
jiongdao/grpo-results