Mingzhe Li's picture

4 2

Mingzhe Li

Mubuky

·

https://www.mubuky.com

Mubuky

AI & ML interests

RL & Agent

Recent Activity

upvoted a paper 3 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 4 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

Papers 1

arxiv:2511.04570

models 0

None public yet

datasets 0

None public yet