- Absolute Zero: Reinforced Self-play Reasoning with Zero Data
  Paper • 2505.03335 • Published • 188
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
  Paper • 2502.06781 • Published • 59
- internlm/OREAL-7B
  Text Generation • 8B • Updated • 79 • 20
- internlm/OREAL-32B
  Text Generation • 33B • Updated • 266 • 24
AI-Insight
AI-Insight is a collaborative initiative dedicated to delivering cutting-edge perspectives on the fast-evolving field of artificial intelligence. Jointly initiated by contributors from OpenMMLab, Hugging Face, ModelScope, SmartFlowAI, and other leading AI communities, AI-Insight focuses on curating the most valuable knowledge in an era of information overload.
Our flagship event series, AI-Insight Talk, invites the authors of Hugging Face’s hottest papers and other top research to share their latest findings and behind-the-scenes insights. By connecting pioneering researchers with the open-source community, we aim to foster deeper understanding, inspire innovation, and spark meaningful discussions across all areas of AI, from foundational models to multimodal systems and beyond.
Join AI-Insight to stay at the forefront of AI research, learn from the best minds, and contribute to shaping the future of intelligent technology.
What We Do:
- Host AI-Insight Talks featuring top Hugging Face Hot Papers
- Provide in-depth discussions and practical takeaways from groundbreaking AI research
- Build a vibrant, knowledge-driven community for researchers, developers, and enthusiasts
- Promote open-source values and collaboration in the global AI ecosystem
- CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming
  Paper • 2505.12925 • Published • 2
- Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
  Paper • 2503.04149 • Published • 6
- OSS-Bench: Benchmark Generator for Coding LLMs
  Paper • 2505.12331 • Published • 2
- UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
  Paper • 2506.09289 • Published • 2