ACE-Step 1.5: Pushing the Boundaries of Open-Source Music Generation Paper • 2602.00744 • Published 9 days ago • 5
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch Paper • 2602.03183 • Published 6 days ago • 8
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback Paper • 2204.05862 • Published Apr 12, 2022 • 3
Article Introducing OptiMind, a research model designed for optimization 24 days ago • 34
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability Collection A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 23
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints Paper • 2601.18137 • Published 14 days ago • 25
ECO: Quantized Training without Full-Precision Master Weights Paper • 2601.22101 • Published 10 days ago • 6
World events Collection Dataset containing real-world events from 2023 to the present • 3 items • Updated 14 days ago • 5
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 86
A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation Paper • 2601.09274 • Published 26 days ago • 84
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 130
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 222