Advantage-Guided Distillation for Preference Alignment in Small Language Models • Paper • 2502.17927 • Published Feb 25, 2025