Operator Learning Using Weak Supervision from Walk-on-Spheres Paper • 2603.01193 • Published Mar 1 • 2
Found-RL: foundation model-enhanced reinforcement learning for autonomous driving Paper • 2602.10458 • Published Feb 11
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge Paper • 2507.21183 • Published Jul 27, 2025 • 15
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment Paper • 2410.05255 • Published Oct 7, 2024 • 5
Multi-task Hierarchical Adversarial Inverse Reinforcement Learning Paper • 2305.12633 • Published May 22, 2023
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation Paper • 2112.02721 • Published Dec 6, 2021