Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
Paper
•
2504.03784
•
Published
•
2
None defined yet.
AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees
Doubly Robust Alignment for Large Language Models