Multi-Preference Optimization: Generalizing DPO via Set-Level Contrasts Paper • 2412.04628 • Published Dec 5, 2024 • 1