Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF Paper • 2405.19320 • Published May 29, 2024 • 10
SQL-PaLM: Improved Large Language ModelAdaptation for Text-to-SQL Paper • 2306.00739 • Published May 26, 2023 • 20