¡Desconecta con la aplicación Player FM !
Fine-tuning and Preference Alignment in a Single Streamlined Process
Manage episode 423374192 series 2570898
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model.
Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/
Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.
Detailed show notes can be found on The Data Exchange web site.
277 episodios
Manage episode 423374192 series 2570898
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model.
Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/
Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.
Detailed show notes can be found on The Data Exchange web site.
277 episodios
Todos los episodios
×Bienvenido a Player FM!
Player FM está escaneando la web en busca de podcasts de alta calidad para que los disfrutes en este momento. Es la mejor aplicación de podcast y funciona en Android, iPhone y la web. Regístrate para sincronizar suscripciones a través de dispositivos.