Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.
Javier Rando,Tony Wang,Stewart Slocum,Dmitrii Krasheninnikov,Usman Anwar,Micah Carroll,Xander Davies,Claudia Shi,Thomas Gilbert,Rachel Freedman,Charbel-Raphael Segerie,Phillip Christoffersen,Jacob Pfau,Tomek Korbak,Xin Chen,Lauro Langosco,Samuel Marks,Erdem Bıyık,Dorsa Sadigh,David Krueger,Pedro Freire,Mehul Damani,Jérémy Scheurer,David Lindner,Anca Dragan,Anand Siththaranjan,Dylan Hadfield-Menell,Max Nadeau,Stephen Casper,Peter Hase,Andi Peng,Eric Michaud ICLR 2025(2025)
Key words
Software Reliability Modeling
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper