Improving Multimodal Interactive Agents with Reinforcement Learning from
Human Feedback
Josh Abramson,Arun Ahuja,Federico Carnevale,Petko Georgiev,Alex Goldin,Alden Hung,Jessica Landon, Jirka Lhotka,Timothy Lillicrap,Alistair Muldal, George Powell,Adam Santoro,Guy Scully,Sanjana Srivastava,Tamara von Glehn,Greg Wayne,Nathaniel Wong,Chen Yan,Rui Zhu CoRR(2022)
AI 理解论文
溯源树
样例
