Skip to yearly menu bar Skip to main content


Tutorial

Reinforcement Learning from Human Feedback: A Tutorial *

Dmitry Ustalov · Nathan Lambert

[ Project Page ]
[ Slides
2023 Tutorial

Abstract:

Chat is not available.