Deep reinforcement learning from human preferences
by Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei