Item

Reinforcement learning with human feedback (Q2177): Difference between revisions

(‎Created a new Item)
(No difference)

Revision as of 10:09, 13 October 2025

Training a model using human preferences
  • RLHF
Language Label Description Also known as
English
Reinforcement learning with human feedback
Training a model using human preferences
  • RLHF

Statements