
Reinforcement learning (Q2571): Difference between revisions

(Added [de] label: Reinforcement learning with human feedback - GraphIT.htm)
 
(7 intermediate revisions by the same user not shown)
label / enlabel / en
Reinforcement learning with human feedback
Reinforcement learning
label / delabel / de
Reinforcement learning with human feedback - GraphIT.htm
Reinforcement learning
description / endescription / en
RLHF
Successful solutions are rewarded and given greater weight
description / dedescription / de
 
Erfolgreiche Lösungsansätze werden belohnt, und später stärker gewichtet [English: Successful approaches are rewarded and later given greater weight]
Property / depends on
 
Property / depends on: Machine learning / rank
 
Normal rank
Property / depends on
 
Property / depends on: Training AI models / rank
 
Normal rank
Property / depends on
 
Property / depends on: Supervised learning / rank
 
Normal rank

Latest revision as of 13:42, 27 January 2026

Successful solutions are rewarded and given greater weight
Language | Label | Description | Also known as
English | Reinforcement learning | Successful solutions are rewarded and given greater weight |

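    Statements

    depends on
    Machine learning (Normal rank, 0 references)
    Training AI models (Normal rank, 0 references)
    Supervised learning (Normal rank, 0 references)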