Item

Reinforcement learning (Q2571): Difference between revisions

(Changed [en] label: Reinforcement learning)
 
(5 intermediate revisions by the same user not shown)
label / de
Reinforcement learning with human feedback
Reinforcement learning
description / en
RLHF
Successful solutions are rewarded and given greater weight
description / de
 
Successful approaches are rewarded and later weighted more heavily
Property / depends on
 
Property / depends on: Machine learning / rank
 
Normal rank
Property / depends on
 
Property / depends on: Training AI models / rank
 
Normal rank
Property / depends on
 
Property / depends on: Supervised learning / rank
 
Normal rank

Latest revision as of 13:42, 27 January 2026

Successful solutions are rewarded and given greater weight
Language | Label | Description | Also known as
English | Reinforcement learning | Successful solutions are rewarded and given greater weight |

    Statements

    0 references
    0 references
    0 references