Reinforcement learning (Q2571): Difference between revisions
(Created a new Item: Reinforcement learning with human feedback, RLHF) |
(Created claim: depends on (P1): Supervised learning (Q2199)) |
| (8 intermediate revisions by the same user not shown) | |||
| label / en | label / en | ||
Reinforcement learning | Reinforcement learning | ||
| label / de | label / de | ||
Reinforcement learning | |||
| description / en | description / en | ||
Successful solutions are rewarded and given greater weight | |||
| description / de | description / de | ||
Erfolgreiche Lösungsansätze werden belohnt, und später stärker gewichtet | |||
| Property / depends on | |||
| Property / depends on: Machine learning / rank | |||
Normal rank | |||
| Property / depends on | |||
| Property / depends on: Training AI models / rank | |||
Normal rank | |||
| Property / depends on | |||
| Property / depends on: Supervised learning / rank | |||
Normal rank | |||
Latest revision as of 13:42, 27 January 2026
Successful solutions are rewarded and given greater weight
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Reinforcement learning | Successful solutions are rewarded and given greater weight | |