Reinforcement learning (Q2571): Difference between revisions

Latest revision as of 13:42, 27 January 2026

Successful solutions are rewarded and given greater weight

Language	Label	Description	Also known as
English	Reinforcement learning	Successful solutions are rewarded and given greater weight

0 references

0 references

0 references

@@ label / de / label / de @@
-Reinforcement learning with human feedback
+Reinforcement learning
@@ description / en / description / en @@
-RLHF
+Successful solutions are rewarded and given greater weight
@@ description / de / description / de @@
+Erfolgreiche Lösungsansätze werden belohnt, und später stärker gewichtet
@@ Property / depends on @@
+Machine learning
@@ Property / depends on: Machine learning / rank @@
+Normal rank
@@ Property / depends on @@
+Training AI models
@@ Property / depends on: Training AI models / rank @@
+Normal rank
@@ Property / depends on @@
+Supervised learning
@@ Property / depends on: Supervised learning / rank @@
+Normal rank