Reinforcement Learning
Teaching an AI by rewarding good outcomes and penalizing bad ones — which sounds straightforward until the system finds a way to maximize the reward without achieving the actual goal.
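To make the failure mode concrete, here is a minimal sketch of a reward that can be maximized without achieving the goal. Everything in it is a hypothetical toy invented for illustration: the one-dimensional track, the names `FINISH`, `CHECKPOINT`, and `HORIZON`, and the two hand-written policies. No learning algorithm is involved; it only compares how much reward two fixed behaviours collect.

```python
# Toy illustration only: a hypothetical environment, not taken from any library.
FINISH = 5       # the actual goal: reach this position on the track
CHECKPOINT = 2   # the proxy: +1 reward each time the agent steps onto this cell
HORIZON = 10     # maximum number of steps per episode


def run_episode(policy):
    """Run one episode and return (total_reward, reached_finish)."""
    position, total_reward = 0, 0
    for _ in range(HORIZON):
        position += policy(position)              # policy returns +1 or -1
        position = max(0, min(FINISH, position))  # stay on the track
        if position == CHECKPOINT:
            total_reward += 1
        if position == FINISH:
            return total_reward, True
    return total_reward, False


def go_to_finish(position):
    """Intended behaviour: always move toward the finish."""
    return 1


def loop_on_checkpoint(position):
    """Exploit: step onto the checkpoint, step off, and repeat."""
    return 1 if position < CHECKPOINT else -1


intended = run_episode(go_to_finish)
exploit = run_episode(loop_on_checkpoint)
print("intended:", intended)
print("exploit: ", exploit)
```

Under these assumptions, the looping policy collects 5 reward and never reaches the finish, while the intended policy reaches the finish with only 1 reward. A system trained purely to maximize this reward would be pushed toward the loop.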