Reinforcement Learning

Learning by trial and error from reward signals.

foundation tier

Reinforcement learning studies how an agent learns to act by trial and error, guided by reward signals from its environment. It sits within machine learning and inherits that area’s core questions about correctness, scale, and tractability. This page surveys the conceptual axes of the topic and points to the references that frame ongoing research and teaching. It is meant both as an entry point for newcomers and as an index for practitioners checking their mental model against the field’s primary sources.
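To make "trial and error from reward signals" concrete, here is a minimal sketch of tabular Q-learning, one standard algorithm covered in the references cited on this page. The five-state corridor environment, the function names (step, q_learning, greedy_policy), and all hyperparameter values are assumptions chosen for illustration; none of them come from the sources.

```python
import random

N_STATES = 5          # states 0..4; reaching state 4 ends the episode (assumed toy setup)
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right


def step(state, action):
    """Hypothetical toy environment: a corridor that pays reward 1 at the right end."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values from sampled transitions only (no access to the model)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))          # explore: random action
            else:
                best = max(q[state])                     # exploit, breaking ties at random
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: reward plus discounted best estimated next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal state."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: row[i])] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

The agent is never told that moving right is correct. It only observes rewards, and the preference for moving right emerges from the accumulated value estimates. Real problems differ mainly in scale and in how the value table is replaced by a function approximator.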

Work on reinforcement learning can be organised around a few interlocking concerns: the formal objects under study, the algorithms or systems that compute over them, the resource trade-offs (time, memory, communication, statistical efficiency), and the empirical or theoretical guarantees that practitioners rely on. The sources cited below approach the topic from a mix of these angles.
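As a rough illustration of the "formal objects" mentioned above, the most common formalisation is the discounted Markov decision process. The notation below follows widespread textbook usage and is an assumption of this sketch, not a quotation from the cited sources. Here r(s, a) denotes the expected immediate reward and R_t the random reward received at time t.

```latex
% Markov decision process: states, actions, transition kernel, reward, discount
\mathcal{M} = (\mathcal{S}, \mathcal{A}, P, r, \gamma), \qquad
P(s' \mid s, a) \in [0, 1], \quad r(s, a) \in \mathbb{R}, \quad \gamma \in [0, 1)

% Discounted return from time t
G_t = \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1}

% Bellman optimality equation for the optimal action-value function
Q^{*}(s, a) = r(s, a) + \gamma \sum_{s'} P(s' \mid s, a) \max_{a'} Q^{*}(s', a')
```

Algorithms differ mainly in whether they estimate Q^{*} directly, learn a model of P and r, or optimise a parameterised policy, and the resource trade-offs listed above show up as different sample and compute costs for each choice.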

Foundational references

Sutton and Barto, Reinforcement Learning: An Introduction (2018), is a standard reference for this material. It serves both as a curriculum anchor and as a long-form survey of techniques.

Supporting and complementary work

Szepesvári, Algorithms for Reinforcement Learning (2010), provides supporting material that complements the primary reference. Readers comparing approaches will find useful framings, alternative notations, and extensions there.

Open methodological questions in reinforcement learning cluster around how to compose the field's techniques under realistic constraints: scale, adversarial inputs, partial observability, and shifting workloads. The cited references give the precise statements, proofs, and empirical evaluations that this overview only sketches. Downstream topic pages cover specific subfields in more detail.

Prerequisites

Sources

textbook · primary · 2018

Reinforcement Learning: An Introduction

sutton-2018

textbook · supporting · 2010

Algorithms for Reinforcement Learning

szepesvari-2010

Reviewed by

@lucaderumier field

Explore

Human-in-the-Loop Reinforcement Learning

Markov Decision Processes

Sim-to-Real Reinforcement Learning

Value-Based Methods

Multi-Agent Reinforcement Learning

Policy Gradient Methods

Model-Based RL

Offline Reinforcement Learning

Exploration in RL

Hierarchical RL

Inverse Reinforcement Learning

Imitation Learning

RL from Human Feedback

Distributional RL

Safe Reinforcement Learning
