Reinforcement Learning

From bandits to deep RL. Policy gradients, Q-learning, actor-critic, RLHF, and real-world applications.

12 chapters. First chapter free to preview.

After this course, you'll be able to:

✓ Implement Q-learning, policy gradients, and actor-critic from scratch
✓ Train AI from human feedback (RLHF, the technique behind ChatGPT)
✓ Build multi-agent RL systems
✓ Deploy RL in production applications

Full syllabus

1. Foundations (free preview)
2. Dynamic Programming
3. Monte Carlo Methods
4. Temporal Difference
5. Function Approximation
6. Policy Gradient
7. Advanced Policy Optimization
8. Model-Based RL
9. Multi-Agent RL
10. RLHF
11. Applications
12. Production Deployment
