Free account = 1 chapter of every course unlocked
No credit card ยท Google sign-in in 30 seconds ยท 17+ free chapters across 17 courses
๐ฎ
Reinforcement Learning
From bandits to deep RL. Policy gradients, Q-learning, actor-critic, RLHF, and real-world applications.
12 chaptersFirst chapter free to preview
After this course, you'll be able to:
โImplement Q-learning, policy gradients, and actor-critic from scratch
โTrain AI from human feedback (RLHF โ the technique behind ChatGPT)
โBuild multi-agent RL systems
โDeploy RL in production applications
Full syllabus
2
Dynamic Programming
3
Monte Carlo Methods
4
Temporal Difference
5
Function Approximation
6
Policy Gradient
7
Advanced Policy Optimization
8
Model-Based RL
9
Multi-Agent RL
10
RLHF
11
Applications
12
Production Deployment
Unlock all 12 chapters
Plus 18 other courses โ 344 more chapters included.