Exploration in Reinforcement Learning Reading List
Curated by Mouhssine Rifaki | Stanford Electrical Engineering | Last updated April 2026
How agents learn to explore efficiently in large, sparse-reward environments.
- Unifying Count-Based Exploration and Intrinsic Motivation
Bellemare et al.. NeurIPS 2016.
- Curiosity-Driven Exploration by Self-Supervised Prediction
Pathak et al.. ICML 2017.
- Never Give Up: Learning Directed Exploration Strategies
Badia et al.. ICLR 2020.
- First Return, Then Explore
Ecoffet et al.. Nature 2021.
- Exploration by Random Network Distillation
Burda et al.. ICLR 2019.
- Self-Supervised Exploration via Disagreement
Pathak et al.. ICML 2019.
- BYOL-Explore: Exploration by Bootstrapped Prediction
Guo et al.. NeurIPS 2022.
- Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics
Hussain, Belardinelli, Piliouras. ICML 2023.
← Back to main page