-
Notifications
You must be signed in to change notification settings - Fork 1
Jaeyoung Lee edited this page May 8, 2020
·
3 revisions
https://github.com/stereoboy/deep-reinforcement-learning
- PPO: Proximal Policy Optimization Algorithms
- A3C: Asynchronous Methods for Deep Reinforcement Learning
- D4PG: DISTRIBUTED DISTRIBUTIONAL DETERMINISTIC POLICY GRADIENTS
-
Benchmarking Deep Reinforcement Learning for Continuous Control
-
Blog Article: Proximal Policy Optimization
-
Deep Reinforcement Learning Doesn't Work Yet
- Markov Games
- Cooperation, Competition, Mixed Environments
https://github.com/stereoboy/deep-reinforcement-learning/tree/master/p3_collab-compet