https://github.com/higgsfield/RL-Adventure-2/blob/master/3.ppo.ipynb
https://arxiv.org/abs/1707.06347
https://github.com/VashishtMadhavan/rl2
https://github.com/noahgolmant/RL-squared
https://github.com/pytorch/examples/blob/master/reinforcement_learning/reinforce.py
https://medium.com/@sanketgujar95/trust-region-policy-optimization-trpo-and-proximal-policy-optimization-ppo-e6e7075f39ed
https://towardsdatascience.com/understanding-gru-networks-2ef37df6c9be
cshaib/SNAIL_Pytorch
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|