Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).
Hands-on Deep Reinforcement Learning, published by Packt
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty