项目作者: ikostrikov

项目描述 :
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
高级语言: Python
项目地址: git://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail.git
创建时间: 2017-08-22T15:57:25Z
项目社区:https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail

开源协议:MIT License

下载