Reinforcement Learning codes

Name Description Illustration
pytorch-a2c-ppo-acktr PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
irl-imitation Implementations of model-based Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
NoisyNet-A3C NoisyNet [1] (LSTM) asynchronous advantage actor-critic (A3C) [2] on the CartPole-v1 environment. This repo has a minimalistic design and a classic control environment to enable quick investigation of different hyperparameters.
dqn.Pytorch Deep Q-Learning Network in pytorch http://shws.cc.oita-u.ac.jp/shibata/KissSystem.png
rl.pytorch Deep Reinforcement Learning with pytorch & visdom
a3c.PyTorch PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
REINFORCE.pytorch PyTorch Implementation of REINFORCE for both discrete & continuous control
chatbot.PyTorch A Marvelous ChatBot implement using PyTorch.
rl.Pytorch Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)
dress Sentence Simplification with Deep Reinforcement Learning
pytorch-ddpg Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
rl_atari_pytorch ReinforcementLearning Learn Play Atari Using DDPG and LSTM.
NoisyNet-A3C.PyTorch Noisy Networks for Exploration
neural-combinatorial-rl.pytorch PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning.
spatial-reasoning.pytorch Code for the paper "Representation Learning for Grounded Spatial Reasoning" https://arxiv.org/abs/1707.03938
clevr-iep.PyTorch This is the code for the paper: Inferring and Executing Programs for Visual Reasoning. Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Fei-Fei Li, Larry Zitnick, Ross Girshick arXiv 2017
es.PyTorch Evolution Strategies in PyTorch