Reinforcement Learning codes

Name	Description	Illustration

pytorch-a2c-ppo-acktr	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
irl-imitation	Implementations of model-based Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
NoisyNet-A3C	NoisyNet [1] (LSTM) asynchronous advantage actor-critic (A3C) [2] on the CartPole-v1 environment. This repo has a minimalistic design and a classic control environment to enable quick investigation of different hyperparameters.
dqn.Pytorch	Deep Q-Learning Network in pytorch
rl.pytorch	Deep Reinforcement Learning with pytorch & visdom
a3c.PyTorch	PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
REINFORCE.pytorch	PyTorch Implementation of REINFORCE for both discrete & continuous control
chatbot.PyTorch	A Marvelous ChatBot implement using PyTorch.
rl.Pytorch	Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)
dress	Sentence Simplification with Deep Reinforcement Learning
pytorch-ddpg	Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
rl_atari_pytorch	ReinforcementLearning Learn Play Atari Using DDPG and LSTM.
NoisyNet-A3C.PyTorch	Noisy Networks for Exploration
neural-combinatorial-rl.pytorch	PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning.
spatial-reasoning.pytorch	Code for the paper "Representation Learning for Grounded Spatial Reasoning" https://arxiv.org/abs/1707.03938
clevr-iep.PyTorch	This is the code for the paper: Inferring and Executing Programs for Visual Reasoning. Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Fei-Fei Li, Larry Zitnick, Ross Girshick arXiv 2017
es.PyTorch	Evolution Strategies in PyTorch