|
|
![]() |
pytorch-a2c-ppo-acktr
|
PyTorch implementation of Advantage Actor Critic
(A2C), Proximal Policy Optimization (PPO) and Scalable
trust-region method for deep reinforcement learning using
Kronecker-factored approximation (ACKTR). |
 |
irl-imitation
|
Implementations of model-based Inverse Reinforcement
Learning (IRL) algorithms in python/Tensorflow. Deep
MaxEnt, MaxEnt, LPIRL |
 |
NoisyNet-A3C
|
NoisyNet [1] (LSTM) asynchronous advantage
actor-critic (A3C) [2] on the CartPole-v1 environment.
This repo has a minimalistic design and a classic control
environment to enable quick investigation of different
hyperparameters. |
 |
dqn.Pytorch
|
Deep Q-Learning Network in pytorch |
![http://shws.cc.oita-u.ac.jp/shibata/KissSystem.png]() |
rl.pytorch
|
Deep Reinforcement Learning with pytorch &
visdom |
![]() |
a3c.PyTorch
|
PyTorch implementation of Asynchronous Advantage
Actor Critic (A3C) from "Asynchronous Methods for Deep
Reinforcement Learning". |
 |
REINFORCE.pytorch
|
PyTorch Implementation of REINFORCE for both discrete
& continuous control |
![]() |
chatbot.PyTorch
|
A Marvelous ChatBot implement using PyTorch. |
 |
rl.Pytorch
|
Deep Reinforcement Learning with pytorch & visdom
(the branch for A3C continuous control) |
![]() |
dress
|
Sentence Simplification with Deep Reinforcement
Learning |
![]() |
pytorch-ddpg
|
Implementation of the Deep Deterministic Policy
Gradient (DDPG) using PyTorch |
 |
rl_atari_pytorch
|
ReinforcementLearning Learn Play Atari Using DDPG and
LSTM. |
![]() |
NoisyNet-A3C.PyTorch
|
Noisy Networks for Exploration |
 |
neural-combinatorial-rl.pytorch
|
PyTorch implementation of Neural Combinatorial
Optimization with Reinforcement Learning. |
 |
spatial-reasoning.pytorch
|
Code for the paper "Representation Learning for
Grounded Spatial Reasoning"
https://arxiv.org/abs/1707.03938 |
 |
clevr-iep.PyTorch
|
This is the code for the paper: Inferring and
Executing Programs for Visual Reasoning. Justin Johnson,
Bharath Hariharan, Laurens van der Maaten, Judy Hoffman,
Fei-Fei Li, Larry Zitnick, Ross Girshick arXiv 2017 |
 |
es.PyTorch
|
Evolution Strategies in PyTorch |
 |