Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine
ICLR 2018 workshop | openreview | code*
Problematic:
Contributions:
Issues & comments: