Optimization & Loss Functions
Updated: 23/07/2018 15:12:15
On the Convergence of Adam and Beyond
Efficient softmax approximation for GPUs