Update: 23/07/2018_15:12:15
  1. On the Convergence of Adam and Beyond
  2. Efficient softmax approximation for GPUs