Captioning models
Update: 23/07/2018_15:12:13
Show, Tell and Discriminate
FlipDial
Is an image worth more than a thousand words? on the fine-grain semantic differences