Shaojie Bai, J. Zico Kolter and Vladlen Koltun
How to choose between CNN- and RNN-based sequence models
Benchmarking convolutional and recurrent neural networks on a range of standard tasks.
Describes a generic temporal convolutional network (TCN), including practices such as dilated convolutions and residual connections, and compares it to baseline RNN architectures in three variants (LSTM, GRU, vanilla RNN).
No attention is used with the RNNs, although the TCN is boosted with recent practices (dilations and residual connections).
Focuses mainly on sequence modelling tasks.
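To make the two TCN ingredients named above concrete, here is a minimal pure-Python sketch of a causal dilated convolution wrapped in a residual connection. It is an illustration under simplifying assumptions, not the paper's implementation: it uses a single channel, one convolution per block, no normalization or dropout, and the function names (`causal_dilated_conv`, `residual_block`, `tcn`, `receptive_field`) are made up for this note.

```python
def causal_dilated_conv(x, weights, dilation):
    """Causal 1D convolution: y[t] = sum_i weights[i] * x[t - i * dilation].

    Positions before the start of the sequence are treated as zeros,
    so y[t] never depends on inputs later than t.
    """
    y = []
    for t in range(len(x)):
        acc = 0.0
        for i, w in enumerate(weights):
            j = t - i * dilation
            if j >= 0:
                acc += w * x[j]
        y.append(acc)
    return y


def residual_block(x, weights, dilation):
    """Simplified block (assumption: one conv per block): x + ReLU(conv(x))."""
    conv = causal_dilated_conv(x, weights, dilation)
    activated = [max(0.0, v) for v in conv]
    return [xi + ai for xi, ai in zip(x, activated)]


def tcn(x, layer_weights, dilations):
    """Stack residual blocks, one per (weights, dilation) pair."""
    for weights, dilation in zip(layer_weights, dilations):
        x = residual_block(x, weights, dilation)
    return x


def receptive_field(kernel_size, dilations):
    """Receptive field of the stack above, with one conv per block."""
    return 1 + (kernel_size - 1) * sum(dilations)


if __name__ == "__main__":
    impulse = [1.0] + [0.0] * 9
    dilations = [1, 2, 4]            # doubling dilations, a common choice
    weights = [[1.0, 1.0]] * 3       # kernel size 2, hypothetical values
    print(tcn(impulse, weights, dilations))
    print(receptive_field(2, dilations))
```

Feeding in a unit impulse shows how far back each output can see: with kernel size 2 and dilations 1, 2, 4 the response spans exactly `receptive_field(2, [1, 2, 4])` time steps, so doubling the dilation per layer grows the history exponentially with depth.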
Check this for another comparative study.