Self-attention eidetic 3D-LSTM: Video prediction models for traffic flow forecasting

Neurocomputing(2022)

引用 7|浏览9
暂无评分
摘要
Video prediction is extremely challenging in a traffic flow forecasting problem due to dynamic spatiotemporal dependence. Eidetic 3D convolutional long short-term memory (E3D-LSTM) network, a state-of-the-art video prediction model, proposes a gate-controlled self-attention module called recall gate in the LSTM mechanism to make the present memory state interact with its historical records for long-term relations. Instead of using the gate-controlled self-attention mechanism, we introduce a query-key-value self-attention mechanism into E3D-LSTM for long-term relations from the perspectives of algorithm (internal) and network architecture (external). As for the algorithm (internal) perspective, we replace the original recall gate inside the E3D-LSTM cell with a query-key-value self-attention (SA) module. While for the network architecture (external) perspective, we propose an independent residual query-key-value self-attention (RSA) block outside E3D-LSTM networks in conjunction with the original recall gate. In a traffic flow forecasting problem, we find that both the models from the internal perspective and the external one, named SAE3D-LSTM and RSA-E3D-LSTM, outperform E3D-LSTM and seven other baseline models on three traffic datasets. This validates the effectiveness of the query-key-value self-attention mechanism for long-term relations. Furthermore, we find that SAE3D-LSTM performs better than RSA-E3D-LSTM. This indicates that the query-key-value self-attention mechanism alone can capture long-term relations, dispensing with the gate-controlled self-attention mechanism.
更多
查看译文
关键词
Video prediction,Traffic flow forecasting,Deep learning,Self-attention mechanism,Long short-term memory (LSTM) network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要