On encoding temporal evolution for real-time action prediction

arXiv: Computer Vision and Pattern Recognition(2018)

引用 23|浏览30
暂无评分
摘要
Anticipating future actions is a key component ofintelligence, specifically when it applies to realtimesystems, such as robots or autonomous cars.While recent works have addressed prediction ofraw RGB pixel values, we focus on anticipating themotion evolution in future video frames. To thisend, we construct dynamic images (DIs) by summarisingmoving pixels through a sequence of futureframes. We train a convolutional LSTMs topredict the next DIs based on an unsupervisedlearning process, and then recognise the activity associatedwith the predicted DI. We demonstrate theeffectiveness of our approach on 3 benchmark actiondatasets showing that despite running on videoswith complex activities, our approach is able toanticipate the next human action with high accuracyand obtain better results than the state-of-the-artmethods.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要