RATM: Recurrent Attentive Tracking Model

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)(2016)

引用 52|浏览171
暂无评分
摘要
We present an attention-based modular neural framework for computer vision. The framework uses a soft attention mechanism allowing models to be trained with gradient descent. It consists of three modules: a recurrent attention module controlling where to look in an image or video frame, a feature-extraction module providing a representation of what is seen, and an objective module formalizing why the model learns its attentive behavior. The attention module allows the model to focus computation on task-related information in the input. We apply the framework to several object tracking tasks and explore various design choices. We experiment with three data sets, bouncing ball, moving digits and the real-world KTH data set. The proposed Recurrent Attentive Tracking Model performs well on all three tasks and can generalize to related but previously unseen sequences from a challenging tracking data set.
更多
查看译文
关键词
recurrent attentive tracking model,RATM,attention-based modular neural framework,computer vision,soft attention mechanism,gradient descent,video frame,objective module,task-related information,bouncing ball,moving digits,KTH data set
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要