Action Recognition In The Presence Of One Egocentric And Multiple Static Cameras
Computer Vision - ACCV 2014, Part V (2014)
Abstract
We study the problem of recognizing human actions in the presence of a single egocentric camera and multiple static cameras. Some actions are better captured by static cameras, where the whole body of the actor and the context of the action are visible. Others are better recognized from egocentric cameras, where subtle hand movements and complex object interactions are visible. We introduce a model that benefits from the best of both worlds by learning to predict the importance of each camera for recognizing the action in each frame. By jointly and discriminatively learning latent camera-importance variables and action classifiers, our model achieves successful results on the challenging CMU-MMAC dataset. Our experiments show a significant gain from learning to use the cameras according to their predicted importance. The learned latent variables also provide a level of scene understanding that enables automatic cinematography: smoothly switching between cameras to maximize the amount of relevant information in each frame.
Keywords
Action Recognition, Near Neighbor, Multiple Camera, Static Camera, Visual Hull
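
The smooth camera switching mentioned in the abstract can be illustrated with a small sketch. The abstract does not describe the actual procedure, so everything here is an assumption: `relevance[t][c]` is a hypothetical per-frame importance score for camera `c` at frame `t`, `switch_penalty` is a hypothetical cost charged for each change of camera, and the function name `select_cameras` is made up for illustration. The sketch uses Viterbi-style dynamic programming to pick one camera per frame so that total relevance minus switching cost is maximal; it is not the paper's learning or inference method.

```python
from typing import List


def select_cameras(relevance: List[List[float]], switch_penalty: float) -> List[int]:
    """Pick one camera per frame, maximizing total relevance minus a
    fixed penalty for every camera switch (Viterbi-style dynamic programming).

    relevance[t][c] is a hypothetical importance score for camera c at
    frame t; it stands in for the learned latent variables and is an
    assumption of this sketch.
    """
    if not relevance:
        return []
    n_cams = len(relevance[0])
    # best[c]: best total score of any camera sequence ending at camera c.
    best = list(relevance[0])
    # back[t - 1][c]: camera used at frame t - 1 on the best path ending at c.
    back: List[List[int]] = []

    for frame in relevance[1:]:
        new_best = []
        pointers = []
        for c in range(n_cams):
            # Staying on camera c is free; arriving from another camera costs the penalty.
            prev = max(
                range(n_cams),
                key=lambda p: best[p] - (0.0 if p == c else switch_penalty),
            )
            cost = 0.0 if prev == c else switch_penalty
            new_best.append(best[prev] - cost + frame[c])
            pointers.append(prev)
        best = new_best
        back.append(pointers)

    # Trace the best path backwards from the best final camera.
    cam = max(range(n_cams), key=lambda c: best[c])
    path = [cam]
    for pointers in reversed(back):
        cam = pointers[cam]
        path.append(cam)
    path.reverse()
    return path


# Toy scores for two cameras over five frames (made-up numbers).
scores = [[0.9, 0.1], [0.4, 0.5], [0.9, 0.1], [0.1, 0.9], [0.2, 0.8]]
print(select_cameras(scores, switch_penalty=0.0))  # [0, 1, 0, 1, 1]
print(select_cameras(scores, switch_penalty=0.5))  # [0, 0, 0, 1, 1]
```

With no penalty the selection follows the per-frame best camera and flickers at the second frame; with a penalty of 0.5 the brief switch is suppressed and the sequence changes camera only once.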