Learning from Video and Text via Large-Scale Discriminative Clustering

2017 IEEE International Conference on Computer Vision (ICCV), 2017

Abstract
Discriminative clustering has been successfully applied to a number of weakly supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, and object co-segmentation and co-localization in videos and images. One drawback of discriminative clustering, however, is its limited scalability. We address this issue and propose an online optimization algorithm based on the Block-Coordinate Frank-Wolfe algorithm. We apply the proposed method to the problem of weakly supervised learning of actions and actors from movies together with their corresponding movie scripts. Scaling up the learning problem to 66 feature-length movies enables us to significantly improve weakly supervised action recognition.
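
To make the Block-Coordinate Frank-Wolfe idea concrete, here is a minimal sketch on a toy problem. It is not the paper's discriminative clustering objective: the quadratic cost, the per-row simplex constraint, and the names `bcfw_simplex` and `objective` are all assumptions chosen for illustration. Each iteration samples one block, solves a linear minimization over that block's feasible set, and moves toward the resulting vertex.

```python
import random


def objective(Y, targets):
    # Toy cost (an assumption, not the paper's): 0.5 * sum of squared differences.
    return 0.5 * sum(
        (y - t) ** 2 for y_row, t_row in zip(Y, targets) for y, t in zip(y_row, t_row)
    )


def bcfw_simplex(targets, num_iters=2000, seed=0):
    """Block-Coordinate Frank-Wolfe on the toy problem
    min_Y 0.5 * sum_i ||Y[i] - targets[i]||^2, with each row Y[i] in the simplex.
    Each row of Y is one block.
    """
    rng = random.Random(seed)
    n, k = len(targets), len(targets[0])
    # Start every block at the uniform distribution, which is feasible.
    Y = [[1.0 / k] * k for _ in range(n)]
    for _ in range(num_iters):
        i = rng.randrange(n)  # sample one block uniformly at random
        grad = [Y[i][j] - targets[i][j] for j in range(k)]
        # Linear minimization oracle over the simplex:
        # the vertex whose coordinate has the smallest gradient entry.
        j_star = min(range(k), key=lambda j: grad[j])
        d = [(1.0 if j == j_star else 0.0) - Y[i][j] for j in range(k)]
        gap = -sum(g * dj for g, dj in zip(grad, d))  # block duality gap, >= 0
        denom = sum(dj * dj for dj in d)
        if gap <= 0.0 or denom == 0.0:
            continue  # this block is already optimal for the current iterate
        gamma = min(1.0, gap / denom)  # exact line search for a quadratic cost
        Y[i] = [y + gamma * dj for y, dj in zip(Y[i], d)]
    return Y


targets = [[0.7, 0.2, 0.1], [0.1, 0.1, 0.8]]
Y = bcfw_simplex(targets)
```

Only one block is touched per iteration, so the per-step cost does not grow with the number of blocks. That property is what makes block-coordinate variants attractive when the data set is large.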
Keywords
online optimization algorithm, Block-Coordinate Frank-Wolfe algorithm, weakly supervised action recognition, large-scale discriminative clustering, weakly supervised learning tasks, text-to-video alignment