Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation

ICME Workshops(2014)

引用 13|浏览53
暂无评分
摘要
We participated in the Huawei Accurate and Fast Mobile Video Annotation Challenge (MoVAC) at IEEE ICME 2014. Three result runs were submitted by combining different features and classification techniques, with emphasis on both accuracy and efficiency. In this paper, we briefly summarize the techniques used in our system, and the components used for generating each of the three submitted results. One novel component in our system is a specially tailored deep neural network (DNN) that can explore the relationships of multiple features for improved annotation performance, which is very efficient based on an implementation with the GPU. Only 18.8 seconds were needed by one of our DNN-based submissions to process a test video. By combining the DNN with the traditional SVM learning, we achieved the best accuracy across all the worldwide submissions to this challenge.
更多
查看译文
关键词
video annotation,deep neural network,video signal processing,image fusion,svm learning,learning (artificial intelligence),huawei challenge,video classification,mobile video annotation challenge,movac,image classification,multimodal feature fusion,dnn,ieee icme 2014,mobile computing,support vector machines,neural nets,multimodal features,feature extraction,vectors,mel frequency cepstral coefficient,learning artificial intelligence,accuracy,visualization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要