MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving

IEEE Conference on Computer Vision and Pattern Recognition(2022)

引用 4|浏览57
暂无评分
摘要
Human risky behavior in driving is an important visual recognition problem. In this paper, we propose a multi-view temporal action localization system based on the grayscale video to achieve action recognition in naturalistic driving. Specifically, we adopted SwinTransformer as feature extractor, and a single framework to detect boundary and class at the same time. Also, we improve multiple loss function for explicit constraints of embedded feature distributions. Our proposed framework achieves the overall F1-score of 0.3154 on A2 dataset.
更多
查看译文
关键词
important visual recognition problem,multiview temporal action localization system,grayscale video,action recognition,naturalistic driving,feature extractor,single framework,embedded feature distributions,MV-TAL,mulit-view temporal action localization,human risky behavior
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要