TED-CS: Textual Enhanced Sensitive Video Detection with Common Sense Knowledge.

Bihui Yu, Linzhuang Sun,Jingxuan Wei, Shuyue Tan, Yiman Zhao,Liping Bu

ADMA (2)(2023)

引用 0|浏览0
暂无评分
摘要
In the era of short videos, the task of sensitive video detection faces new challenges with the increasing diversity and quantity of videos in the network. Aiming at the problem that existing research methods are constrained by missing video comments and user subjectivity, a novel text enhancement method is proposed for sensitive video detection. Firstly, based on the CLIP pre-training model, image caption are generated for video frames. Then, through the integration of external common sense knowledge, the method extracts deep contextual information from the generated captions, including the underlying intentions and purposes conveyed by the text. Besides, considering the complementarity and redundancy between different sources of information, a multi-source data collaborative encoding mechanism and a multi-modal feature fusion mechanism are designed to achieve semantic feature alignment. Finally, state-of-the-art was achieved on the two public datasets NPDI-800 and Pornography-2k, and a large number of detailed comparison and ablation experiments were performed to verify the effectiveness of the method.
更多
查看译文
关键词
common sense knowledge,detection,video
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要