A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion

IEEE Transactions on Multimedia(2016)

引用 50|浏览79
暂无评分
摘要
Keyword spotting remains a challenge when applied to real-world environments with dramatically changing noise. In recent studies, audio-visual integration methods have demonstrated superiorities since visual speech is not influenced by acoustic noise. However, for visual speech recognition, individual utterance mannerisms can lead to confusion and false recognition. To solve this problem, a novel ...
更多
查看译文
关键词
Visualization,Speech recognition,Speech,Feature extraction,Shape,Acoustics,Spatiotemporal phenomena
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要