A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion
IEEE Transactions on Multimedia(2016)
摘要
Keyword spotting remains a challenge when applied to real-world environments with dramatically changing noise. In recent studies, audio-visual integration methods have demonstrated superiorities since visual speech is not influenced by acoustic noise. However, for visual speech recognition, individual utterance mannerisms can lead to confusion and false recognition. To solve this problem, a novel ...
更多查看译文
关键词
Visualization,Speech recognition,Speech,Feature extraction,Shape,Acoustics,Spatiotemporal phenomena
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络