14.4 A scalable speech recognizer with deep-neural-network acoustic models and voice-activated power gating.

ISSCC (2017)

Citations: 88 | Views: 72
Abstract
The applications of speech interfaces, commonly used for search and personal assistants, are diversifying to include wearables, appliances, and robots. Hardware-accelerated automatic speech recognition (ASR) is needed for scenarios that are constrained by power, system complexity, or latency. Furthermore, a wakeup mechanism, such as voice activity detection (VAD), is needed to power gate the ASR and downstream system. This paper describes IC designs for ASR and VAD that improve on the accuracy, programmability, and scalability of previous work.
Keywords
scalable speech recognizer, deep neural network acoustic model, voice-activated power gating, speech interface application, hardware-accelerated automatic speech recognition, ASR, system complexity, latency, wakeup mechanism, downstream system, IC design, voice activity detection, VAD