Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

Abstract
Recurrent neural network (RNN) language models (LMs) and Long Short Term Memory (LSTM) LMs, a variant of RNN LMs, have been shown to outperform traditional N-gram LMs on speech recognition tasks. However, these models are computationally more expensive than N-gram LMs during decoding, and thus challenging to integrate into speech recognizers. Recent research has proposed lattice-rescoring algorithms using RNN LMs and LSTM LMs as an efficient strategy for integrating these models into a speech recognition system. In this paper, we evaluate existing lattice-rescoring algorithms, along with new variants, on a YouTube speech recognition task. Lattice rescoring using LSTM LMs reduces the word error rate (WER) for this task by 8% relative to the WER obtained using an N-gram LM.
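As a rough illustration of the kind of lattice-rescoring algorithm the abstract refers to, the sketch below implements push-forward rescoring with n-gram history merging, one standard variant from this line of work. It is not the paper's implementation: the lattice layout (`arcs`, `topo_order`), the `lstm_logprob` stub standing in for a real LSTM LM query, and the scoring weights are all assumptions chosen for brevity.

```python
from collections import defaultdict

def lstm_logprob(history, word):
    """Stand-in for an LSTM LM query: log P(word | history).
    A real system would run the LSTM forward over the history."""
    return -1.0  # constant stub, for illustration only

def rescore_lattice(arcs, topo_order, start, end, ngram=3, lm_weight=0.5):
    """Push-forward lattice rescoring sketch.
    arcs: {node: [(next_node, word, acoustic_logprob), ...]}
    topo_order: nodes of the acyclic lattice in topological order.
    Partial hypotheses reaching a node are merged when their last
    (ngram - 1) words agree, keeping only the highest-scoring one;
    this caps the number of LM states carried through the lattice.
    """
    # hyps[node] = {truncated_history: (total_score, full_history)}
    hyps = defaultdict(dict)
    hyps[start][()] = (0.0, ())
    for node in topo_order:
        for _, (score, hist) in hyps[node].items():
            for nxt, word, ac in arcs.get(node, []):
                lm = lstm_logprob(hist, word)
                new_score = score + ac + lm_weight * lm
                new_hist = hist + (word,)
                key = new_hist[-(ngram - 1):]
                cur = hyps[nxt].get(key)
                if cur is None or new_score > cur[0]:
                    hyps[nxt][key] = (new_score, new_hist)
    if not hyps[end]:
        return None
    return max(hyps[end].values())  # (best score, best word sequence)
```

Merging on a truncated history trades exactness for speed: with a larger `ngram`, fewer hypotheses are merged and the rescoring is closer to exhaustive, at higher LM-query cost.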
Keywords
LSTM, language modeling, lattice rescoring, speech recognition