Syllable based keyword search: Transducing syllable lattices to word lattices

SLT(2014)

引用 8|浏览17
暂无评分
摘要
This paper presents a weighted finite state transducer (WFST) based syllable decoding and transduction framework for keyword search (KWS). Acoustic context dependent phone models are trained from word forced alignments. Then syllable decoding is done with lattices generated using a syllable lexicon and language model (LM). To process out-of-vocabulary (OOV) keywords, pronunciations are produced using a grapheme-to-syllable (G2S) system. A syllable to word lexical transducer containing both in-vocabulary (IV) and OOV keywords is then constructed and composed with a keyword-boosted LM transducer. The composed transducer is then used to transduce syllable lattices to word lattices for final KWS. We show that our method can effectively perform KWS on both IV and OOV keywords, and yields up to 0.03 Actual Term-Weighted Value (ATWV) improvement over searching keywords directly in subword lattices. Word Error Rates (WER) and KWS results are reported for three different languages.
更多
查看译文
关键词
keyword-boosted lm transducer,atwv,finite state machines,syllable lattices-word lattices transduction,weighted finite state transducer based syllable decoding,syllable lexicon,keyword search,actual term-weighted value,in-vocabulary keywords,oov keywords,language model,lm,information retrieval,out-of-vocabulary keywords,vocabulary,syllable decoding,word error rates,grapheme-to-syllable system,acoustic signal processing,word forced alignments,wfst,speech coding,g2s system,iv keywords,lattice transduction,natural language processing,kws,syllable based keyword search,speech recognition,syllable-word lexical transducer,acoustic context dependent phone models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要