Multi-class Model M

Acoustics, Speech and Signal Processing(2011)

引用 7|浏览75
暂无评分
摘要
Model M, a novel class-based exponential language model, has been shown to significantly outperform word n-gram models in state-of-the-art machine translation and speech recognition systems. The model was motivated by the observation that shrinking the sum of the parameter magnitudes in an exponential language model leads to better performance on unseen data. Being a class-based language model, Model M makes use of word classes that are found automatically from training data. In this paper, we extend Model M to allow for different clusterings to be used at different word positions. This is motivated by the fact that words play different roles depending on their position in an n-gram. Experiments on standard NIST and GALE Arabic-to-English development and test sets show improvements in machine translation quality as measured by automatic evaluation metrics.
更多
查看译文
关键词
language translation,speech recognition,GALE Arabic-to-English development,NIST,class-based exponential language model,machine translation,multiclass model M,n-gram model,speech recognition system,Language Modeling,Machine Translation,Maximum-Entropy Models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要