Using Continuous Lexical Embeddings To Improve Symbolic-Prosody Prediction In A Text-To-Speech Front-End

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2016)

引用 35|浏览65
暂无评分
摘要
The prediction of symbolic prosodic categories from text is an important, but challenging, natural-language processing task given the various ways in which an input can be realized, and the fact that knowledge about what features determine this realization is incomplete or inaccessible to the model. In this work, we look at augmenting baseline features with lexical representations that are derived from text, providing continuous embeddings of the lexicon in a lower-dimensional space. Although learned in an unsupervised fashion, such features capture semantic and syntactic properties that make them amenable for prosody prediction. We deploy various embedding models on prominence-and phrase-break prediction tasks, showing substantial gains, particularly for prominence prediction.
更多
查看译文
关键词
word embeddings,prominence prediction,prosodic phrasing,speech synthesis,deep learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要