A Comparison of Speech-to-Speech Neural Network Methodologies for Digit Pronunciation

springer

引用 0|浏览0
暂无评分
摘要
In this work, the classical problem of digit recognition and pronunciation from an audio source in Spanish is revisited and compared with that of directly teaching a deep neural network to pronounce the corresponding digit. While the first approach roughly corresponds to that of most current speech processing methodologies that intend to identify and reconstruct phonetic units, before performing any task from reproducing to translation, the second approach is rarely found in the literature despite the fact that it is clearly more biologically inspired than the first one. Advantages and disadvantages of both methodologies are discussed based on the obtained results.
更多
查看译文
关键词
Speech processing, Neural networks, Pattern recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要