Shared resources for robust speech-to-text technology

INTERSPEECH(2003)

引用 31|浏览24
暂无评分
摘要
Abstract This paper ,describes ,ongoing ,efforts at Linguistic ,Data Consortium to create shared resources for improved,speech-totext technology. Under the DARPA EARS program, technology providers are charged with creating STT systems whose outputs are substantially richer and much,more accurate than is currently ,possible. These aggressive program ,goals motivate new,approaches,to corpus,creation and distribution. EARS participants ,require ,multilingual ,broadcast ,and telephone speech data, transcripts and annotations at a much higher volume,than for any previous program. While standard approaches,to resource ,collection ,and ,creation ,are prohibitively expensive for this volume of material, within EARS new ,methods ,have been established to allow ,for the development of vast quantities of audio, transcripts and annotations. New ,distribution methods ,also provide ,for efficient deployment ,of needed ,resources ,to participating research sites as well ,as enabling ,eventual publication to a wider community,of language researchers.
更多
查看译文
关键词
speech to text
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要