The rich transcription 2006 Spring Meeting Recognition Evaluation

MACHINE LEARNING FOR MULTIMODAL INTERACTION(2006)

引用 156|浏览0
暂无评分
摘要
We present the design and results of the Spring 2006 (RT-06S) Rich Transcription Meeting Recognition Evaluation; the fourth in a series of community-wide evaluations of language technologies in the meeting domain. For 2006, we supported three evaluation tasks in two meeting sub-domains: the Speech-To-Text (STT) transcription task, and the "Who Spoke When" and "Speech Activity Detection" diarization tasks. The meetings were from the Conference Meeting, and Lecture Meeting sub-domains. The lowest STT word error rate, with up to four simultaneous speakers, in the multiple distant microphone condition was 46.3% for the conference sub-domain, and 53.4% for the lecture sub-domain. For the "Who Spoke When" task, the lowest diarization error rates for all speech were 35.8% and 24.0% for the conference and lecture sub-domains respectively. For the "Speech Activity Detection" task, the lowest diarization error rates were 4.3% and 8.0% for the conference and lecture sub-domains respectively.
更多
查看译文
关键词
diarization task,lowest diarization error rate,spring meeting recognition evaluation,evaluation task,transcription task,first-time experimental proof-of-concept task,stt task,recognition evaluation,rich transcription,rich transcription spring,community-wide evaluation,relative reduction,language technology,speech to text,proof of concept,speech activity detection,word error rate,error rate
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要