A Spoken Dialog Corpus for Car Telematics Services

msra(2005)

引用 7|浏览35
暂无评分
摘要
Spoken corpora provide a critical resource for research, development and evaluation of spoken dialog systems. This chapter describes the spoken dialog corpus used in the design of CAMMIA (Conversational Agent for Multimedia Mobile Information Access), which employs a novel dialog management system that allows users to switch dialog tasks in a flexible manner. The corpus for car telematics services was collected from 137 male and 113 female speakers. The age distribution of speakers is balanced in the five age brackets of 20’s, 30’s, 40’s, 50’s, and 60’s. Analysis of the gathered dialogs reveals that the average number of dialog tasks per speaker was 8.1. The three most frequentlyrequested types of information in the corpus were traffic information, tourist attraction information, and restaurant information. Analysis of speaker utterances shows that the implied vocabulary size is approximately 5,000 words. The results are used for development and evaluation of automatic speech recognition (ASR) and dialog management software.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要