Does My Speech Rock? Automatic Assessment Of Public Speaking Skills

Lucas Azaïs,Adrien Payan, Tianjiao Sun, Guillaume Vidal, Tina Zhang,Eduardo Coutinho,Florian Eyben,Björn W. Schuller

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5(2015)

引用 25|浏览55
暂无评分
摘要
In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5 -point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker -independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra segmental audio features, with prosodic ones being particularly suited.
更多
查看译文
关键词
Automatic Public Speech Assessment, database, classification, regression, prosody
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要