A Study of Speech Recognition, Speech Translation, and Speech Summarization of TED English Lectures

Kazumasa Yamamoto, Haruhiko Banno, Haruki Sakurai, Toichiro Adachi,Seiichi Nakagawa

2023 IEEE 12th Global Conference on Consumer Electronics (GCCE)(2023)

引用 0|浏览0
暂无评分
摘要
Our research focuses on developing an automatic speech recognition system for English lectures, which involves summarizing the content and providing Japanese subtitles. Subtitling the entire audio of an English lecture could hinder comprehension and readability, so a summarization system is desired. By employing the DNN-HMM based speech recognition system, we achieved an 88% word accuracy for recognizing TED lecture speeches. Speech translation results showed a lower BLEU score of approximately 14% compared to text translation. Conversely, speech summarization proved its robustness to speech recognition errors, as the extracted important sentences were almost the same as those in the text summarization process.
更多
查看译文
关键词
speech recognition,speech translation,speech summarization,lecture speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要