Spoken content retrieval and understanding using deep learning

Impact(2021)

引用 0|浏览11
暂无评分
摘要
Spoken content refers to all content over the Internet which includes human voice, essentially those in multimedia, such As YouTube videos and online courses. Today such content is retrieved via Google primarily based on human-generated text labels, because Google can only retrieve text over the Internet. The goal of this project is to produce technologies to retrieve accurately and efficiently such spoken content directly based on the included audio sounds instead of text labels, because machines today can listen to human voice just as they can read the text. The long term goal is to create a spoken version of Google, which may revolutionize the ways in which humans access information and improve their knowledge. Professor Lin-shan Lee at National Taiwan University is leading this project. He has been a distinguished leader in the global scientific community for the area of teaching machines to speak and listen to human voice for many years.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要