Deep Learning for High-Speed Laryngeal Imaging Analysis

2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)(2023)

引用 0|浏览5
暂无评分
摘要
High-speed imaging of the larynx provides a valuable means for studying vocal folds function and vibratory behaviors. Using laryngeal high-speed videoendoscopy (HSV) with a flexible nasolaryngoscope, one can record the detailed vibratory movements of vocal folds during connected speech. This high-speed imaging tool enables us to study the normal function of the vocal folds and how this function can be disrupted due to the presence of voice disorders. In this work, HSV data were utilized during connected speech from subjects with normophonic voices (no voice disorders) and a neurological voice disorder. The data were obtained using a high-speed camera, coupled with a flexible endoscope, at 4,000 frames per second. Deep learning was used for the analysis of the big HSV dataset to extract the vibratory behaviors of the vocal folds. This deep-learning-based tool achieved high levels of accuracy for analysis of challenging HSV data in connected speech. This tool provides a computationally cost-effective and an accurate measurement approach that could help design more advanced voice assessment protocols in future.
更多
查看译文
关键词
Deep Learning,Image Analysis,Machine Learning,High-Speed Imaging,Voice Disorders
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要