Tuning methodology for speech enhancement algorithms using a simulated conversational database and perceptual objective measures

Hands-free Speech Communication and Microphone Arrays (2014)

Abstract
In this paper, we propose a formal methodology for tuning the parameters of a single-microphone speech enhancement system for hands-free devices. The tuning problem is formulated as a large-scale nonlinear programming problem, which is solved by a genetic algorithm to determine the global solution. A conversational speech database is generated automatically by modeling the interactivity in telephone conversations, and perceptual objective quality measures serve as the optimization criteria for automated tuning over the generated database. A subjective listening test then compares the system tuned automatically on objective criteria with the system tuned by expert human listeners. Subjective and objective evaluation results show that the proposed automated tuning methodology greatly improves the enhanced speech quality, potentially saving resources over manual evaluation, shortening development and deployment time, and guiding the algorithmic design.
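The tuning loop described in the abstract can be illustrated with a minimal sketch of a genetic algorithm searching a bounded parameter space. Everything in it is an assumption made for illustration: the abstract does not specify the genetic operators, the population size, the tuned parameters, or the perceptual measure. The function `toy_quality` is a hypothetical stand-in for "run the enhancement system over the conversational database and average a perceptual objective score", and `tune` is a hypothetical helper name.

```python
import random


def toy_quality(params):
    # Hypothetical stand-in for a perceptual objective measure averaged
    # over the speech database. Higher is better; the peak sits at `target`.
    target = [0.3, 0.7, 0.5]
    return -sum((p - t) ** 2 for p, t in zip(params, target))


def tune(objective, bounds, pop_size=30, generations=60,
         mutation_rate=0.2, seed=0):
    """Maximize `objective` over box-constrained parameters with a simple GA.

    Assumed operators (not taken from the paper): elitist selection,
    uniform crossover, and Gaussian mutation clipped to the bounds.
    """
    rng = random.Random(seed)

    def random_individual():
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    def clip(value, lo, hi):
        return max(lo, min(hi, value))

    population = [random_individual() for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=objective, reverse=True)
        elite = ranked[: pop_size // 5]  # best 20% survive unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            # Uniform crossover: each gene comes from one of the two parents.
            child = [rng.choice(pair) for pair in zip(a, b)]
            # Gaussian mutation, scaled to each parameter's range.
            child = [
                clip(g + rng.gauss(0.0, 0.1 * (hi - lo)), lo, hi)
                if rng.random() < mutation_rate else g
                for g, (lo, hi) in zip(child, bounds)
            ]
            children.append(child)
        population = elite + children
    return max(population, key=objective)


if __name__ == "__main__":
    best = tune(toy_quality, bounds=[(0.0, 1.0)] * 3)
    print("best parameters:", best)
    print("objective value:", toy_quality(best))
```

In a real setup the objective would be far more expensive than `toy_quality`, since each evaluation would process the whole database, so the population size and generation count would be chosen with that cost in mind. A genetic algorithm of this kind is a heuristic search and does not by itself guarantee that the returned parameters are globally optimal.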
Keywords
audio databases, echo suppression, genetic algorithms, microphones, nonlinear programming, speech enhancement, enhanced speech quality, formal methodology, genetic algorithm, hands-free devices, large-scale nonlinear programming problem, perceptual objective measures, simulated conversational database, single-microphone speech enhancement system, speech database, speech enhancement algorithms, subjective listening test, telephone conversations, tuning methodology, tuning problem, acoustic echo cancellation, conversation analysis, perceptual objective quality, acoustics, databases, speech, tuning, noise