SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data

ICASSP (2010)

Abstract
Two new methods for locating multiple sound sources using a single segment of data from a large-aperture microphone array are presented. Both methods employ the proven-robust steered response power using the phase transform (SRP-PHAT) as a functional. To cluster the data points into highly probable regions containing global peaks, the first method fits a Gaussian mixture model (GMM), whereas the second one sequentially finds the points with highest SRP-PHAT values that most likely represent different clusters. Then the low-cost global optimization method, stochastic region contraction (SRC), is applied to each cluster to find the global peaks. We test the two methods using real data from five simultaneous talkers in a room with high noise and reverberation. Results are presented and discussed.
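The second method and the per-cluster SRC step can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `toy_power` is a hypothetical two-peak stand-in for the SRP-PHAT functional, `pick_seeds`, `contract_region`, and `locate` are illustrative names, the minimum separation `min_dist` and the search box half-width are made-up values, and the contraction loop is a simplified form of stochastic region contraction (sample a box, keep the best points, shrink the box to their bounding box).

```python
import random


def toy_power(x, y):
    """Hypothetical stand-in for an SRP-PHAT functional: two smooth peaks."""
    p1 = 1.0 / (1.0 + (x - 1.0) ** 2 + (y - 1.0) ** 2)
    p2 = 0.8 / (1.0 + (x - 4.0) ** 2 + (y - 3.0) ** 2)
    return p1 + p2


def pick_seeds(points, values, n_sources, min_dist):
    """Sequentially take the highest-valued points that lie at least
    min_dist apart, so each seed likely represents a different cluster."""
    order = sorted(range(len(points)), key=lambda i: values[i], reverse=True)
    seeds = []
    for i in order:
        px, py = points[i]
        far_enough = all(
            (px - sx) ** 2 + (py - sy) ** 2 >= min_dist ** 2 for sx, sy in seeds
        )
        if far_enough:
            seeds.append(points[i])
        if len(seeds) == n_sources:
            break
    return seeds


def contract_region(f, box, rng, n_samples=200, n_keep=20, tol=1e-3, max_iter=100):
    """Simplified stochastic region contraction on a 2-D box (assumed form)."""
    (x0, x1), (y0, y1) = box
    best = None
    for _ in range(max_iter):
        samples = [(rng.uniform(x0, x1), rng.uniform(y0, y1)) for _ in range(n_samples)]
        if best is not None:
            samples.append(best)  # never lose the best point found so far
        samples.sort(key=lambda p: f(*p), reverse=True)
        top = samples[:n_keep]
        best = top[0]
        # Shrink the search box to the bounding box of the best samples.
        x0, x1 = min(p[0] for p in top), max(p[0] for p in top)
        y0, y1 = min(p[1] for p in top), max(p[1] for p in top)
        if max(x1 - x0, y1 - y0) < tol:
            break
    return best


def locate(n_sources=2, seed=0):
    """Evaluate the functional at random points, pick seeds, refine each one."""
    rng = random.Random(seed)
    points = [(rng.uniform(0.0, 5.0), rng.uniform(0.0, 4.0)) for _ in range(2000)]
    values = [toy_power(x, y) for x, y in points]
    seeds = pick_seeds(points, values, n_sources, min_dist=1.5)
    half = 0.75  # assumed half-width of the search box around each seed
    return [
        contract_region(
            toy_power, ((sx - half, sx + half), (sy - half, sy + half)), rng
        )
        for sx, sy in seeds
    ]


if __name__ == "__main__":
    for x, y in locate():
        print(f"peak near ({x:.2f}, {y:.2f})")
```

In a real system the random evaluation points would be replaced by SRP-PHAT values computed from the array data, and the search box for each cluster would come from the clustering step rather than a fixed half-width.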
Keywords
optimisation, SRP-PHAT method, steered response power, GMM, large-aperture microphone array, SRC, microphones, global optimization, stochastic region contraction, phase transform, acoustic arrays, acoustic signal processing, acoustic radiators, Gaussian processes, microphone arrays, acoustic position measurement, Gaussian mixture model, clustering algorithms, data engineering, speech, acoustic noise, speech processing, acoustical engineering, signal to noise ratio, reverberation