Automatic keyword selection for keyword search development and tuning

ICASSP(2014)

引用 16|浏览66
暂无评分
摘要
In this paper, we investigate the problem of automatically selecting textual keywords for keyword search development and tuning on audio data for any language. Briefly, the method samples candidate keywords in the training data while trying to match a set of target marginal distributions for keyword features such as keyword frequency in the training or development audio, keyword length, frequency of out-of-vocabulary words, and TF-IDF scores. The method is evaluated on four IARPA Babel program base period languages. We show the use of the automatically selected keywords for the keyword search system development and tuning. We show also that search performance is improved by tuning the decision threshold on the automatically selected keywords.
更多
查看译文
关键词
keyword search,speech processing,keyword search turning,automatic textual keyword selection,tf-idf scores,speech recognition,vocabulary,target marginal distributions,spoken term detection,iarpa babel program base period languages,keyword length,audio data,development audio,natural language processing,audio signal processing,query selection,keyword search development,keyword frequency,keyword features,keyword selection,out-of-vocabulary word frequency,query processing,training data,training audio,acoustics,tuning,nist,speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要