RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric

Xingmei Wang, Jiaxiang Meng, Bin Wen,Fuzhao Xue

Neurocomputing(2022)

引用 4|浏览14
暂无评分
摘要
Few-shot speaker recognition task is to identify speakers from limited support samples. We argue that query samples and support samples are both informative for classification. To help Prototypical Networks capture information from query samples, this paper proposes the relation-based indefinite distance metric attentive correction prototype network (RACP). Since the mean prototype deviates from the ideal prototype, we calculate attention scores for each query sample to customize the attention prototype. Then, to compensate for the missed query samples information, the prototype is further refined by correction data that is constructed by combining query samples with the global class attention score. Later, the indefinite distance metric of Relation Networks is introduced on Prototypical Networks, and the relation scores between the sample prototypes and the query samples are calculated for final prediction. Compare with existing methods, RACP can consider both query samples and support samples instead of ignoring the query ones. We compare RACP with strong baselines (e.g. GMM-SVM, MAML, Prototypical Networks, Res32, and VGG11). Ablation study and generalizability study of different scenarios are also conducted on different datasets. Results show that RACP achieves better performance and generalization ability.
更多
查看译文
关键词
Few-shot learning,Speaker recognition,Prototypical networks,Indefinite distance metric,Attention
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要