Optimizing Speaker Embeddings using Meta-Training Sets

APSIPA (2020)

Abstract
This paper presents a method to learn speaker embeddings for text-independent speaker verification. The proposed method aims to optimize embeddings for unseen enrollment/test speakers by training a network with a meta-training set. The main procedure consists of two steps. The first step generates a meta-training set: a set of episodes, each with a pair of intra-episode training and testing sets. The second step optimizes the network parameters so that the average verification performance over the generated episodes is maximized. An advantage of our approach is that it is complementary to studies focusing on network structure, and we demonstrate its effectiveness with recent ResNet-based models in experiments on the VoxCeleb dataset.
Keywords
Text-Independent Speaker Verification, Speaker Embedding, Neural Networks
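
The first step described in the abstract, generating a meta-training set of episodes, might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the abstract does not say how speakers or utterances are sampled, so the uniform sampling, the function names (`make_episode`, `make_meta_training_set`), and the parameters (`n_speakers`, `n_train`, `n_test`) are all hypothetical. The second step (optimizing the network over the episodes) is omitted because the abstract does not specify the objective.

```python
import random
from typing import Dict, List, Tuple

# Hypothetical types: an utterance is represented by an ID or file path.
Utterance = str
SpeakerPool = Dict[str, List[Utterance]]
Episode = Tuple[SpeakerPool, SpeakerPool]  # (intra-episode training set, testing set)


def make_episode(pool: SpeakerPool, n_speakers: int, n_train: int,
                 n_test: int, rng: random.Random) -> Episode:
    """Sample one episode: a few speakers, each with disjoint train/test utterances.

    Assumption: speakers and utterances are drawn uniformly without replacement.
    """
    need = n_train + n_test
    # Only speakers with enough utterances can fill both sets.
    eligible = [spk for spk, utts in pool.items() if len(utts) >= need]
    speakers = rng.sample(eligible, n_speakers)

    train: SpeakerPool = {}
    test: SpeakerPool = {}
    for spk in speakers:
        utts = rng.sample(pool[spk], need)
        train[spk] = utts[:n_train]   # plays the role of enrollment data
        test[spk] = utts[n_train:]    # plays the role of test data
    return train, test


def make_meta_training_set(pool: SpeakerPool, n_episodes: int,
                           n_speakers: int = 5, n_train: int = 1,
                           n_test: int = 2, seed: int = 0) -> List[Episode]:
    """Build a meta-training set as a list of independently sampled episodes."""
    rng = random.Random(seed)
    return [make_episode(pool, n_speakers, n_train, n_test, rng)
            for _ in range(n_episodes)]
```

Under these assumptions, training would then iterate over the returned episodes, score each episode's testing utterances against embeddings derived from its training utterances, and update the network to improve the average verification performance across episodes.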