首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions
【24h】

Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions

机译:基于最近邻的i-vector归一化,可在看不见的信道条件下实现可靠​​的说话人识别

获取原文

摘要

Many state-of-the-art speaker recognition engines use i-vectors to represent variable-length acoustic signals in a fixed low-dimensional total variability subspace. While such systems perform well under seen channel conditions, their performance greatly degrades under unseen channel scenarios. Accordingly, rapid adaptation of i-vector systems to unseen conditions has recently attracted significant research effort from the community. To mitigate this mismatch, in this paper we propose nearest neighbor based i-vector mean normalization (NN-IMN) and i-vector smoothing (IS) for unsupervised adaptation to unseen channel conditions within a state-of-the-art i-vector/PLDA speaker verification framework. A major advantage of the approach is its ability to handle multiple unseen channels without explicit retraining or clustering. Our observations on the DARPA Robust Automatic Transcription of Speech (RATS) speaker recognition task suggest that part of the distortion caused by an unseen channel may be modeled as an offset in the i-vector space. Hence, the proposed nearest neighbor based normalization technique is formulated to compensate for such a shift. Experimental results with the NN based normalized i-vectors indicate that, on average, we can recover 46% of the total performance degradation due to unseen channel conditions.
机译:许多最先进的说话人识别引擎都使用i矢量来表示固定的低维总可变性子空间中的可变长度声音信号。尽管此类系统在可见的信道条件下表现良好,但在看不见的信道情况下其性能会大大降低。因此,i-矢量系统快速适应看不见的状况最近吸引了来自社区的大量研究工作。为了缓解这种不匹配,在本文中,我们提出了基于最近邻的i向量均值归一化(NN-IMN)和i向量平滑化(IS),以在最新i向量中无监督地适应看不见的信道条件/ PLDA说话者验证框架。该方法的主要优点是它能够处理多个看不见的频道,而无需进行明确的重新训练或聚类。我们对DARPA语音自动转录(RATS)说话人识别任务的观察表明,由看不见的声道引起的部分失真可以建模为i向量空间中的偏移量。因此,所提出的基于最近邻的归一化技术被公式化以补偿这种偏移。基于NN的归一化i向量的实验结果表明,平均而言,由于看不见的信道条件,我们可以恢复总性能下降的46%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号