首页> 外文会议>INTERSPEECH 2012 >A Two Stage Mask Estimation Approach to Robust Speaker Verification
【24h】

A Two Stage Mask Estimation Approach to Robust Speaker Verification

机译:一种强大的扬声器验证的两个阶段掩模估计方法

获取原文

摘要

We propose a two-stage mask estimation approach to robust speaker verification (SV) in noise environments. We consider a practical semi-blind SV scenario: the location of the target speaker is fixed while the locations of al1 interferers are unknown. In the first stage, we use a dual-microphone and a semi-blind degenerate unmixing estimation technique (DUET) to estimate an initial binary mask. In the second stage, we refine the mask based on the time and frequency histograms of the initial mask. As a result, only highly reliable time-frequency components in the spectro-temporal features are kept for downstream verification. Experiments show that the proposed approach is superior to a baseline MFCC approach and a recent local SNR based mask estimation approach.
机译:我们提出了一种在噪声环境中强大的扬声器验证(SV)的两级掩模估计方法。我们考虑一个实用的半盲SV场景:目标扬声器的位置是固定的,而Al1干扰源的位置未知。在第一阶段,我们使用双麦克风和半盲解析解密估计技术(Duet)来估计初始二进制掩模。在第二阶段,我们基于初始掩模的时间和频率直方图来优化掩模。结果,仅频谱 - 时间特征中的高度可靠的时频分量被保存用于下游验证。实验表明,该方法优于基线MFCC方法和最近的本地SNR基础掩模估计方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号