【24h】

Robust feature front-end for speaker identification

机译:强大的前端功能可识别说话人

获取原文
获取原文并翻译 | 示例

摘要

One important challenge for speaker identification (SID) system is sustained performance in diverse conditions. This study presents a novel front-end feature extraction method for SID in clean, noisy, and channel-mismatched acoustic conditions. To address the problem, the perceptual minimum variance distortionless response (PMVDR) feature is employed. While PMVDR has been successfully used for noisy ASR, it has not been considered for SID. We also incorporate longer temporal speaker knowledge based on the shifted delta cepstral (SDC) algorithm. The evaluation over YOHO and another new diversified Robust Open-Set Speaker Identification (ROSSI) database show that both PMVDR and the union with SDC can improve performance significantly. Compared with traditional feature extraction, PMVDR and PMVDR-SDC always give improvement across diverse adverse conditions. Also, PMVDR-SDC can contribute additional improvement in the presence of noise and channel mismatch.
机译:说话人识别(SID)系统的一项重要挑战是在各种条件下的持续性能。这项研究提出了一种新的前端特征提取方法,用于在干净,嘈杂和通道不匹配的声学条件下进行SID。为了解决该问题,采用了感知最小方差无失真响应(PMVDR)功能。虽然PMVDR已成功用于嘈杂的ASR,但尚未将其用于SID。我们还结合了基于位移三角倒谱(SDC)算法的较长的时间说话者知识。通过YOHO和另一个新的多样化的健壮开放式说话人识别(ROSSI)数据库进行的评估表明,PMVDR和与SDC的结合都可以显着提高性能。与传统特征提取相比,PMVDR和PMVDR-SDC始终可以在各种不利条件下进行改进。此外,在存在噪声和通道失配的情况下,PMVDR-SDC可以做出其他改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号