首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >SPARSE-BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION
【24h】

SPARSE-BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION

机译:基于稀疏的健壮说话人识别音频模型

获取原文
获取原文并翻译 | 示例

摘要

The mismatch between the training and the testing environments greatly degrades the performance of speaker recognition. Although many robust techniques have been proposed, speaker recognition in mismatch condition is still a challenge. To solve this problem, we propose a sparse-based auditory model as the front-end of speaker recognition by simulating auditory processing of speech signal. To this end, we introduce narrow-band filter-bank instead of the widely used wide-band filter-bank to simulate the basilar membrane filter-bank, use sparse representation as the approximation of basilar membrane coding strategy, and incorporate the frequency selectivity enhance mechanism between tectorial membrane and basilar membrane by practical engineering approximation. Compared with the standard Mel-frequency cepstral coefficient approach, our preliminary experimental results indicate that the sparse-based auditory model consistently improve the robustness of speaker recognition in mismatched condition.
机译:培训和测试环境之间的不匹配会大大降低说话者识别的性能。尽管已经提出了许多鲁棒的技术,但是失配条件下的说话人识别仍然是一个挑战。为了解决这个问题,我们通过模拟语音信号的听觉处理,提出了一种基于稀疏的听觉模型作为说话人识别的前端。为此,我们引入了窄带滤波器组,而不是广泛使用的宽带滤波器组来模拟基底膜滤波器组,使用稀疏表示作为基底膜编码策略的近似,并结合了频率选择性增强实际工程上的近似,说明了被膜与基底膜之间的相互作用机理。与标准的梅尔频率倒谱系数方法相比,我们的初步实验结果表明,基于稀疏的听觉模型在不匹配的情况下能够持续提高说话人识别的鲁棒性。

著录项

  • 来源
  • 作者单位

    School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;

    School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;

    School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;

    School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    sparse representation; selectivity gain; robust feature; speaker recognition;

    机译:稀疏表示选择性增益;强大的功能;说话人识别;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号