首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization
【24h】

LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization

机译:LP残留功能可实现健壮,对隐私敏感的说话人区分

获取原文

摘要

We present a comprehensive study of linear prediction residual for speaker diarization on single and multiple distant microphone conditions in privacy-sensitive settings, a requirement to analyze a wide range of spontaneous conversations. Two representations of the residual are compared, namely real-cepstrum and MFCC, with the latter performing better. Experiments on RT06eval show that residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope yields a performance close to traditional MFCC features. As a way to objectively evaluate privacy in terms of linguistic information, we perform phoneme recognition. Residual features yield low phoneme accuracies compared to traditional MFCC features.
机译:我们提出了一个线性预测残差的综合研究,用于在隐私敏感的环境中对单个和多个远距离麦克风条件下的说话人进行说话人区分,这是分析各种自发性对话的要求。比较了残差的两种表示形式,即实倒谱和MFCC,后者表现更好。 RT06eval上的实验表明,具有2.5 kHz至3.5 kHz的子带信息以及频谱斜率的残差产生的性能接近传统MFCC功能。作为客观评估语言信息隐私权的一种方法,我们执行音素识别。与传统的MFCC功能相比,残留功能产生的音素准确性较低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号