...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations
【24h】

Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations

机译:无言的声音:使用保护隐私的音频表示实现鲁棒的扬声器分离

获取原文
获取原文并翻译 | 示例
           

摘要

This paper investigates robust privacy-sensitive audio features for speaker diarization in multiparty conversations: i.e., a set of audio features having low linguistic information for speaker diarization in a single and multiple distant microphone scenarios. We systematically investigate Linear Prediction (LP) residual. Issues such as prediction order and choice of representation of LP residual are studied. Additionally, we explore the combination of LP residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope. Next, we propose a supervised framework using deep neural architecture for deriving privacy-sensitive audio features. We benchmark these approaches against the traditional Mel Frequency Cepstral Coefficients (MFCC) features for speaker diarization in both the microphone scenarios. Experiments on the RT07 evaluation dataset show that the proposed approaches yield diarization performance close to the MFCC features on the single distant microphone dataset. To objectively evaluate the notion of privacy in terms of linguistic information, we perform human and automatic speech recognition tests, showing that the proposed approaches to privacy-sensitive audio features yield much lower recognition accuracies compared to MFCC features.
机译:本文研究了健壮的隐私敏感音频功能,用于多方对话中的说话人差异化:即,一组音频功能具有低语言信息,用于在单个和多个远距离麦克风场景中进行说话人差异化。我们系统地研究线性预测(LP)残差。研究了预测顺序和LP残差表示的选择等问题。此外,我们探索了LP残差与2.5 kHz至3.5 kHz的子带信息以及频谱斜率的组合。接下来,我们提出一种使用深度神经网络架构的受监督框架,以导出对隐私敏感的音频功能。我们将这两种方法与传统的“梅尔频率倒谱系数”(MFCC)功能进行了基准测试,以在两种麦克风场景中实现扬声器的二值化。在RT07评估数据集上进行的实验表明,所提出的方法产生的分离性能接近于单个远距离麦克风数据集上的MFCC特征。为了客观地评估语言信息方面的隐私概念,我们执行了人工和自动语音识别测试,表明与MFCC功能相比,针对隐私敏感的音频功能的拟议方法产生的识别精度要低得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号