首页> 外文会议>International Conference on Human Language Technologies - The Baltic Perspective >Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training
【24h】

Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

机译:在自动语音识别声学模型培训中使用隐私转换的演讲

获取原文

摘要

Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is an interest in the collection of anonymized speech data that is processed by some voice conversion method. In this paper, we evaluate one of the voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show the effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and the effectiveness of using privacy-transformed data in ASR training.
机译:自动语音识别(ASR)需要大量的真实用户语音数据来达到最先进的性能。 但是,语音数据会传染敏感的扬声器属性,如可以被推断和剥削恶意目的的身份。 因此,对由某种语音转换方法处理的匿名语音数据的集合有兴趣。 在本文中,我们评估了拉脱维亚语音数据的语音转换方法之一,并调查了隐私转换的数据是否可用于改善ASR声学模型。 结果表明,语音转换对拉脱维亚语音的最先进的扬声器验证模型的有效性以及在ASR培训中使用隐私转换数据的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号