Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

机译：在自动语音识别声学模型培训中使用隐私转换的演讲

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is an interest in the collection of anonymized speech data that is processed by some voice conversion method. In this paper, we evaluate one of the voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show the effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and the effectiveness of using privacy-transformed data in ASR training.

机译：自动语音识别（ASR）需要大量的真实用户语音数据来达到最先进的性能。但是，语音数据会传染敏感的扬声器属性，如可以被推断和剥削恶意目的的身份。因此，对由某种语音转换方法处理的匿名语音数据的集合有兴趣。在本文中，我们评估了拉脱维亚语音数据的语音转换方法之一，并调查了隐私转换的数据是否可用于改善ASR声学模型。结果表明，语音转换对拉脱维亚语音的最先进的扬声器验证模型的有效性以及在ASR培训中使用隐私转换数据的有效性。

著录项

来源
《International Conference on Human Language Technologies - The Baltic Perspective》|2020年|263p|共8页
会议地点
作者
Askars SALIMBAJEVS;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP312-53;
关键词
Automatic speech recognition; Voice conversion; Privacy; Anonymization; Evaluation; Automatic speaker verification;

机译：自动语音识别;语音转换;隐私;匿名化;评估;自动扬声器验证;

相似文献

外文文献
中文文献
专利

1. Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies [J] . Cui Xiaodong, Zhang Wei, Finkler Ulrich, IEEE Signal Processing Magazine . 2020,第3期

机译：自动语音识别深神经网络声学模型的分布式训练：当前训练策略的比较
2. Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition [J] . Michael L. Seltzer, Alex Acero IEEE transactions on audio, speech and language processing . 2007,第1期

机译：使用混合带宽训练数据训练语音识别的宽带声学模型
3. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model [J] . He Di, Lim Boon Pang, Yang Xuesong, The Journal of the Acoustical Society of America . 2018,第6aPta1期

机译：声学地标包含与具有深度神经网络声学模型的自动语音识别的其他帧的更多信息
4. Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training [C] . Askars SALIMBAJEVS International Conference on Human Language Technologies - The Baltic Perspective . 2020

机译：在自动语音识别声学模型培训中使用隐私转换的演讲
5. Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition. [D] . Liu, Yuzong. 2016

机译：用于自动语音识别的声学建模中基于图的半监督学习。
6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O] . Byeongwook Lee, Kwang-Hyun Cho -1

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
7. Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training [O] . Askars Salimbajevs 2020

机译：在自动语音识别声学模型培训中使用隐私转换的演讲

Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

摘要

著录项

相似文献

相关主题

期刊订阅