首页> 外文期刊>Sadhana: Academy Proceedings in Engineering Science >Studies on inter-speaker variability in speech and its application in automatic speech recognition
【24h】

Studies on inter-speaker variability in speech and its application in automatic speech recognition

机译:语音中说话人之间的变异性及其在自动语音识别中的应用研究

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we give an overview of the problem of inter-speaker variability and its study in many diverse areas of speech signal processing. We first give an overview of vowel-normalization studies that minimize variations in the acoustic representation of vowel realizations by different speakers. We then describe the universal-warping approach to speaker normalization which unifies many of the vowel normalization approaches and also shows the relation between speech production, perception and auditory processing. We then address the problem of inter-speaker variability in automatic speech recognition (ASR) and describe techniques that are used to reduce these effects and thereby improve the performance of speaker-independent ASR systems.
机译:在本文中,我们概述了扬声器之间的可变性问题,并研究了语音信号处理的许多不同领域。我们首先给出元音归一化研究的概述,这些研究可最大程度地减少不同说话者的元音实现的声学表示变化。然后,我们描述通用的扭曲说话人归一化方法,该方法统一了许多元音归一化方法,并且还显示了语音产生,感知和听觉处理之间的关系。然后,我们解决了自动语音识别(ASR)中说话人间差异的问题,并描述了用于减少这些影响并从而提高独立于说话者的ASR系统性能的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号