Conference: International Speech Communication Association

Prosody for Mandarin Speech Recognition: A Comparative Study of Read and Spontaneous Speech



Abstract

In this paper, we present a comparative study of spontaneous and read Mandarin speech in the context of automatic speech recognition. We focus on the analysis and modeling of prosodic features, based on a unique speech corpus that contains similar amounts of read and spontaneous speech data from the same group of speakers. Statistical analysis is carried out on tone contours and on the duration of syllable and sub-syllable units. Speech recognition experiments are performed to evaluate the effectiveness of different approaches to incorporating prosodic features into acoustic modeling. A key problem addressed is how to deal with unvoiced frames, where F0 values are unavailable. We apply the technique of multi-space distribution (MSD) to model partially continuous F0 contours. For spontaneous speech, the tonal-syllable error rate is reduced from the MFCC baseline of 64.8% to 59.4% with the MSD-based prosody model. For read speech, the error rate falls from 46.0% to 36.4%.
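
To illustrate how an MSD can score a partially continuous F0 contour, here is a minimal sketch. It is not the authors' implementation. It assumes the simplest two-space form: a zero-dimensional space for unvoiced frames and a single one-dimensional Gaussian for voiced frames. The function names, the use of `None` to mark an unvoiced frame, and the parameter values in the usage note are all hypothetical.

```python
import math
from typing import Optional, Sequence


def msd_log_likelihood(f0: Optional[float], voiced_weight: float,
                       mean: float, var: float) -> float:
    """Log-likelihood of one frame under a two-space MSD (illustrative).

    f0 is None for an unvoiced frame, which falls in the zero-dimensional
    space and is scored by the space weight alone. Otherwise f0 is a
    (log-)F0 value scored by the voiced-space weight times a 1-D Gaussian.
    """
    if not 0.0 < voiced_weight < 1.0:
        raise ValueError("voiced_weight must lie strictly between 0 and 1")
    if var <= 0.0:
        raise ValueError("var must be positive")
    if f0 is None:
        # Unvoiced: no continuous observation, only the space weight.
        return math.log(1.0 - voiced_weight)
    log_gauss = -0.5 * (math.log(2.0 * math.pi * var) + (f0 - mean) ** 2 / var)
    return math.log(voiced_weight) + log_gauss


def contour_log_likelihood(contour: Sequence[Optional[float]],
                           voiced_weight: float, mean: float,
                           var: float) -> float:
    """Sum frame log-likelihoods over a contour with voiced and unvoiced frames."""
    return sum(msd_log_likelihood(f0, voiced_weight, mean, var)
               for f0 in contour)
```

For example, `contour_log_likelihood([None, 5.1, 5.0, None], 0.7, 5.0, 0.04)` scores a four-frame contour without interpolating or zero-filling the two unvoiced frames, which is the point of treating them as a separate space.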
