首页> 外文会议>6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16-Oct.20 2000 Beijing International Convention Center, Beijing, China >Consonant Discrimination in Elicited and Spontaneous Speech: A Case for Signal-Adaptive Front Ends in Asr
【24h】

Consonant Discrimination in Elicited and Spontaneous Speech: A Case for Signal-Adaptive Front Ends in Asr

机译:自发和自发性语音中的辅音歧视:Asr中信号自适应前端的一种情况

获取原文

摘要

The constant frame length in typical ASR front ends is too long to capture transient phenomena in speech, such as stop bursts. However, current HMM systems have consistently outperformed systems based solely on non-uniform units. this work investigates an approach to "add back" such transient information to a speech recognizer, without losing the robustness of the standard acoustic models. We demonstrate a set of phonetically-motivated acoustic features that discriminate a preliminary test set of highly ambiguous voiceless stops in CV contexts. The features are automatically computed from data that had been hand-marked for consonant burst location and voicing onset (extension to automatic marking is also proposed). Two corpora are processed using a parallel set of features: conversational speech over the telephone (Switchboard), and a corpus of carfully elicited speech. The latter provides an upper bound on discrimination ,and allows for comparison of feature usage across speaking style. We explore data-driven appraoches to obtaining variable-length time-localized features compatible with an HMM statistical framework. We also suggest techniques for extension to automatic annotation of burst location, for computation of features at such points, and for augmentation of an HMM system with the added information.
机译:典型的ASR前端的恒定帧长度太长,无法在语音中捕获瞬态现象,例如止动突发。然而,目前的HMM系统完全基于非均匀单位始终如一的系统。这项工作调查了“将”这种瞬态信息添加到语音识别器的方法,而不会失去标准声学模型的稳健性。我们展示了一组声学动力的声学特征,可以在CV上下文中区分一组高度模糊的声音停止的初步测试集。这些功能从已被手动标记为辅音突发位置的数据和发声开始(也提出了自动标记的扩展)。使用并行特征组合处理两种Corpora:通过电话(交换机)的会话演讲,以及伪造的语音的语料库。后者在歧视方面提供了上限,并允许在口语风格中进行特征使用。我们探索数据驱动的批准,以获得与HMM统计框架兼容的可变长度的时间局限性。我们还建议扩展到自动注释突发位置的技术,用于计算这些点处的特征,以及用于使用添加信息的HMM系统的增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号