首页> 外文OA文献 >Feature-based pronunciation modeling for automatic speech recognition
【2h】

Feature-based pronunciation modeling for automatic speech recognition

机译:基于特征的语音自动语音识别建模

摘要

Spoken language, especially conversational speech, is characterized by great variability in word pronunciation, including many variants that differ grossly from dictionary prototypes. This is one factor in the poor performance of automatic speech recognizers on conversational speech. One approach to handling this variation consists of expanding the dictionary with phonetic substitution, insertion, and deletion rules. Common rule sets, however, typically leave many pronunciation variants unaccounted for and increase word confusability due to the coarse granularity of phone units. We present an alternative approach, in which many types of variation are explained by representing a pronunciation as multiple streams of linguistic features rather than a single stream of phones. Features may correspond to the positions of the speech articulators, such as the lips and tongue, or to acoustic or perceptual categories. By allowing for asynchrony between features and per-feature substitutions, many pronunciation changes that are difficult to account for with phone-based models become quite natural. Although it is well-known that many phenomena can be attributed to this "semi-independent evolution" of features, previous models of pronunciation variation have typically not taken advantage of this. In particular, we propose a class of feature-based pronunciation models represented as dynamic Bayesian networks (DBNs).
机译:口语,尤其是会话语音,其特征是单词发音的变化很大,包括许多与词典原型完全不同的变体。这是自动语音识别器在会话语音上性能较差的一个因素。处理此变体的一种方法是使用语音替换,插入和删除规则来扩展字典。然而,由于电话单元的粗粒度,通用规则集通常使许多发音变体无法说明,并增加了单词的易混淆性。我们提出了一种替代方法,其中通过将发音表示为语言特征的多个流而不是单个电话流来解释多种类型的变体。特征可以对应于语音发音器的位置,例如嘴唇和舌头,或者对应于声学或感知类别。通过允许功能和按功能替换之间的异步,许多基于电话的模型难以解释的发音更改变得非常自然。尽管众所周知,许多现象都可以归因于这种特征的“半独立演变”,但是以前的语音变化模型通常没有利用这一点。特别是,我们提出了一类基于特征的发音模型,表示为动态贝叶斯网络(DBN)。

著录项

  • 作者

    Livescu Karen 1975-;

  • 作者单位
  • 年度 2005
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号