首页> 外文会议>Automatic Speech Recognition amp; Understanding, 2009. ASRU 2009 >A hierarchical structure for modeling inter and intra phonetic information for phoneme recognition
【24h】

A hierarchical structure for modeling inter and intra phonetic information for phoneme recognition

机译:用于建模内部和内部语音信息以进行音素识别的层次结构

获取原文

摘要

In this paper, we present a two-layer hierarchical structure based on neural networks for phoneme recognition. The proposed structure attempts to model only the characteristics within a phoneme, i.e., intra-phonetic information. This differs from other state-of-the-art hierarchical structures where the first layer typically models the intra-phonetic information while the second layer focuses on modeling the contextual (inter-phonetic) information. An advantage of the proposed model is that it can be added to another layer that focuses on the inter-phonetic information. In this paper, we also show that the categorization between intra- and inter-phonetic information also allows to extend other state-of-the-art hierarchical approaches. A phoneme accuracy of 77.89% is achieved on the TIMIT database, which compares favorably to the best results obtained on this database.
机译:在本文中,我们提出了一种基于神经网络的两层分层结构,用于音素识别。提出的结构试图仅对音素内的特征,即语音内信息建模。这与其他现有技术的层次结构不同,在其他层次结构中,第一层通常对语音内信息进行建模,而第二层则专注于对上下文(语音间)信息进行建模。所提出的模型的优点是可以将其添加到关注语音信息的另一层。在本文中,我们还表明,语音内信息和语音间信息之间的分类还可以扩展其他最新的分层方法。在TIMIT数据库上,音素准确度达到77.89%,与在该数据库上获得的最佳结果相比,该结果令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号