首页> 外文会议>International Conference on Pattern Recognition >Multilingual articulatory features augmentation learning
【24h】

Multilingual articulatory features augmentation learning

机译:多语言发音特征增强学习

获取原文
获取外文期刊封面目录资料

摘要

Articulatory features are used as an universal set of speech attributes shared across many different languages. Some multilingual and cross-language speech recognition systems using articulatory features have been shown to improve the performance. The existing articulatory features are defined by phonetician as a set of articulatory descriptions of phones, which represent some semantic information explaining how humans produce speech sounds via the interaction of different physiological structures. But these manually specified attributes suffer from the incomplete capturing articulation information of all languages and are not distinctive enough for accurate monolingual and multilingual phoneme recognition. In this paper, we are solving the problem of a more complete set of articulatory features representation by sparse coding methods. We learned the latent attributes that sparsely represent more speech articulation information sharing between English and Tibetan languages. Models based on the concatenated semantic and latent speech attributes performed the better accuracy over the existing methods in our experiments for English-Tibetan bilingual phone recognition.
机译:发音特征被用作在许多不同语言之间共享的通用语音属性集。一些使用发音特征的多语言和跨语言语音识别系统已被证明可以改善性能。语音学家将现有的发音特征定义为一组手机的发音描述,这些描述描述了一些语义信息,这些信息解释了人类如何通过不同生理结构的相互作用产生语音。但是,这些手动指定的属性遭受了所有语言的不完整捕获清晰度信息的困扰,并且不足以区分出准确的单语和多语音素。在本文中,我们正在解决通过稀疏编码方法实现更完整的发音特征表示的问题。我们了解到潜在的属性稀疏地代表了英语和藏语之间更多的语音清晰度信息共享。基于连接的语义和潜在语音属性的模型比我们现有的英语-藏语双语电话识别实验中的现有方法具有更好的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号