Multilingual articulatory features augmentation learning


Abstract

Articulatory features serve as a universal set of speech attributes shared across many languages. Some multilingual and cross-language speech recognition systems that use articulatory features have been shown to improve performance. Existing articulatory features are defined by phoneticians as a set of articulatory descriptions of phones; they carry semantic information explaining how humans produce speech sounds through the interaction of different physiological structures. However, these manually specified attributes do not completely capture the articulation information of all languages, and they are not distinctive enough for accurate monolingual and multilingual phoneme recognition. In this paper, we address the problem of building a more complete articulatory feature representation using sparse coding methods. We learn latent attributes that sparsely represent additional speech articulation information shared between English and Tibetan. In our experiments on English-Tibetan bilingual phone recognition, models based on the concatenated semantic and latent speech attributes achieved better accuracy than existing methods.
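The idea of augmenting semantic attributes with sparsely coded latent attributes can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which sparse coding algorithm, dictionary, or feature dimensions were used. The sketch assumes an L1-regularized objective solved by ISTA (iterative soft-thresholding), a fixed random dictionary in place of a learned one, and random arrays standing in for MFCC frames and semantic attribute scores. All names (`sparse_codes`, `semantic`, `latent`, `augmented`) and all sizes are hypothetical.

```python
import numpy as np


def soft_threshold(x, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)


def sparse_codes(X, D, lam=0.1, n_iter=200):
    """Approximately solve min_Z 0.5 * ||X - Z D||_F^2 + lam * ||Z||_1 with ISTA.

    X: (n_frames, n_dims) acoustic features.
    D: (n_atoms, n_dims) dictionary, one atom per row.
    Returns Z: (n_frames, n_atoms) sparse codes.
    """
    # Lipschitz constant of the gradient of the quadratic term.
    lipschitz = np.linalg.norm(D @ D.T, 2)
    Z = np.zeros((X.shape[0], D.shape[0]))
    for _ in range(n_iter):
        grad = (Z @ D - X) @ D.T
        Z = soft_threshold(Z - grad / lipschitz, lam / lipschitz)
    return Z


rng = np.random.default_rng(0)
n_frames, n_dims, n_atoms, n_semantic = 50, 13, 32, 8  # assumed sizes

# Assumption: random data stands in for MFCC frames.
X = rng.standard_normal((n_frames, n_dims))

# Assumption: a fixed random dictionary with unit-norm atoms; a real system
# would learn the dictionary from bilingual speech data.
D = rng.standard_normal((n_atoms, n_dims))
D /= np.linalg.norm(D, axis=1, keepdims=True)

# Assumption: random scores stand in for manually defined (semantic)
# articulatory attributes such as place or manner of articulation.
semantic = rng.random((n_frames, n_semantic))

# Latent attributes are the sparse codes of the acoustic frames.
latent = sparse_codes(X, D)

# Augmented representation: semantic and latent attributes concatenated per frame.
augmented = np.hstack([semantic, latent])
```

Each row of `augmented` would then be passed to a phone classifier in place of the semantic attributes alone. The L1 penalty weight `lam` controls how sparse the latent attributes are; larger values zero out more coefficients.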


Bibliographic details

Source: International Conference on Pattern Recognition, 2016, pp. 2895-2899 (5 pages)
Authors: Yue Zhao; Rui Zhao; Xiaoyang Wang; Qiang Ji
Original format: PDF
Keywords: Semantics; Speech; Speech recognition; Encoding; Dictionaries; Feature extraction; Mel frequency cepstral coefficient


Similar literature

1. Processing demands upon cognitive, linguistic, and articulatory functions promote grey matter plasticity in the adult multilingual brain: Insights from simultaneous interpreters [J]. Stefan Elmer, Jurgen Hanggi, Lutz Jancke. Cortex: A Journal Devoted to the Study of the Nervous System and Behavior. 2014
2. Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis [J]. Peter Birkholz, Lucia Martin, Yi Xu. Computer Speech and Language. 2017
3. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [J]. Ghosh P.K., Narayanan S. The Journal of the Acoustical Society of America. 2011
4. Multilingual Articulatory Features Augmentation Learning [C]. Yue Zhao, Rui Zhao, Xiaoyang Wang, Qiang Ji. International Conference on Pattern Recognition. 2016
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D]. Prabhavalkar, Rohit. 2013
6. Deep Learning-Based Detection of Articulatory Features in Arabic and English Speech [O]. Mohammed Algabri, Hassan Mathkour, Mansour M. Alsulaiman. 2021
7. Multilingual Articulatory Features [O]. Sebastian Stüker, Tanja Schultz, Florian Metze. 2003
