Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition

机译：基于Code-Constive Mattericulatory属性的电话机施工代码切换语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bilingual speakers are known for their ability to code-switch or mix their languages during communication. This phenomenon occurs when bilinguals substitute a word or phrase from one language with a phrase or word from another language. For code-switching speech recognition, it is essential to collect a large-scale code-switching speech database for model training. In order to ease the negative effect caused by the data sparseness problem in training code-switching speech recognizers, this study proposes a data-driven approach to phone set construction by integrating acoustic features and cross-lingual context-sensitive articulatory features into distance measure between phone units. KL-divergence and a hierarchical phone unit clustering algorithm are used in this study to cluster similar phone units to reduce the need of the training data for model construction. The experimental results show that the proposed method outperforms other traditional phone set construction methods.

机译：双语扬声器以其在通信期间编码或混合其语言的能力而闻名。当双语时，发生这种现象，当双语替换一个语言的单词或短语，用短语或单词来自另一种语言。对于代码切换语音识别，必须为模型培训收集大规模的代码切换语音数据库。为了缓解训练代码切换语音识别器中的数据稀疏问题引起的负面影响，本研究提出了一种通过将声学特征和交叉语言敏感的剖视特征集成到距离测量中的距离测量来实现数据驱动方法电话单位。 kl-divercence和分层电话单元聚类算法用于本研究以集群类似的电话机单元，以减少模型构造训练数据的需要。实验结果表明，该方法优于其他传统电话机施工方法。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2011年||共4页
会议地点
作者
Chung-Hsien Wu; Han-Ping Shen; Yan-Ting Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Xenophones: An investigation of phone set expansion in Swedish and implications for speech recognition and speech synthesis [J] . Robert Eklund, Anders Lindstrom Speech Communication . 2001,第1a2期

机译：Xenophones：对瑞典电话机扩展的调查及其对语音识别和语音合成的影响
2. Important Attributes Selection Based on Rough Set for Speech Emotion Recognition [J] . Jian Zhou, Guoyin Wang, Yong Yang International journal of cognitive informatics and natural intelligence . 2009,第3期

机译：基于粗糙集的语音情感识别重要属性选择
3. Phoneme Set Design Based on Integrated Acoustic and Linguistic Features for Second Language Speech Recognition [J] . Xiaoyun WANG, Tsuneo KATO, Seiichi YAMAMOTO IEICE transactions on information and systems . 2017,第4期

机译：基于语音和语言特性的音素集设计用于第二语言语音识别
4. Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition [C] . Wu, Chung-Hsien IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP . 2012

机译：基于上下文相关发音属性的电话机构造，用于代码转换语音识别
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [O] . Prasanta Kumar Ghosh, Shrikanth Narayanan -1

机译：使用从独立于受试者的声学到发音反转的发音特征进行自动语音识别
7. Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition [O] . Dong Yu, Sabato Marco Siniscalchi, Li Deng, 2012

机译：利用深度神经网络提高属性和电话估计精度，实现基于检测的语音识别

Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅