Phoneme set selection for Russian speech recognition

Abstract

In this paper, we describe a method for phoneme set selection based on a combination of phonological and statistical information, and its application to Russian speech recognition. For Russian, the phoneme sets currently in use are mostly rule-based or heuristically derived from the standard SAMPA or IPA phonetic alphabets. For some other languages, however, statistical methods have been found useful for phoneme set optimization. In Russian, almost all phonemes come in pairs: consonants can be hard or soft, and vowels stressed or unstressed. We start with a large phoneme set and then gradually reduce it by merging phoneme pairs. The decision on which pair to merge is based on phonetic pronunciation rules and on statistics obtained from the confusion matrix of phoneme recognition experiments. Applying this approach to the IPA Russian phonetic set, we first reduced it to 47 phonemes, which served as the initial set in the subsequent speech model training. Based on the phoneme confusion results, we derived several other phoneme sets of different sizes, down to 27 phonemes. Speech recognition experiments using these sets showed that the reduced phoneme sets perform better than the initial phoneme set for phoneme recognition and equally well for word-level speech recognition.
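
The merging loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the scoring rule or the stopping criterion, so the symmetric confusion rate, the `target_size` parameter, and all function names below are assumptions made for illustration. The sketch also folds the counts of a merged pair together instead of retraining models and re-running recognition after each merge.

```python
import numpy as np


def most_confused_pair(confusion, labels, candidate_pairs):
    """Pick the candidate pair with the highest symmetric confusion rate.

    The scoring rule is an assumption of this sketch, not taken from the paper.
    """
    index = {p: i for i, p in enumerate(labels)}
    totals = confusion.sum(axis=1)  # reference-token count per phoneme (row sums)
    best_pair, best_score = None, -1.0
    for a, b in candidate_pairs:
        if a not in index or b not in index:
            continue  # one side of the pair has already been merged away
        i, j = index[a], index[b]
        denom = totals[i] + totals[j]
        if denom == 0:
            continue  # no observations for this pair
        score = (confusion[i, j] + confusion[j, i]) / denom
        if score > best_score:
            best_pair, best_score = (a, b), score
    return best_pair, best_score


def merge_pair(confusion, labels, pair):
    """Fold phoneme b into phoneme a: add b's row and column to a's, drop b."""
    a, b = pair
    i, j = labels.index(a), labels.index(b)
    merged = np.array(confusion, dtype=float)  # work on a copy
    merged[i, :] += merged[j, :]
    merged[:, i] += merged[:, j]
    merged = np.delete(np.delete(merged, j, axis=0), j, axis=1)
    return merged, [p for p in labels if p != b]


def reduce_phoneme_set(confusion, labels, candidate_pairs, target_size):
    """Greedily merge rule-approved pairs until target_size phonemes remain."""
    confusion = np.array(confusion, dtype=float)
    labels = list(labels)
    merges = []
    while len(labels) > target_size:
        pair, _ = most_confused_pair(confusion, labels, candidate_pairs)
        if pair is None:
            break  # no mergeable candidate pair is left
        confusion, labels = merge_pair(confusion, labels, pair)
        merges.append(pair)
    return labels, merges


# Toy counts invented for illustration: rows are reference phonemes,
# columns are recognized phonemes. "tj" stands for soft t, "a_s" for stressed a.
labels = ["t", "tj", "a", "a_s"]
counts = [[80, 20, 0, 0],
          [30, 70, 0, 0],
          [0, 0, 90, 10],
          [0, 0, 5, 95]]
pairs = [("t", "tj"), ("a", "a_s")]  # pairs allowed by the phonological rules
reduced, merges = reduce_phoneme_set(counts, labels, pairs, target_size=3)
print(reduced, merges)
```

With these toy counts the hard/soft t pair is merged first, because its confusion rate (50 of 200 tokens, 0.25) exceeds that of the vowel pair (15 of 200, 0.075). Restricting the search to `candidate_pairs` is how the sketch represents the phonological side of the method: statistics only rank the pairs that the pronunciation rules already permit.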

Bibliographic record

Source: 7th International Conference on Natural Language Processing and Knowledge Engineering, 2011, pp. 475-478 (4 pages)
Conference location: Tokushima (JP)
Authors: Vazhenina Daria; Markov Konstantin
Affiliation: Human Interface Laboratory, The University of Aizu, Japan
Original format: PDF
Language: English
Chinese Library Classification: Information processing
Keywords: Phoneme set; Russian language; Speech recognition

Similar literature

1. Contribution from the Accuracy of Phoneme Recognition to the Quality of Automatic Recognition of Russian Speech [J]. I. A. Karpukhin. Moscow University Computational Mathematics and Cybernetics. 2016, No. 2
2. Phoneme Set Design Based on Integrated Acoustic and Linguistic Features for Second Language Speech Recognition [J]. Xiaoyun WANG, Tsuneo KATO, Seiichi YAMAMOTO. IEICE Transactions on Information and Systems. 2017, No. 4
3. Speech Recognition of English by Japanese Using Lexicon Represented by Multiple Reduced Phoneme Sets [J]. Xiaoyun WANG, Seiichi YAMAMOTO. IEICE Transactions on Information and Systems. 2015, No. 12
4. Phoneme set selection for Russian speech recognition [C]. Vazhenina Daria, Markov Konstantin. International Conference on Natural Language Processing and Knowledge Engineering. 2011
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005
6. Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech [O]. Santiago-Omar Caballero-Morales. 2013
7. Speech Recognition of English by Japanese Using Lexicon Represented by Multiple Reduced Phoneme Sets [O]. Xiaoyun WANG, Seiichi YAMAMOTO. 2015
8. Selection of Prototype Phonemes and Preparation of Referent Transcriptions for Speech Recognition Experiments by the Use of Spectral Stationarity [R]. Haltsonen, S. 1979
