Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers

机译：语音法口语识别：在并行电话识别器中使用多种适应的声学模型

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In phonotactic spoken language recognition systems, acoustic model adaptation prior to phone lattice decoding has been adopted to deal with the mismatch between training and test conditions. Moreover, combining diversified phonotactic features is commonly used. These motivate us to have an in-depth investigation of combining diversified phonotactic features from diversely adapted acoustic models. Our experiment shows that our approach achieves an equal error rate (EER) of 1.94% in the 30-second closed-set trials of the 2007 NIST Language Recognition Evaluation (LRE). It represents a 14.9% relative improvement in EER over a sophisticated system, in which parallel phone recognizers, speaker adaptive training (SAT) in acoustic models and CMLLR adaptation are used. Moreover, it is shown that our approach provides consistent and substantial improvements in three different phonotactic systems, in each of which a single phone recognizer is used.

机译：在音位练习口语识别系统中，已采用电话晶格解码之前的声学模型自适应来处理训练条件与测试条件之间的不匹配。而且，通常使用组合多种变音术特征。这些促使我们进行深入研究，以结合来自各种适应性声学模型的多种音律学特征。我们的实验表明，我们的方法在2007年NIST语言识别评估（LRE）的30秒封闭试验中实现了1.94％的均等错误率（EER）。与使用复杂的电话识别器，声学模型中的说话人自适应训练（SAT）和CMLLR自适应的复杂系统相比，它的EER相对提高了14.9％。此外，结果表明，我们的方法在三个不同的音律系统中提供了一致且实质性的改进，每个系统都使用一个电话识别器。

著录项

来源
《2012 8th International Symposium on Chinese Spoken Language Processing.》|2012年|p.108-111|共4页
会议地点 Hong Kong(HK);Hong Kong(HK)
作者
Leung Cheung-Chi; Ma Bin; Li Haizhou;
展开▼
作者单位

Institute for Infocomm Research, A*STAR, Singapore 138632;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;人工智能理论;
关键词
MLLR adaptation; phone lattice; phone recognizer; spoken language recognition;

机译：MLLR适应;电话格;电话识别器;口语识别;;
入库时间 2022-08-26 14:05:23

相似文献

外文文献
中文文献
专利

1. Language Recognition Based on Acoustic Diversified Phone Recognizers and Phonotactic Feature Fusion [J] . Yan DENG, Wei-Qiang ZHANG, Yan-Min QIAN, IEICE transactions on information and systems . 2011,第3期

机译：基于语音多样化电话识别和语音特征融合的语言识别
2. Language Recognition Based on Acoustic Diversified Phone Recognizers and Phonotactic Feature Fusion [J] . Yan DENG, Wei-Qiang ZHANG, Yan-Min QIAN, IEICE Transactions on Information and Systems . 2011,第3期

机译：基于语音多样化电话识别和语音特征融合的语言识别
3. Improved Modeling of Cross-Decoder Phone Co-Occurrences in SVM-Based Phonotactic Language Recognition [J] . Penagarikano M., Varona A., Rodriguez-Fuentes L. J., Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第8期

机译：基于支持向量机的语音策略语言识别中跨解码器电话共现的改进建模
4. Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers [C] . Leung Cheung-Chi, Ma Bin, Li Haizhou International Symposium on Chinese Spoken Language Processing . 2012

机译：语音语言识别：在并行电话识别器中使用多种调整的声学模型
5. American Sign Language recognition: Reducing the complexity of the task with phoneme-based modeling and parallel hidden Markov models. [D] . Vogler, Christian Philipp. 2003

机译：美国手语识别：通过基于音素的建模和并行隐马尔可夫模型，降低了任务的复杂性。
6. Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features bypassing the phoneme as recognition unit [O] . Denis Arnold, Fabian Tomaschek, Konstantin Sering, -1

机译：通过错误驱动的学习算法可以区分自发会话语音中的单词其准确性与人类类似可以从智能声学特征中区分出含义而绕过音素作为识别单元
7. Language recognition using phonotactic-based shifted delta coefficients and multiple phone recognizers [O] . DHaro Enriquez Luis Fernando, Cordoba Herralde Ricardo de, Salamea Palacios Christian Raúl, 2014

机译：使用基于音符的位移增量系数和多个电话识别器的语言识别

Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers

摘要

著录项

相似文献

相关主题

期刊订阅