首页> 外文会议> >Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition

【24h】

Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition

机译：丰富的Biphone与丰富的Triphone：自动语音识别中的语料库比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we compare the performance of a speech recognition system trained with two speech corpora. We select two set of words such that they covered all the cross-syllable bi-phones and tri-phones, and are called phonetically biphone-rich and triphone-rich respectively. It is required about 10 times more words than that of cross-syllable biphones to cover all the cross-syllable triphones. To facilitate fair comparison, the biphone-rich corpus is thus consisted often sets of words that each covers all the cross-syllable biphones. With those words as data sheets, a male Taiwanese speaker recorded all the words as microphone speech. The resulting speech corpora, about 100 minutes for each set, are used to train for the acoustic models. Although both perform quite well in tasks with recognition networks of linear net and free syllable net, the triphone-rich corpus does not show much advantages over the biphone-rich corpus.

机译：在本文中，我们比较了使用两个语音语料库训练的语音识别系统的性能。我们选择两组单词，以使它们覆盖所有交叉音节的双音节和三音节，分别在语音上被称为“富双音节”和“富三音节”。要覆盖所有的跨音节三音节，需要的单词比跨音节双音节的单词大约多十倍。为了促进公平的比较，富含双音节的语料库通常由一组单词组成，每个单词覆盖所有交叉音节的双音节。一位台湾男性讲者将这些单词作为数据表，将所有单词记录为麦克风语音。生成的语音语料库（每套大约100分钟）用于训练声学模型。尽管在线性网络和自由音节网络的识别网络中，两者的性能都很好，但是，富含三音素的语料库并没有比富含双音素的语料库显示更多优势。

著录项

来源
《》|2005年|P.194-197|共4页
会议地点
作者
Yong-Chang Yio; Min-Siong Liang; Yuang-Chin Chiang; Ren-Yuan Lyu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
speech recognition; automatic speech recognition; cross-syllable biphones corpus; cross-syllable triphones corpus; speech corpora; speech recognition system;

机译：语音识别;自动语音识别;跨音节双音节语料库;跨音节三音节语料库;语音语料库;语音识别系统;

相似文献

外文文献
中文文献
专利

1. Turkish speech corpora and recognition tools developed by porting SONIC: Towards multilingual speech recognition [J] . Oezguel Salor, Bryan L. Pellom, Tolga Ciloglu, Computer speech and language . 2007,第4期

机译：通过移植SONIC开发的土耳其语语料库和识别工具：迈向多语言语音识别
2. Review of Development of Speech corpora and speech recognition research in Hindi [J] . Dr.Harshalata Petkar International Journal of Engineering Research and Applications . 2017,第7期

机译：印地语语音语料库发展与语音识别研究述评
3. Evaluation of speech corpora for speech and speaker recognition systems [J] . Jacek SLIMOK, Jan KOTAS Pomiary Automatyka Kontrola . 2014,第6期

机译：语音和说话者识别系统的语音语料库评估
4. Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition [C] . Yong-Chang Yio, Min-Siong Liang, Yuang-Chin Chiang, IEEE International Workshop on Cellular Neural Networks and their Applications . 2005

机译：Biphone-Rich与Triphone-Rich：语音语音识别中的语音语音比较
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech [O] . Jodi Kodish-Wachs, Emin Agassi, Patrick Kenny III, 2018

机译：当代自动语音识别引擎用于对话式临床语音的系统比较
7. Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition [O] . T.R. Niesler, E. W. D. Whittaker, P.C. Woodland 1998

机译：语音识别的词性和自动派生基于类别的语言模型的比较

Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅