首页> 外文会议> >Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition
【24h】

Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition

机译:丰富的Biphone与丰富的Triphone:自动语音识别中的语料库比较

获取原文

摘要

In this paper, we compare the performance of a speech recognition system trained with two speech corpora. We select two set of words such that they covered all the cross-syllable bi-phones and tri-phones, and are called phonetically biphone-rich and triphone-rich respectively. It is required about 10 times more words than that of cross-syllable biphones to cover all the cross-syllable triphones. To facilitate fair comparison, the biphone-rich corpus is thus consisted often sets of words that each covers all the cross-syllable biphones. With those words as data sheets, a male Taiwanese speaker recorded all the words as microphone speech. The resulting speech corpora, about 100 minutes for each set, are used to train for the acoustic models. Although both perform quite well in tasks with recognition networks of linear net and free syllable net, the triphone-rich corpus does not show much advantages over the biphone-rich corpus.
机译:在本文中,我们比较了使用两个语音语料库训练的语音识别系统的性能。我们选择两组单词,以使它们覆盖所有交叉音节的双音节和三音节,分别在语音上被称为“富双音节”和“富三音节”。要覆盖所有的跨音节三音节,需要的单词比跨音节双音节的单词大约多十倍。为了促进公平的比较,富含双音节的语料库通常由一组单词组成,每个单词覆盖所有交叉音节的双音节。一位台湾男性讲者将这些单词作为数据表,将所有单词记录为麦克风语音。生成的语音语料库(每套大约100分钟)用于训练声学模型。尽管在线性网络和自由音节网络的识别网络中,两者的性能都很好,但是,富含三音素的语料库并没有比富含双音素的语料库显示更多优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号