首页> 外文会议>LREC-2012 >Speech and Language Resources for LVCSR of Russian
【24h】

Speech and Language Resources for LVCSR of Russian

机译:俄罗斯LVCSR的言语和语言资源

获取原文

摘要

A syllable-based language model reduces the lexicon size by hundreds of times. It is especially beneficial in case of highly inflective languages like Russian due to the abundance of word forms according to various grammatical categories. However, the main arising challenge is the concatenation of recognised syllables into the originally spoken sentence or phrase, particularly in the presence of syllable recognition mistakes. Natural fluent speech does not usually incorporate clear information about the outside borders of the spoken words. In this paper a method for the syllable concatenation and error correction is suggested and tested. It is based on the designed co-evolutionary asymptotic probabilistic genetic algorithm for the determination of the most likely sentence corresponding to the recognized chain of syllables within an acceptable time frame. The advantage of this genetic algorithm modification is the minimum number of settings to be manually adjusted comparing to the standard algorithm. Data used for acoustic and language modelling are also described here. A special issue is the preprocessing of the textual data, particularly, handling of abbreviations, Arabic and Roman numerals, since their inflection mostly depends on the context and grammar.
机译:基于音节的语言模型将Lexicon大小减少了数百次。由于根据各种语法类别,如果由于各种语法类别的单词形式的丰富,则特别有益。然而,主要出现的挑战是将公认音节串联成最初说的句子或短语,特别是在音节识别错误的存在。天然流畅的言论通常不会包含有关口语中的外界的清晰信息。在本文中,建议和测试了一种用于音节级联和纠错的方法。它基于设计的共同进化渐近概率遗传学遗传算法,用于确定对应于可接受的时间帧内识别的音节链的最可能句子。该遗传算法修改的优点是与标准算法进行比较的要手动调整的最小设置数。这里还描述了用于声学和语言建模的数据。特殊问题是文本数据的预处理,特别是处理缩写,阿拉伯语和罗马数字,因为它们的变化主要取决于背景和语法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号