首页> 外文会议>International Conference on Asian Language Processing >Improving the Accuracy of Large Vocabulary Continuous Speech Recognizer Using Dependency Parse Tree and Chomsky Hierarchy in Lattice Rescoring
【24h】

Improving the Accuracy of Large Vocabulary Continuous Speech Recognizer Using Dependency Parse Tree and Chomsky Hierarchy in Lattice Rescoring

机译:利用依赖性解析树和粗晶型晶体级校正的大词汇连续语音识别器的准确性

获取原文

摘要

This research work describes our approaches in using dependency parse tree information to derive useful hidden word statistics to improve the baseline system of Malay large vocabulary automatic speech recognition system. The traditional approaches to train language model are mainly based on Chomsky hierarchy type 3 that approximates natural language as regular language. This approach ignores the characteristics of natural language. Our work attempted to overcome these limitations by extending the approach to consider Chomsky hierarchy type 1 and type 2. We extracted the dependency tree based lexical information and incorporate the information into the language model. The second pass lattice rescoring was performed to produce better hypotheses for Malay large vocabulary continuous speech recognition system. The absolute WER reduction was 2.2% and 3.8% for MASS and MASS-NEWS Corpus, respectively.
机译:本研究工作描述了我们在使用依赖性解析树信息中的方法来推导有用的隐藏字统计,以改善马来语大型词汇自动语音识别系统的基线系统。 传统的培训语言模型的方法主要基于Chomsky层次类类型3,其近似于自然语言作为常规语言。 这种方法忽略了自然语言的特征。 我们的工作试图通过扩展Chomsky层次结构1和类型2.基于依赖树的词汇信息来克服这些限制来克服这些限制。我们将信息结合到语言模型中。 进行第二次通过格子生物扫描以产生较好的马来语大词汇连续语音识别系统的假设。 群众和大众新闻语料库的绝对萎缩减少为2.2%和3.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号