首页> 外文会议>European symposium on artificial neural networks, computational intelligence and machine learning >Combining higher-order N-grams and intelligent sample selection to improve language modeling for Handwritten Text Recognition
【24h】

Combining higher-order N-grams and intelligent sample selection to improve language modeling for Handwritten Text Recognition

机译:结合高阶N元语法和智能样本选择以改善用于手写文本识别的语言建模

获取原文

摘要

We combine two techniques to improve the language modeling component of a Handwritten Text Recognition (HTR) system. On the one hand, we apply a previously developed intelligent sample selection approach to language model adaptation for handwritten text recognition, which exploits a combination of in-domain and out-of-domain data for construction of language models. On the other hand, we apply rescoring methods to enable more complex language modeling in HTR. It is shown that these techniques complement each other very well, and that the combination leads to a significant error reduction in a practical HTR task for historical data.
机译:我们结合两种技术来改善手写文本识别(HTR)系统的语言建模组件。一方面,我们将先前开发的智能样本选择方法应用于用于手写文本识别的语言模型自适应,该方法利用域内和域外数据的组合来构建语言模型。另一方面,我们应用记录方法在HTR中启用更复杂的语言建模。结果表明,这些技术可以很好地相互补充,并且这种组合可以显着减少历史数据在实际HTR任务中的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号