首页> 外文会议>International conference on neural information processing >LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents
【24h】

LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents

机译:基于LDA的单词图像表示法在蒙古历史文献上的关键词识别

获取原文

摘要

The original Bag-of-Visual-Words approach discards the spatial relations of the visual words. In this paper, a LDA-based topic model is adopted to obtain the semantic relations of visual words for each word image. Because the LDA-based topic model usually hurts retrieval performance when directly employs itself. Therefore, the LDA-based topic model is linearly combined with a visual language model for each word image in this study. After that, the basic query likelihood model is used for realizing the procedure of retrieval. The experimental results on our dataset show that the proposed LDA-based representation approach can efficiently and accurately attain to the aim of keyword spotting on a collection of historical Mongolian documents. Meanwhile, the proposed approach improves the performance significantly than the original BoVW approach.
机译:原始的视觉词袋方法放弃了视觉词的空间关系。本文采用基于LDA的主题模型来获取每个单词图像中视觉单词的语义关系。因为基于LDA的主题模型通常在直接使用自身时会损害检索性能。因此,在本研究中,针对每个单词图像,将基于LDA的主题模型与视觉语言模型进行线性组合。之后,使用基本查询似然模型来实现检索过程。在我们的数据集上的实验结果表明,所提出的基于LDA的表示方法可以有效,准确地达到在蒙古历史文献集上发现关键词的目的。同时,与原始BoVW方法相比,所提出的方法显着提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号