【24h】

Automatic Document Navigation for Digital Content Re-mastering

机译:自动文档导航以重新制作数字内容

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a novel method of automatically adding navigation capabilities to re-mastered electronic books. We first analyze the need for a generic and robust system to automatically construct navigation links into re-mastered books. We then introduce the core algorithm based on text matching for building the links. The proposed method utilizes the tree-structured dictionary and directional graph of the table of contents to efficiently conduct the text matching. Information fusion further increases the robustness of the algorithm. The experimental results on the MIT Press digital library project are discussed and the key functional features of the system are illustrated. We have also investigated how the quality of the OCR engine affects the linking algorithm. In addition, the analogy between this work and Web link mining has been pointed out.
机译:本文介绍了一种自动为重新制作的电子书添加导航功能的新颖方法。我们首先分析对通用且强大的系统的需求,以自动将导航链接构建到重新制作的书籍中。然后,我们介绍基于文本匹配的核心算法来构建链接。所提出的方法利用目录的树状字典和方向图来有效地进行文本匹配。信息融合进一步提高了算法的鲁棒性。讨论了MIT Press数字图书馆项目的实验结果,并说明了该系统的主要功能。我们还研究了OCR引擎的质量如何影响链接算法。此外,还指出了这项工作与Web链接挖掘之间的类比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号