首页> 外文会议>ACM symposium on document engineering >Challenges in Generating Bookmarks from TOC Entries in e-Books
【24h】

Challenges in Generating Bookmarks from TOC Entries in e-Books

机译:在电子书中从TOC参赛作品发行书签的挑战

获取原文

摘要

The automatic bookmark creation in e-books using the TOC entries are discussed at length in this paper. The approach here is novel and has significant advantages, that the extracted TOC entries could be used for extracting different parts of the book with the use of bookmarks. Bookmarks will give the exact location of the content, which could be used in the extraction of a particular Chapter / Section / Sub-section of an e-book. Apart from this, these could be guide to extract the structure information and extraction of hierarchy in e-books (e.g., Chapter→Section→Sub-section). Various challenges involved are discussed at detail. Solutions to the challenges identified were demonstrated using a PDF-specific tool named Booky. The solution however could be generalized for any digital format. Some relatively minor issues still are still open. However, we have been able to achieve about 98% success rate in the accurate extraction and linking of bookmarks automatically.
机译:本文以较长讨论使用TOC条目的电子书中的自动书签创建。这里的方法是新颖的并且具有显着的优势,即提取的TOC条目可用于通过使用书签提取本书的不同部分。书签将提供内容的确切位置,可用于提取电子书的特定章节/截面/子部分的提取。除此之外,这些可能是提取电子书中的结构信息和提取层级的指南(例如,第→部分→子部分)。涉及涉及的各种挑战是详细讨论的。通过命名为Booky的PDF特定工具证明了所识别的挑战的解决方案。然而,解决方案可以推广任何数字格式。一些相对较小的问题仍然是开放的。但是,我们已经能够在准确提取和自动链接书签中达到约98%的成功率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号