首页> 外文会议>Document Recognition and Retrieval Conference >Hierarchical logical structure extraction of book documents by analyzing tables of contents
【24h】

Hierarchical logical structure extraction of book documents by analyzing tables of contents

机译:通过分析内容表的分层逻辑结构提取书籍文档

获取原文

摘要

Logical structure extraction of book documents is significant in electronic document database automatic construction. The tables of contents in a book play an important role in representing the overall logical structure and reference information of the book documents. In this paper, a new method is proposed to extract the hierarchical logical structure of book documents, in addition to the reference information, by combining spatial and semantic information of the tables of contents in a book. Experimental results obtained from testing on various book documents demonstrate the effectiveness and robustness of the proposed approach.
机译:书籍文档的逻辑结构提取在电子文档数据库自动施工中是显着的。书中内容表在代表书籍文档的整体逻辑结构和参考信息方面发挥着重要作用。在本文中,提出了一种新方法,以通过将内容表中的空间和语义信息组合在书中结合了参考信息,以提取书籍文档的分层逻辑结构。从各种书籍文件测试获得的实验结果表明了所提出的方法的有效性和稳健性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号