International Conference on Computational Linguistics

Hierarchical Text Segmentation for Medieval Manuscripts


Abstract

In this paper, we address the segmentation of books of hours, Latin devotional manuscripts of the late Middle Ages, which raise several challenging issues: a complex, entangled hierarchical structure; variable content; noisy transcriptions with no sentence markers; and strong correlations between sections, so that topical information alone is no longer sufficient to draw segmentation boundaries. We show that the main state-of-the-art segmentation methods are either inefficient or inapplicable for books of hours, and we propose a bottom-up greedy approach that considerably improves the segmentation results. We stress the importance of such hierarchical segmentation of books of hours for historians, who can use it to explore the overarching differences among these books and the underlying conceptions of the Church.
