首页> 外文会议>International Congress on Digital Heritage >ATHENA: Automatic Text Height ExtractioN for the Analysis of old handwritten manuscripts
【24h】

ATHENA: Automatic Text Height ExtractioN for the Analysis of old handwritten manuscripts

机译:雅典娜:旧手写稿件分析的自动文本高度提取

获取原文

摘要

A massive digital acquisition of huge sets of deteriorating historical documents is mandatory due to their value and delicacy. The study and the browsing of such digital libraries is becoming crucial for scholars in the Cultural Heritage field, but it requires automatic tools for analyzing and indexing those dataset items. We present here a layout analysis method to perform automatic text height estimation, without the need of any kind of manual intervention and user defined parameters. It proves to be a robust technique in the case of very noisy and damaged handwritten manuscripts. The effectiveness of the method is demonstrated on a huge heterogeneous corpus of medieval manuscripts, with different writing styles, and affected by other uncontrollable factors, such as ink bleed-through, background noise, and overtyping text lines.
机译:由于它们的价值和美味,强制性地获取大量恶化的历史文档是强制性的。该研究和浏览此类数字图书馆对文化遗产领域的学者来说变得至关重要,但它需要自动工具来分析和索引这些数据集项目。我们在这里介绍一个布局分析方法,用于执行自动文本高度估计,而无需任何类型的手动干预和用户定义的参数。在非常嘈杂和损坏的手写手稿的情况下,它被证明是一种强大的技术。该方法的有效性在中世纪手稿的巨大异构语料上,具有不同的写作风格,并受其他无法控制的因素影响,例如墨水渗透,背景噪声和室外文本线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号