首页> 外文OA文献 >From Index Locorum to Citation Network: an Approavch to the Automatic Extraction of Canonical Reeferences and its Applications to the Study of Classical Texts
【2h】

From Index Locorum to Citation Network: an Approavch to the Automatic Extraction of Canonical Reeferences and its Applications to the Study of Classical Texts

机译:从索引地方到引文网络:对典范参考自动提取的一种方法及其在典籍研究中的应用

摘要

My research focusses on the automatic extraction of canonical referencesfrom publications in Classics. Such references are the standardway of citing classical texts and are found in great numbers throughoutmonographs, journal articles and commentaries.In chapters 1 and 2 I argue for the importance of canonical citationsand for the need to capture them automatically. Their importance andfunction is to signal text passages that are studied and discussed, oftenin relation to one another as can be seen in parallel passages found inmodern commentaries. Scholars in the field have long been exploitingthis kind of information by manually creating indexes of cited passages,the so-called indices locorum. However, the challenge we now face isfind new ways of indexing and retrieving information contained in thegrowing volume of digital archives and libraries.Chapters 3 and 4 look at how this problem can be tackled by translatingthe extraction of canonical citations into a computationally solvableproblem. The approach I developed consists of treating the extractionof such citations as a problem of named entity extraction. This problemcan be solved with some degree of accuracy by applying and adaptingmethods of Natural Language Processing. In this part of the dissertationI discuss the implementation of this approach as a working prototypeand an evaluation of its performance.Once canonical references have been extracted from texts, the web ofrelations between documents that they create can be represented as anetwork. This network can then be searched, manipulated, visualisedand analysed in various ways. In chapter 5 I focus specifically on howthis network can be leveraged to search through bodies of secondaryliterature. Finally in chapter 6 I discuss how my work opens up newresearch perspectives in terms of visualisation, analysis and the applicationof such automatically extracted citation networks.
机译:我的研究集中于从Classics出版物中自动提取规范参考。这些参考文献是引用古典文本的标准方法,并且在专着,期刊文章和评论中都有大量引用。在第一章和第二章中,我主张规范引用的重要性以及自动捕获它们的必要性。它们的重要性和功能是发信号说明经过研究和讨论的文本段落,通常彼此之间是相关的,正如在现代评论中发现的平行段落中可以看到的那样。长期以来,该领域的学者一直在通过手动创建引用段落的索引(即所谓的索引局部索引)来利用这种信息。然而,我们现在面临的挑战是寻找索引和检索数字档案和图书馆中不断增长的信息的新方法。第3章和第4章探讨了如何通过将规范引文的提取转化为可计算的问题来解决该问题。我开发的方法包括将此类引文的提取视为命名实体提取的问题。通过应用和适应自然语言处理方法,可以在某种程度上准确地解决此问题。在本部分的论文中,我将讨论该方法作为工作原型的实现方式及其性能评估。一旦从文本中提取了规范引用,它们创建的文档之间的关系网络就可以表示为网络。然后可以通过各种方式搜索,操作,可视化和分析该网络。在第5章中,我特别关注如何利用这个网络来搜索中学文献的主体。最后,在第6章中,我讨论了我的工作如何在可视化,分析和这种自动提取的引文网络的应用方面开辟新的研究视角。

著录项

  • 作者

    Romanello Matteo;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 入库时间 2022-08-20 20:34:26

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号