首页> 外文学位 >Coreference, cross-document coreference, and information extraction methodologies.
【24h】

Coreference, cross-document coreference, and information extraction methodologies.

机译:共指,跨文档共指和信息提取方法。

获取原文
获取原文并翻译 | 示例

摘要

Much work has been done in the field of Natural Language Processing in the last decade, especially in the areas of information extraction and text (document) retrieval. Numerous systems have been developed, each using its own techniques and theories for processing text. The explosive growth of the Internet and the amount of information available on the information super-highway has created large collections of free text that is easily and readily available to a large number of people. This has created opportunities for computers to play an increasingly important role in processing this large collection of text. This phenomenal growth in the amount of information available has also given the impetus to most of the current areas of research in Natural Language Processing.; This dissertation presents several Natural Language Processing (NLP) tools, both theoretical and practical, that further the research already done. In particular, the work described here involves systems for information extraction (IE), information retrieval (IR), document summarization, named entity identification (NE), word sense disambiguation (WSD), coreferencing, and cross-document coreferencing. In addition to building these systems, the research has also been focussed on building models for analyzing the complexities of various NLP tasks, and on using the models for analyzing the performance of systems on such tasks.
机译:在过去的十年中,自然语言处理领域已经做了很多工作,尤其是在信息提取和文本(文档)检索领域。已经开发了许多系统,每个系统都使用其自己的技术和理论来处理文本。互联网的爆炸性增长和信息高速公路上可用的信息量已经产生了许多免费文本,这些文本易于为许多人方便地使用。这为计算机创造了机会,在处理大量文本中发挥越来越重要的作用。可用信息量的这种惊人增长也推动了自然语言处理研究的大多数当前领域。本文提出了几种自然语言处理(NLP)工具,无论是理论上的还是实践上的,都有助于进一步的研究。特别是,此处描述的工作涉及信息提取(IE),信息检索(IR),文档摘要,命名实体标识(NE),单词义歧义消除(WSD),核心引用和跨文档核心引用的系统。除了构建这些系统之外,研究还集中在构建用于分析各种NLP任务的复杂性的模型,以及使用模型来分析此类任务的系统性能方面。

著录项

  • 作者

    Bagga, Amit.;

  • 作者单位

    Duke University.;

  • 授予单位 Duke University.;
  • 学科 Computer Science.; Information Science.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 165 p.
  • 总页数 165
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;信息与知识传播;
  • 关键词

  • 入库时间 2022-08-17 11:48:43

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号