...
首页> 外文期刊>International journal of semantic computing >ASKNET: CREATING AND EVALUATING LARGE SCALE INTEGRATED SEMANTIC NETWORKS
【24h】

ASKNET: CREATING AND EVALUATING LARGE SCALE INTEGRATED SEMANTIC NETWORKS

机译:ASKNET:创建和评估大型集成语义网络

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Extracting semantic information from multiple natural language sources and combining that information into a single unified resource is an important and fundamental goal for natural language processing. Large scale resources of this kind can be useful for a wide variety of tasks including question answering, word sense disambiguation and knowledge discovery. A single resource representing the information in multiple documents can provide significantly more semantic information than is available from the documents considered independently. The ASKNet system utilises existing nlp tools and resources, together with spreading activation based techniques, to automatically extract semantic information from a large number of English texts, and combines that information into a large scale semantic network. The initial emphasis of the ASKNet system is on wide-coverage, robustness and speed of construction. In this paper we show how a network consisting of over 1.5 million nodes and 3.5 million edges, more than twice as large as any network currently available, can be created in less than 3 days. Evaluation of large-scale semantic networks is a difficult problem. In order to evaluate ASKNet we have developed a novel evaluation metric based on the notion of a network “core” and employed human evaluators to determine the precision of various components of that core. We have applied this evaluation to networks created from randomly chosen articles used by duc (Document Understanding Conference). The results are highly promising: almost 80% precision in the semantic core of the networks.
机译:从多种自然语言源中提取语义信息并将该信息组合到一个统一的资源中是自然语言处理的重要且基本的目标。这种大规模资源可用于多种任务,包括问题解答,单词义消歧和知识发现。与独立考虑的文档相比,在多个文档中表示信息的单个资源可以提供更多的语义信息。 ASKNet系统利用现有的nlp工具和资源,以及基于扩展激活的技术,自动从大量英文文本中提取语义信息,并将该信息组合成大规模的语义网络。 ASKNet系统的最初重点是广泛的覆盖范围,坚固性和施工速度。在本文中,我们展示了如何在不到3天的时间内创建一个由150万个节点和350万个边缘组成的网络,其规模是当前可用网络的两倍以上。大规模语义网络的评估是一个难题。为了评估ASKNet,我们基于网络“核心”的概念开发了一种新颖的评估指标,并聘请了人工评估人员来确定该核心各个组成部分的精度。我们已将此评估应用于由duc(文档理解会议)使用的随机选择的文章创建的网络。结果非常有希望:网络语义核心的精度几乎达到80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号