首页> 外文会议>International conference on knowledge science, engineering and management >A Practical Approach to Extracting Names of Geographical Entities and Their Relations from the Web
【24h】

A Practical Approach to Extracting Names of Geographical Entities and Their Relations from the Web

机译:从网络中提取地理实体名称及其关系的实用方法

获取原文

摘要

Geographical information extraction is a special case of information extraction. In this paper, we present a practical method of extracting both names of geographical entities and their relations from the Web. The method is composed of three major phases. First, we manually designed a list of 493 Chinese lexico-syntactical patterns for matching Web page excerpts which contain names of geographical entities and their relations; second, we developed a knowledge extractor for extracting those names and relations to generate a geographical graph whose nodes are entities, and edges represent relations of the entities; third, we developed several methods for handling problems or errors in the generated graph. Experimental results show that the OMKast-Googling system has a satisfactory performance both in the entity name extraction and relation extraction.
机译:地理信息提取是信息提取的一种特殊情况。在本文中,我们提出了一种从Web提取地理实体名称及其关系的实用方法。该方法包括三个主要阶段。首先,我们手动设计了493个中文词汇语法模式列表,以匹配网页摘录,其中包含地理实体的名称及其关系;其次,我们开发了一个知识提取器,用于提取那些名称和关系,以生成节点为实体,边代表实体之间关系的地理图。第三,我们开发了几种方法来处理所生成图形中的问题或错误。实验结果表明,OMKast-Googling系统在实体名称提取和关系提取方面均具有令人满意的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号