首页> 中文期刊> 《测绘学报》 >中文文本的地理命名实体标注

中文文本的地理命名实体标注

         

摘要

Semantic interpretation of geographic information in natural language can solve the semantic problem between natural language and geographical information system.Annotation schemes and corpus annotation aim to analyze specific linguistic structure of geographical information found in the text,and the establishment of the metadata describing them.Firstly,the difference of representation of geographical entities in Chinese text and GIS is analyzed.Secondly,based on linguistic characteristics of geographical named entities in Chinese text,an annotation scheme is presented and the annotation specification is given in detail.Finally,GATE(General Architecture for Text Engineering)is introduced as the annotation platform,and a large-scale annotated corpus-GeoCorpus based on "Encyclopedia of China Geography" is developed and evaluated.This study effectively addresses the current lack of related specification and standardized data.%地理信息的语义解析有效地解决自然语言与地理信息系统之间的语义障碍问题。在分析中文文本和地理信息系统中地理实体描述和表达机制差异的基础上,结合地理命名实体描述的语言特点,制定中文文本的地理命名实体标注体系和标注规范,并以GATE(General Architecture for Text Engineering)作为标注平台,构建基于《中国大百科全书中国地理》的大规模标注语料库,以解决当前相关标准和规模化标准数据匮乏的问题。

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号