首页> 外文期刊>Journal of computer sciences >Toponym Disambiguation by Arborescent Relationships | Science Publications
【24h】

Toponym Disambiguation by Arborescent Relationships | Science Publications

机译:树状关系消除地名歧义|科学出版物

获取原文
       

摘要

> Problem statement: The way of referring to a place in the geographical space can be formal, based on the spatial coordinates, or informal, which we use in natural language by using toponyms (place names). A toponym can represent several geographical places. This ambiguity made problematic its conversion towards a unique formal representation. Toponym disambiguation in text is the task of assigning a unique location to an ambiguous place name in a given textual context. Approach: Several toponym disambiguation heuristics assumed a geographical proximity between the toponyms of the same context. This proximity can be in terms of spatial distance or in terms of arborsecent relationships, i.e., proximity in the hierarchical tree of the world places. This study presented a new toponym disambiguation heuristic in text based on the quantification of the arborescent proximity between toponyms. This quantification was done by a new measure of geographical correlation that we call the Geographical Density. Results: Our method was compared to the state of the art methods using GeoSemCor corpus and it has outperformed them in term of recall (87.4%) and coverage (99.0%). The results showed that the toponyms of the same context are much closer in terms of arborescent relationships than in terms of spatial relationships. Conclusion: We believe that the quantification of arborescent relationships between toponyms of the same textual context is a good way to improve the recall of TD task. However, all the arborescent relationships
机译: > 问题陈述:指代地理空间中某个地点的方式可以是基于空间坐标的正式形式,也可以是非正式的形式,我们通过地名使用自然语言(地名)。地名可以代表几个地理位置。这种歧义性使其转换为独特的形式表示形式成为一个问题。文本中的地名消除歧义是在给定的文本上下文中为模棱两可的地名分配唯一位置的任务。 方法:几种地名消除歧义的启发式方法假设同一上下文的地名之间在地理上接近。这种接近可以根据空间距离或树状关系,即在世界场所的层次树中的接近。这项研究基于对地名之间树状接近度的量化,提出了一种新的地名歧义消除启发式方法。这种量化是通过一种称为“地理密度”的新的地理相关度量来完成的。 结果:我们的方法与使用GeoSemCor语料库的最新方法进行了比较,在召回率(87.4%)和覆盖率(99.0%)方面均优于他们。结果表明,相同上下文的地名在树状关系方面比在空间关系方面更接近。 结论:我们认为,对相同文本上下文的地名之间的树状关系进行量化是提高TD任务召回率的好方法。但是,所有树状关系

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号