Source: Workshop on Geographic Information Retrieval

Geotagging: Using Proximity, Sibling, and Prominence Clues to Understand Comma Groups

Abstract

Geotagging is the process of recognizing textual references to geographic locations, known as toponyms, and resolving these references by assigning each a latitude/longitude value. Typical geotagging algorithms use a variety of heuristic evidence to select the correct interpretation for each toponym. This paper studies one such heuristic, which aids in recognizing and resolving lists of toponyms, referred to as comma groups. Comma groups of toponyms are recognized and resolved by inferring the common threads that bind them together, based on the toponyms' shared geographic attributes. Three such common threads are proposed and studied: population-based prominence, distance-based proximity, and sibling relationships in a geographic hierarchy. Examples of each are noted. In addition, the usage and variety of these comma groups are measured in a large dataset of news articles; the measurements indicate that the proposed heuristics, in particular the proximity and sibling heuristics, are useful for resolving comma group toponyms.
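
To make the proximity idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical toy gazetteer (`GAZETTEER`) with approximate coordinates, and it treats "proximity" as choosing one interpretation per toponym so that the total pairwise great-circle distance within the comma group is smallest. The function names `haversine_km` and `resolve_by_proximity` are illustrative inventions.

```python
from itertools import combinations, product
from math import asin, cos, radians, sin, sqrt

# Hypothetical toy gazetteer: toponym -> candidate (name, lat, lon) interpretations.
# Coordinates are approximate and included for illustration only.
GAZETTEER = {
    "Paris": [("Paris, France", 48.86, 2.35), ("Paris, Texas", 33.66, -95.56)],
    "Dallas": [("Dallas, Texas", 32.78, -96.80)],
    "Austin": [("Austin, Texas", 30.27, -97.74)],
}


def haversine_km(a, b):
    """Great-circle distance in km between two (name, lat, lon) candidates."""
    lat1, lon1, lat2, lon2 = map(radians, (a[1], a[2], b[1], b[2]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def resolve_by_proximity(comma_group, gazetteer=GAZETTEER):
    """Pick one candidate per toponym, minimizing total pairwise distance.

    Assumption: this exhaustive search stands in for a proximity heuristic;
    it is only practical for short comma groups with few candidates each.
    """
    candidate_lists = [gazetteer[name] for name in comma_group]
    best = min(
        product(*candidate_lists),
        key=lambda combo: sum(haversine_km(a, b) for a, b in combinations(combo, 2)),
    )
    return [name for name, _, _ in best]


print(resolve_by_proximity(["Paris", "Dallas", "Austin"]))
# → ['Paris, Texas', 'Dallas, Texas', 'Austin, Texas']
```

In this toy case the group "Paris, Dallas, Austin" resolves "Paris" to the Texas city because that reading keeps all three places close together. A prominence heuristic based on population alone would instead favor Paris, France.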
