首页> 外文会议>International Conference on Information and Communication Technology >Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source
【24h】

Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source

机译:地理空间信息提取与检索评估:来自Web源的挖掘专题映射

获取原文

摘要

The World Wide Web easily becomes the largest repository of natural language text data. We are particularly interested in state-of-the-art methods in exploiting geospatial information the web. The survey is done in the context of its extraction methods, retrieval, visualization, and further possible mining or knowledge discovery scenarios in order to produce thematic maps automatically from the web corpus. We found that Web-based Geographic Information Retrieval (GIR) methods that returns selected relevant area instead of points is still lacking, even though area modeling is common in GIS. We also found that most GIR methods is still focused on places and buildings instead of theme or information around some area. Thus it indicates that the state of the art GIR methods are not yet sufficient for thematic extraction and retrieval to generate thematic maps from web corpus. Bayesian topic models such as Latent Dirichlet Allocation may serve as a good basis to serve such use cases.
机译:万维网轻松成为自然语言文本数据的最大存储库。我们对最先进的方法尤其感兴趣地利用网站空间信息。该调查是在提取方法,检索,可视化和进一步可能的挖掘或知识发现场景的上下文中完成的,以便自动从Web语料库中生成专题映射。我们发现,即使在GIS中常见,仍然缺少返回所选相关区域而不是点的基于Web的地理信息检索(GIR)方法。我们还发现,大多数GIR方法仍然专注于地方和建筑物而不是一些地区的主题或信息。因此,它表明,最先进的GIR方法尚不就是用于主题提取和检索,以从Web语料库生成专题映射。贝叶斯主题模型,如潜在的Dirichlet分配可以作为服务这种用例的良好基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号