首页> 外文会议>International Conference on Information and Communication Technology >Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source
【24h】

Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source

机译:评估地理空间信息的提取和检索:从网络资源中挖掘专题图

获取原文

摘要

The World Wide Web easily becomes the largest repository of natural language text data. We are particularly interested in state-of-the-art methods in exploiting geospatial information the web. The survey is done in the context of its extraction methods, retrieval, visualization, and further possible mining or knowledge discovery scenarios in order to produce thematic maps automatically from the web corpus. We found that Web-based Geographic Information Retrieval (GIR) methods that returns selected relevant area instead of points is still lacking, even though area modeling is common in GIS. We also found that most GIR methods is still focused on places and buildings instead of theme or information around some area. Thus it indicates that the state of the art GIR methods are not yet sufficient for thematic extraction and retrieval to generate thematic maps from web corpus. Bayesian topic models such as Latent Dirichlet Allocation may serve as a good basis to serve such use cases.
机译:万维网很容易成为自然语言文本数据的最大存储库。我们对利用网络地理空间信息的最新方法特别感兴趣。调查是在其提取方法,检索,可视化以及其他可能的挖掘或知识发现场景的背景下进行的,以便从网络语料库中自动生成主题地图。我们发现,尽管在GIS中常见的是区域建模,但仍缺少基于Web的地理信息检索(GIR)方法,该方法可返回选定的相关区域而不是点。我们还发现,大多数GIR方法仍然专注于地点和建筑物,而不是某些区域的主题或信息。因此,这表明最先进的GIR方法还不足以进行主题提取和检索以从Web语料库生成主题图。贝叶斯主题模型(例如潜在Dirichlet分配)可以作为服务此类用例的良好基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号