首页> 外文会议>2011 Fifth IEEE International Conference on Semantic Computing >Content-Based Geospatial Schema Matching Using Semi-supervised Geosemantic Clustering and Hierarchy
【24h】

Content-Based Geospatial Schema Matching Using Semi-supervised Geosemantic Clustering and Hierarchy

机译:使用半监督Geosemantic聚类和层次结构的基于内容的地理空间模式匹配

获取原文

摘要

The problem of semantic similarity across heterogeneous geospatial data sources continues to attract interest. Semantic similarity across data sources typically involves 1:1 matching of attributes and their instances between tables. Using clustering methods, three distinct challenges remain unaddressed. First, many clustering algorithms rely only on one instance property. Second, a consistent score for an attribute match is not produced. Finally, hierarchical relationships between the data are not considered. To address these, we introduce GeoSim, a tool for determining the semantic similarity between geospatial schemas. GeoSim consists of GeoSimG and GeoSimH. GeoSimG derives clusters from attribute instances based on their geographic and semantic properties. It examines attribute instances in the clusters to calculate a consistent semantic similarity score through entropy-based distribution (EBD). GeoSimH also captures hierarchical relationships between compared tables and attributes. Results from experiments involving multi-jurisdictional geospatial datasets show that GeoSim outperforms several popular semantic similarity approaches.
机译:跨异构地理空间数据源的语义相似性问题继续引起人们的兴趣。跨数据源的语义相似性通常涉及表之间的属性及其实例的1:1匹配。使用聚类方法,三个不同的挑战仍未解决。首先,许多聚类算法仅依赖一个实例属性。其次,不会产生属性匹配的一致分数。最后,不考虑数据之间的层次关系。为了解决这些问题,我们引入了GeoSim,这是一种确定地理空间方案之间语义相似性的工具。 GeoSim由GeoSimG和GeoSimH组成。 GeoSimG根据属性实例的地理和语义属性从其派生群集。它检查聚类中的属性实例,以通过基于熵的分布(EBD)计算一致的语义相似性评分。 GeoSimH还捕获比较表和属性之间的层次关系。涉及多辖区地理空间数据集的实验结果表明,GeoSim优于几种流行的语义相似性方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号