首页> 外文会议>Machine learning and data mining in pattern recognition >Regional Pattern Discovery in Geo-referenced Datasets Using PCA
【24h】

Regional Pattern Discovery in Geo-referenced Datasets Using PCA

机译:使用PCA在地理参考数据集中发现区域模式

获取原文
获取原文并翻译 | 示例

摘要

Existing data mining techniques mostly focus on finding global patterns and lack the ability to systematically discover regional patterns. Most relationships in spatial datasets are regional; therefore there is a great need to extract regional knowledge from spatial datasets. This paper proposes a novel framework to discover interesting regions characterized by "strong regional correlation relationships" between attributes, and methods to analyze differences and similarities between regions. The framework employs a two-phase approach: it first discovers regions by employing clustering algorithms that maximize a PCA-based fitness function and then applies post processing techniques to explain underlying regional structures and correlation patterns. Additionally, a new similarity measure that assesses the structural similarity of regions based on correlation sets is introduced. We evaluate our framework in a case study which centers on finding correlations between arsenic pollution and other factors in water wells and demonstrate that our framework effectively identifies regional correlation patterns.
机译:现有的数据挖掘技术大多侧重于发现全局模式,而缺乏系统地发现区域模式的能力。空间数据集中的大多数关系都是区域性的。因此,非常需要从空间数据集中提取区域知识。本文提出了一个新颖的框架来发现有趣的区域,这些区域的特征是属性之间的“强区域关联关系”,并提出了分析区域之间异同的方法。该框架采用两阶段方法:首先使用聚类算法发现区域,该聚类算法可最大化基于PCA的适应度函数,然后应用后处理技术来解释底层区域结构和相关模式。另外,引入了一种新的相似性度量,该度量基于相关集评估区域的结构相似性。我们在一个案例研究中评估了我们的框架,该案例研究的重点是发现水井中砷污染与其他因素之间的相关性,并证明我们的框架有效地识别了区域相关性模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号