首页> 外文学位 >Statistically Sound Interaction Pattern Discovery from Spatial Data.
【24h】

Statistically Sound Interaction Pattern Discovery from Spatial Data.

机译:从空间数据统计上合理的交互模式发现。

获取原文
获取原文并翻译 | 示例

摘要

Spatial interaction pattern mining is the process of discovering patterns that occur due to the interaction of Boolean features from a spatial domain. A positive interaction of a subset of features generates a co-location pattern, whereas a negative interaction of a subset of features generates a segregation pattern. Finding interaction patterns is important for many application domains such as ecology, environmental science, forestry, and criminology.;Existing mining algorithms also use a user provided distance threshold at which the algorithm checks for prevalent patterns. Since spatial interactions, in reality, may happen at different distances, finding the right distance threshold to mine all true patterns is not easy and a single appropriate threshold may not even exist. In the second major contribution of this thesis, we propose an algorithm to mine true co-locations at multiple distances. Our approach does not need thresholds for the prevalence measure and the interaction distance. An approximation algorithm is also proposed to prune redundant patterns that could occur in a statistical test. This algorithm finally reports a minimal set of patterns explaining all the detected co-locations. We evaluate the efficacy of our proposed approaches using synthetic and real data sets and compare our algorithms with the state-of-the-art co-location mining approach.;Existing methods use a prevalence measure, which is mainly a frequency based measure. To mine prevalent patterns, the known methods require a user defined prevalence threshold. Deciding the right threshold value is not easy and an arbitrary threshold value may result in reporting meaningless patterns and even not reporting meaningful patterns. Due to the presence of spatial auto-correlation and feature abundance, which are not uncommon in a spatial domain, random patterns may achieve prevalence measure values higher than the used threshold just by chance, in which case the existing algorithm will report them. To overcome these limitations, we introduce a new definition of interaction patterns based on a statistical test. For the statistical test, we propose to design an appropriate null model which takes spatial auto-correlation into account. To reduce the computational cost of the statistical test, we also propose two approaches.
机译:空间交互模式挖掘是发现由于来自空间域的布尔特征的交互而发生的模式的过程。特征子集的正向交互会生成共置模式,而特征子集的负向交互会生成隔离模式。查找交互模式对于许多应用领域都很重要,例如生态,环境科学,林业和犯罪学。现有的挖掘算法还使用用户提供的距离阈值,在该阈值处算法会检查流行的模式。实际上,由于空间相互作用可能发生在不同的距离,因此要找到正确的距离阈值来挖掘所有真实模式并不容易,并且甚至可能不存在单个适当的阈值。在本文的第二个主要贡献中,我们提出了一种在多个距离上挖掘真实共址的算法。我们的方法不需要普遍性测度和相互作用距离的阈值。还提出了一种近似算法来修剪可能在统计测试中出现的冗余模式。该算法最后报告了一组最小的模式,解释了所有检测到的同位。我们使用合成的和真实的数据集评估了我们提出的方法的有效性,并将我们的算法与最新的共置位挖掘方法进行了比较。现有方法使用普遍性度量,主要是基于频率的度量。为了挖掘流行模式,已知方法需要用户定义的流行阈值。确定正确的阈值并不容易,任意阈值可能导致报告无意义的模式,甚至不报告有意义的模式。由于存在空间自相关和特征丰富度(这在空间域中并不罕见),随机模式可能偶然获得的流行度测量值高于使用的阈值,在这种情况下,现有算法将报告它们。为了克服这些限制,我们基于统计测试引入了交互模式的新定义。对于统计检验,我们建议设计一个适当的空模型,该模型考虑空间自相关。为了减少统计检验的计算成本,我们还提出了两种方法。

著录项

  • 作者

    Barua, Sajib.;

  • 作者单位

    University of Alberta (Canada).;

  • 授予单位 University of Alberta (Canada).;
  • 学科 Computer science.;Mathematics.;Statistics.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 151 p.
  • 总页数 151
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 老年病学;
  • 关键词

  • 入库时间 2022-08-17 11:53:52

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号