首页> 外文期刊>International Journal of Geographical Information Science >Mining significant crisp-fuzzy spatial association rules
【24h】

Mining significant crisp-fuzzy spatial association rules

机译:挖掘重要的清晰模糊空间关联规则

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Spatial association rule mining (SARM) is an important data mining task for understanding implicit and sophisticated interactions in spatial data. The usefulness of SARM results, represented as sets of rules, depends on their reliability: the abundance of rules, control over the risk of spurious rules, and accuracy of rule interestingness measure (RIM) values. This study presents crisp-fuzzy SARM, a novel SARM method that can enhance the reliability of resultant rules. The method firstly prunes dubious rules using statistically sound tests and crisp supports for the patterns involved, and then evaluates RIMs of accepted rules using fuzzy supports. For the RIM evaluation stage, the study also proposes a Gaussian-curve-based fuzzy data discretization model for SARM with improved design for spatial semantics. The proposed techniques were evaluated by both synthetic and real-world data. The synthetic data was generated with predesigned rules and RIM values, thus the reliability of SARM results could be confidently and quantitatively evaluated. The proposed techniques showed high efficacy in enhancing the reliability of SARM results in all three aspects. The abundance of resultant rules was improved by 50% or more compared with using conventional fuzzy SARM. Minimal risk of spurious rules was guaranteed by statistically sound tests. The probability that the entire result contained any spurious rules was below 1%. The RIM values also avoided large positive errors committed by crisp SARM, which typically exceeded 50% for representative RIMs. The real-world case study on New York City points of interest reconfirms the improved reliability of crisp-fuzzy SARM results, and demonstrates that such improvement is critical for practical spatial data analytics and decision support.
机译:空间关联规则挖掘(SARM)是一项重要的数据挖掘任务,用于理解空间数据中的隐式和复杂交互。 SARM结果以规则集表示的有用性取决于其可靠性:规则的丰富性,对虚假规则风险的控制以及规则兴趣度(RIM)值的准确性。这项研究提出了清晰模糊的SARM,这是一种新颖的SARM方法,可以提高所得规则的可靠性。该方法首先使用统计上合理的测试和对所涉及模式的清晰支持来修剪可疑规则,然后使用模糊支持来评估已接受规则的RIM。在RIM评估阶段,研究还提出了一种基于高斯曲线的SARM模糊数据离散化模型,该模型具有改进的空间语义设计。所提出的技术已通过综合和实际数据进行了评估。综合数据是使用预先设计的规则和RIM值生成的,因此可以自信且定量地评估SARM结果的可靠性。所提出的技术在提高这三个方面的SARM结果的可靠性方面均显示出很高的功效。与使用常规模糊SARM相比,结果规则的丰度提高了50%或更多。统计上合理的测试保证了伪造规则的最小风险。整个结果包含任何虚假规则的可能性低于1%。 RIM值还避免了由清晰的SARM造成的较大的正误差,对于典型RIM,该误差通常超过50%。关于纽约市景点的真实案例研究再次证实了清晰模糊的SARM结果的可靠性提高,并表明这种改进对于实际的空间数据分析和决策支持至关重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号