首页> 外文OA文献 >On efficient and effective association rule mining from XML data
【2h】

On efficient and effective association rule mining from XML data

机译:从XML数据高效有效地进行关联规则挖掘

摘要

[Abstract]: In this paper, we propose a framework, called XAR-Miner, for mining ARs from XML documents efficiently and effectively. In XAR-Miner, raw XML data are first transformed to either an Indexed Content Tree (IX-tree) or Multirelational databases (Multi-DB), depending on the size of XML document and memory constraint of the system, for efficient data selection in the AR mining.udConcepts that are relevant to the AR mining task are generalized to produce generalized meta-patterns. A suitable metric is devised for measuring the degreeudof concept generalization in order to prevent under-generalization or overgeneralization. Resultant generalized meta-patterns are used to generate large ARsudthat meet the support and confidence levels. An efficient AR mining algorithm is also presented based on candidate AR generation in the hierarchy of generalizedudmeta-patterns. The experiments show that XAR-Miner is more efficient in performing a large number of AR mining tasks from XML documents than the state-of-the-art method of repetitively scanning through XML documents in orderudto perform each of the mining tasks.
机译:[摘要]:在本文中,我们提出了一个名为XAR-Miner的框架,用于有效地从XML文档中挖掘AR。在XAR-Miner中,首先将原始XML数据转换为索引内容树(IX-tree)或多关系数据库(Multi-DB),具体取决于XML文档的大小和系统的内存限制,以便在其中进行有效的数据选择。 ud与AR挖掘任务相关的概念被普遍化以产生广义的元模式。设计一种合适的度量来测量概念泛化的程度,以防止泛化不足或泛化。结果的广义元模式用于生成满足支持度和置信度水平的大AR ud。还基于广义 udmeta模式层次中的候选AR生成,提出了一种有效的AR挖掘算法。实验表明,XAR-Miner在执行XML文档中的大量AR挖掘任务方面比通过重复扫描XML文档以执行每个挖掘任务的最新方法更为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号