分布式环境下,数据的特征是规模大、分布式存储.采用传统的数据挖掘技术分别对每个数据集进行分析一般都容易实现,但是要对全部数据进行整体决策时就比较困难.为此,提出一种新的挖掘方法,可以从多个数据集中挖掘规则.提出元关联规则生成模型,可以发现在每个独立的数据集中挖掘的规则之间的共同联系.设计清晰元关联规则和模糊元关联规则两种框架,对清晰元关联规则挖掘算法和模糊元关联规则挖掘算法做了对比.结果表明,模糊元关联规则挖掘方法在易用性和精确性方面比清晰方法要好.%In distributed environment, data is characterized by large scale and distributed storage.Generally, it is easy to achieve to use traditional data mining techniques to analyze each data set separately.However,it is more difficult to make an overall decision on all the data.Therefore, a new mining method is proposed in this paper, which can be used to mine rules from multiple data sets.A generation model is put forward based on meta association rule,finding out the relationship between the rules mined in each independent data set.Two kinds of frameworks,crisp meta association rules and fuzzy meta association rules,are designed.And comparison between their algorithms is also made.The results show that the mining method based on the fuzzy meta association rules is better than the crisp method in terms of ease of use and accuracy.
展开▼