首页> 外文会议>IEEE International Conference on Big Data Computing Service and Applications >An Evolutionary Rule Mining Method for Continuous Value Prediction from Incomplete Database and Its Application Utilizing Artificial Missing Values
【24h】

An Evolutionary Rule Mining Method for Continuous Value Prediction from Incomplete Database and Its Application Utilizing Artificial Missing Values

机译:一种不完备数据库中连续值预测的进化规则挖掘方法及其在人工缺失值中的应用

获取原文

摘要

A rule mining method for continuous value prediction has been proposed to handle incomplete databases using a graph structure-based evolutionary computation technique. The method extracts the associative local distribution rule, the consequent part of which has a narrow distribution of continuous variables. A set of associative local distribution rules was applied for continuous value prediction. Instances including missing values were predicted using the predictor. A method for constructing a probability distribution of predicted values for each focusing instance was considered based on extracted rule sets. The proposed method offers some flexibility by allowing users to define the conditions of prediction rules. The method can quit rule extraction when a sufficient number of rules are extracted for building a predictor. Therefore, it is suitable for prediction when large datasets are involved. In addition, we have proposed an application of artificial missing values to improve the effectiveness of the developed rule-based prediction system. Artificial missing values are applied to avoid the sharp boundary problem encountered when discretizing continuous variables. Attribute values near the boundary in discretization are treated as missing values. The performance of the artificial missing value-based prediction method was evaluated, and the results showed that the proposed method was effective for prediction.
机译:提出了一种用于连续值预测的规则挖掘方法,以使用基于图结构的进化计算技术来处理不完整的数据库。该方法提取关联局部分布规则,该规则的结果部分具有连续变量的狭窄分布。一组关联的局部分布规则用于连续值预测。使用预测器对包括缺失值的实例进行了预测。基于提取的规则集,考虑了一种为每个聚焦实例构建预测值的概率分布的方法。所提出的方法通过允许用户定义预测规则的条件而提供了一定的灵活性。当提取足够数量的规则以建立预测器时,该方法可以退出规则提取。因此,适用于涉及大型数据集的预测。另外,我们提出了人为的缺失值的应用,以提高已开发的基于规则的预测系统的有效性。应用人为的缺失值可避免离散化连续变量时遇到的尖锐边界问题。离散化中靠近边界的属性值被视为缺失值。对基于人工缺失值的预测方法的性能进行了评估,结果表明该方法对预测是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号