...
首页> 外文期刊>Methods of information in medicine >Predicting missing values in a home care database using an adaptive uncertainty rule method.
【24h】

Predicting missing values in a home care database using an adaptive uncertainty rule method.

机译:使用自适应不确定性规则方法预测家庭护理数据库中的缺失值。

获取原文
获取原文并翻译 | 示例

摘要

OBJECTIVES: Contemporary literature illustrates an abundance of adaptive algorithms for mining association rules. However, most literature is unable to deal with the peculiarities, such as missing values and dynamic data creation, that are frequently encountered in fields like medicine. This paper proposes an uncertainty rule method that uses an adaptive threshold for filling missing values in newly added records. A new approach for mining uncertainty rules and filling missing values is proposed, which is in turn particularly suitable for dynamic databases, like the ones used in home care systems. METHODS: In this study, a new data mining method named FiMV (Filling Missing Values) is illustrated based on the mined uncertainty rules. Uncertainty rules have quite a similar structure to association rules and are extracted by an algorithm proposed in previous work, namely AURG (Adaptive Uncertainty Rule Generation). The main target was to implement an appropriate method for recovering missing values in a dynamic database, where new records are continuously added, without needing to specify any kind of thresholds beforehand. RESULTS: The method was applied to a home care monitoring system database. Randomly, multiple missing values for each record's attributes (rate 5-20% by 5% increments) were introduced in the initial dataset. FiMV demonstrated 100% completion rates with over 90% success in each case, while usual approaches, where all records with missing values are ignored or thresholds are required, experienced significantly reduced completion and success rates. CONCLUSIONS: It is concluded that the proposed method is appropriate for the data-cleaning step of the Knowledge Discovery process in databases. The latter, containing much significance for the output efficiency of any data mining technique, can improve the quality of the mined information.
机译:目的:当代文献说明了用于挖掘关联规则的大量自适应算法。但是,大多数文献无法处理在医学等领域经常遇到的特性,例如缺失值和动态数据创建。本文提出了一种不确定性规则方法,该方法使用自适应阈值填充新添加的记录中的缺失值。提出了一种用于挖掘不确定性规则并填充缺失值的新方法,该方法又特别适用于动态数据库,例如家庭护理系统中使用的数据库。方法:在这项研究中,基于挖掘的不确定性规则,说明了一种新的数据挖掘方法,称为FiMV(填充缺失值)。不确定性规则具有与关联规则非常相似的结构,并由先前工作中提出的算法AURG(自适应不确定性规则生成)提取。主要目标是实现一种在动态数据库中恢复丢失值的适当方法,该数据库中不断添加新记录,而无需事先指定任何类型的阈值。结果:该方法已应用于家庭护理监控系统数据库。随机地,在初始数据集中引入了每个记录属性的多个缺失值(以5-20%的速率递增5%)。 FiMV在每种情况下都显示出100%的完成率,并且成功率超过90%,而通常的方法则忽略了所有缺少值的记录或需要使用阈值的情况,但它们的完成率和成功率却大大降低。结论:结论是,所提出的方法适用于数据库中知识发现过程的数据清理步骤。后者对于任何数据挖掘技术的输出效率都具有重要意义,可以提高所挖掘信息的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号