首页> 外文期刊>Mobile networks & applications >Comprehensive Association Rules Mining of Health Examination Data with an Extended FP-Growth Method
【24h】

Comprehensive Association Rules Mining of Health Examination Data with an Extended FP-Growth Method

机译:使用扩展的FP-Growth方法对健康检查数据进行全面的关联规则挖掘

获取原文
获取原文并翻译 | 示例
       

摘要

With the booming of social media and health informatics, there exists a pressing need for a powerful tool to sustain comprehensive analysis of public and personal health information. In particular, it should be able (1) to maximize the discovery of association rules amongst data items and (2) to handle the rapid growing data scale. The FP-Growth algorithm is a salient association rule learning method in exploring potential relation in database possibly with a lack of priori knowledge. It has the merits of low time & space complexity, whereas it cannot handle negative association rules which is necessary in comprehensive mining of health data. In order to enable comprehensive discovery of association rules, this study extends the FP-Growth algorithm to mine both positive and negative frequent patterns, namely the PNFP-Growth framework. The extended approach also adopts a prune strategy to filter out misleading patterns to the most by correlating the negative data items and the positive ones. Experiments had been performed to evaluate the performance of the PNFP-Growth over a public data set and a database consisting of thousands of people's real health examination information (collected within 5 years from the date of this publication). The results indicate that (1) the PNFP-Growth can excavate more patterns than the traditional counterpart does while it is still highly efficient, and (2) the analysis upon the health examination data is informative and well complies with the clinical practices, e.g., more than 30 % people suffering from hypertension are having high systolic pressure and liver problems.
机译:随着社交媒体和健康信息学的蓬勃发展,迫切需要一种功能强大的工具来维持对公共和个人健康信息的综合分析。特别是,它应该能够(1)最大化发现数据项之间的关联规则,以及(2)处理快速增长的数据规模。 FP-Growth算法是一种显着的关联规则学习方法,用于探索可能缺乏先验知识的数据库中的潜在关系。它具有时间和空间复杂度低的优点,但是它不能处理在全面挖掘健康数据时必需的否定关联规则。为了能够全面发现关联规则,本研究扩展了FP-Growth算法,以挖掘积极和消极的频繁模式,即PNFP-Growth框架。扩展方法还采用修剪策略,通过关联负面数据项和正面数据项,最大程度地过滤误导性模式。已经进行了一些实验,以评估PNFP-Growth在公共数据集和由成千上万的真实健康检查信息(从发布之日起5年内收集)组成的数据库中的性能。结果表明:(1)PNFP-Growth可以挖掘出更多的模式,而传统的PNFP-Growth仍然高效,并且(2)对健康检查数据的分析是有益的,并且符合临床实践,例如,超过30%的高血压患者患有高收缩压和肝脏问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号