首页> 外文会议>2017 International Electronics Symposium on Knowledge Creation and Intelligent Computing >Discovering patterns of NED-breast cancer based on association rules using apriori and FP-growth
【24h】

Discovering patterns of NED-breast cancer based on association rules using apriori and FP-growth

机译:使用先验和FP增长基于关联规则发现NED乳腺癌的模式

获取原文
获取原文并翻译 | 示例

摘要

No Evidence of Disease (NED) is breast cancer patient condition status which it indicates that they can life, no find the cancer by tested, and without any symptoms of cancer in period of times, after they received primary treatment. NED is a critical status, because it involves the treatment type and patient cancer condition factors. This paper examines about breast cancer problem in data mining technical side, especially to discover the patterns of NED-breast cancer patient using cancer registry data from Oncology Hospital. Its patterns are discovered through the relationship of among features begin from 1dimensional, 2-dimensional, 3-dimensional, and n-dimensional. We applied association rules mining using Apriori and FP-Growth algorithm, which both have the advantage and drawback. Apriori algorithm involves all generation of candidate item sets and multiple database scans, but it makes highconsuming iteration. While FP-Growth algorithm extracts the frequent item sets directly from FP-Tree, it make the advantage of FP-Growth that is faster process needs only scan the database once. This paper experiment shown that the association result of Apriori and FP-Growth is almost similar, 10-highest confidence value represented 100% confidence of association rule on breast cancer dataset with support value up to 50%.
机译:没有疾病证据(NED)是乳腺癌患者的状况,表明他们在接受初级治疗后可以生存,无法通过测试发现癌症,并且在一段时间内没有任何癌症症状。 NED是至关重要的状态,因为它涉及治疗类型和患者癌症状况因素。本文探讨了数据挖掘技术方面的乳腺癌问题,尤其是使用来自肿瘤医院的癌症登记数据来发现NED乳腺癌患者的模式。通过从1维,2维,3维和n维开始的特征之间的关系发现其图案。我们使用Apriori和FP-Growth算法应用了关联规则挖掘,两者都有其优点和缺点。 Apriori算法涉及候选项目集的所有生成和多个数据库扫描,但是它进行了耗时的迭代。 FP-Growth算法直接从FP-Tree提取频繁项集时,却利用了FP-Growth的优势,即更快的处理过程只需要扫描数据库一次。本文实验表明,Apriori和FP-Growth的关联结果几乎相似,置信度最高的10位代表对乳腺癌数据集的关联规则的100%置信度,支持值高达50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号