...
首页> 外文期刊>Journal of Medical Imaging and Health Informatics >Evaluation of Risk Factors in Developing Breast Cancer with Expectation Maximization Algorithm in Data Mining Techniques
【24h】

Evaluation of Risk Factors in Developing Breast Cancer with Expectation Maximization Algorithm in Data Mining Techniques

机译:用数据挖掘技术中的期望最大化算法评估发展乳腺癌的危险因素

获取原文
获取原文并翻译 | 示例

摘要

Early detection and diagnosis of breast disease can improve the treatment effectiveness. Breast cancer is the second most common cancer for women. It is the second largest cause of cancer death worldwide. Annually, approximately more than 1,700,000 women worldwide are detected due to this disease. The prevalence of approximately 2% annual increased. Breast cancer involves several risk factors, some of which are proven but some still have controversial reported results and some are almost rejected. Sometimes factors such as maternal age at first birth, age at marriage and number of children have been recognized as risk factors, and sometimes as protective measures. In this paper we proposed a model that can predict the likelihood in developing a breast cancer. We modeled 7 different risk factors and their impact factors or their weighting using the data from Breast Cancer Surveillance Consortium (BCSC) in National Cancer Institute. We discovered the latent knowledge and generated new information by applying data mining techniques. Expectation Maximization (EM) algorithm was applied, data clustering was accomplished and the correlation of different risk factors was discovered. By analyzing our discovered information, we presented a novel formula to determine the probability in developing breast cancer and by using the proposed novel formula 98.6% accuracy was acquired.
机译:早期发现和诊断乳腺疾病可以提高治疗效果。乳腺癌是女性第二大常见癌症。它是全球癌症死亡的第二大原因。每年,由于这种疾病,全世界大约有超过1,700,000名妇女被发现。每年约2%的患病率增加。乳腺癌涉及多种风险因素,其中一些已被证实,但有些仍具有争议的报道结果,而有些则几乎被拒绝。有时,诸如初生产妇年龄,结婚年龄和子女数量等因素被认为是危险因素,有时也被认为是保护性措施。在本文中,我们提出了一个可以预测发生乳腺癌的可能性的模型。我们使用来自美国国家癌症研究所乳腺癌监视协会(BCSC)的数据对7种不同的风险因素及其影响因素或权重进行了建模。我们通过应用数据挖掘技术发现了潜在知识并生成了新信息。应用期望最大化算法,完成了数据聚类,发现了不同风险因素之间的相关性。通过分析我们发现的信息,我们提出了一种新的公式来确定发生乳腺癌的可能性,并通过使用所提出的新公式获得了98.6%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号