首页> 外文期刊>Journal of medical systems >Applying Under-Sampling Techniques and Cost-Sensitive Learning Methods on Risk Assessment of Breast Cancer
【24h】

Applying Under-Sampling Techniques and Cost-Sensitive Learning Methods on Risk Assessment of Breast Cancer

机译:在乳腺癌风险评估中采用欠采样技术和成本敏感型学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

Breast cancer is one of the most common cause of cancer mortality. Early detection through mammography screening could significantly reduce mortality from breast cancer. However, most of screening methods may consume large amount of resources. We propose a computational model, which is solely based on personal health information, for breast cancer risk assessment. Our model can be served as a pre-screening program in the low-cost setting. In our study, the data set, consisting of 3976 records, is collected from Taipei City Hospital starting from 2008.1.1 to 2008.12.31. Based on the dataset, we first apply the sampling techniques and dimension reduction method to preprocess the testing data. Then, we construct various kinds of classifiers (including basic classifiers, ensemble methods, and cost-sensitive methods) to predict the risk. The cost-sensitive method with random forest classifier is able to achieve recall (or sensitivity) as 100 %. At the recall of 100 %, the precision (positive predictive value, PPV), and specificity of cost-sensitive method with random forest classifier was 2.9 % and 14.87 %, respectively. In our study, we build a breast cancer risk assessment model by using the data mining techniques. Our model has the potential to be served as an assisting tool in the breast cancer screening.
机译:乳腺癌是癌症死亡的最常见原因之一。通过乳房X线摄影筛查的早期发现可以显着降低乳腺癌的死亡率。但是,大多数筛选方法可能会消耗大量资源。我们提出了一个仅基于个人健康信息的计算模型,用于乳腺癌风险评估。我们的模型可以作为低成本环境中的预筛查程序。在我们的研究中,该数据集由2008年1月1日至2008年12月31日在台北市立医院收集,共3976条记录。在数据集的基础上,我们首先应用采样技术和降维方法对测试数据进行预处理。然后,我们构造各种分类器(包括基本分类器,集成方法和成本敏感方法)来预测风险。具有随机森林分类器的成本敏感方法能够实现100%的召回率(或敏感度)。召回率为100%时,采用随机森林分类器的成本敏感方法的准确度(阳性预测值,PPV)和特异性分别为2.9%和14.87%。在我们的研究中,我们使用数据挖掘技术建立了乳腺癌风险评估模型。我们的模型有潜力作为乳腺癌筛查的辅助工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号