首页> 外文期刊>Biocybernetics and biomedical engineering >Determining relevant biomarkers for prediction of breast cancer using anthropometric and clinical features: A comparative investigation in machine learning paradigm
【24h】

Determining relevant biomarkers for prediction of breast cancer using anthropometric and clinical features: A comparative investigation in machine learning paradigm

机译:使用人体测量和临床特征确定用于预测乳腺癌的相关生物标志物:机器学习范式的比较调查

获取原文
获取原文并翻译 | 示例
       

摘要

Early detection of breast cancer plays crucial role in planning and result of associated treatment. The purpose of this article is threefold: (i) to investigate whether or not clinical features obtained using routine blood analysis combined with anthropometric measurements can be utilized for envisaging breast cancer using predictive machine learning techniques; (ii) to explore the role of various machine learning components such as feature selection, data division protocols and classification to determine suitable biomarkers for breast cancer prediction; and (iii) to evaluate a recent database of clinical and anthropometric measurements acquired from normal individuals and individuals suffering from breast cancer. A database consisting of anthropometric and clinical attributes is used in the experiments. Various feature selection and statistical significance analysis methods are used to determine the relevance of various features. Furthermore, popular classifiers such as kernel based support vector machine (SVM), Naive Bayesian, linear discriminant, quadratic discriminant, logistic regression, K-nearest neighbor (K-NN) and random forest were implemented and evaluated for breast cancer risk prediction using these features. Results of feature selection techniques indicate that among the nine features considered in this study, glucose, age and resistin are found to be most relevant and effective biomarkers for breast cancer prediction. Further, when these three features are used for classification, the medium K-NN classifier achieves the highest classification accuracy of 92.105% followed by medium Gaussian SVM which achieves classification accuracy of 83.684% under hold out data division protocol. (C) 2019 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved.
机译:早期发现乳腺癌在相关治疗的规划和结果中起着至关重要的作用。本文的目的是三倍:(i)探讨使用常规血液分析获得的临床特征是否与人类测量相结合,可以使用预测机器学习技术来设想乳腺癌; (ii)探讨各种机器学习组分的作用,例如特征选择,数据分区协议和分类,以确定乳腺癌预测的合适生物标志物; (iii)评估近期从正常个体和患有乳腺癌的个体获得的临床和人类测量测量数据库。在实验中使用由人类测量和临床属性组成的数据库。各种特征选择和统计显着性分析方法用于确定各种特征的相关性。此外,基于内核的支持向量机(SVM),朴素贝叶斯,线性判别,二次判别,逻辑回归,K最近邻居(K-NN)和随机森林等流行分类器进行了利用这些方法对乳腺癌风险预测进行了实施和随机森林特征。特征选择技术的结果表明,在本研究中考虑的九个特征中,发现葡萄糖,年龄和抗性是乳腺癌预测的最相关和有效的生物标志物。此外,当这三个特征用于分类时,介质K-NN分类器达到92.105%的最高分类精度,然后是媒体高斯SVM,其在保持数据划分协议下实现了83.684%的分类准确度。 (c)2019年纳雷斯州博士科学学院生物医学研究所。 elsevier b.v出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号