A Comparative Study for Breast Cancer Prediction using Machine Learning and Feature Selection

机译：基于机器学习和特征选择的乳腺癌预测比较研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While there are many factors which could contribute to the occurrence of breast cancer, it is very difficult to attribute the exact environmental and other factors contributing to it, but still it has significance in determining the occurrence of cancer. Using machine learning techniques and regular diagnosis information, we can achieve our goal of assessing the risk of occurrence of breast cancer. Cancer data sets contain many attributes of patient information, but not every feature is relevant in predicting cancer. Feature selection techniques are useful in such scenarios for retaining the relevant feature set. In this paper we are doing a comparative study of the effect of feature selection techniques on the accuracies given by existing machine learning algorithms. For this purpose we have considered the following machine learning algorithms - Logistic Regression, Naive Bayes and Random Forest. The following feature selection techniques have been considered - Sequential Forward Feature Selection, Recursive Feature Elimination, f-test and correlation.The publicly available Breast Cancer Wisconsin (Diagnostic) Data Sets from UCI Repository have been used in this work. The results show that random forest algorithm gives the highest accuracy with feature selection. Furthermore f-test gives better results for the smaller dataset and Sequential Forward Selection for the larger dataset.

机译：尽管有许多因素可能导致乳腺癌的发生，但是很难确切地归因于造成乳腺癌的确切环境因素和其他因素，但是在确定癌症的发生方面仍然具有重要意义。使用机器学习技术和定期的诊断信息，我们可以实现评估乳腺癌发生风险的目标。癌症数据集包含患者信息的许多属性，但并非每个功能都与预测癌症相关。在这种情况下，特征选择技术对于保留相关特征集很有用。在本文中，我们正在对特征选择技术对现有机器学习算法所给出的准确性的影响进行比较研究。为此，我们考虑了以下机器学习算法-Logistic回归，朴素贝叶斯和随机森林。考虑了以下特征选择技术-顺序正向特征选择，递归特征消除，f检验和相关性。这项工作使用了UCI知识库中公开的乳腺癌威斯康星州（诊断）数据集。结果表明，随机森林算法在特征选择方面具有最高的准确性。此外，对于较小的数据集，f检验可提供更好的结果;对于较大的数据集，顺序检验可提供更好的结果。

著录项

来源
《International Conference on Intelligent Computing and Control Systems》|2019年|1049-1055|共7页
会议地点
作者
Dhanya R; Irene Rose Paul; Sai Sindhu Akula; Madhumathi Sivakumar; Jyothisha J Nair;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Machine learning algorithms; Breast cancer; Prediction algorithms; Standards; Machine learning;

机译：特征提取;机器学习算法;乳腺癌;预测算法;标准;机器学习;

相似文献

外文文献
中文文献
专利

1. Predicting breast cancer survivability based on machine learning and features selection algorithms: a comparative study [J] . El Rahman Sahar A. Journal of ambient intelligence and humanized computing . 2021,第8期

机译：基于机器学习和特征选择算法预测乳腺癌生存能力：比较研究
2. Feature Selection from Biological Database for Breast Cancer Prediction and Detection Using Machine Learning Classifier [J] . Abhineet Gupta, Baij Nath Kaushik Journal of Artificial Intelligence . 2018,第2期

机译：使用机器学习分类器从生物学数据库中进行乳腺癌预测和检测的特征选择
3. Determining relevant biomarkers for prediction of breast cancer using anthropometric and clinical features: A comparative investigation in machine learning paradigm [J] . Singh Bikesh Kumar Biocybernetics and biomedical engineering . 2019,第2期

机译：使用人体测量和临床特征确定用于预测乳腺癌的相关生物标志物：机器学习范式的比较调查
4. A Comparative Analysis of Feature Selection Methods and Associated Machine Learning Algorithms on Wisconsin Breast Cancer Dataset (WBCD) [C] . Nileshkumar Modi, Kaushar Ghanchi International Conference on Information and Communication Technology for Sustainable Development . 2016

机译：威斯康星乳腺癌数据集（WBCD）特征选择方法和相关机学习算法的比较分析
5. Prediction of CYP3A4 Metabolic Activity from Whole Genome RNA-Seq Data with Feature Selection Machine Learning Methods [D] . Jia, Yichen. 2017

机译：特征选择机器学习方法从全基因组RNA-Seq数据预测CYP3A4代谢活性
6. Top scoring pairs for feature selection in machine learning and applications to cancer outcome prediction [O] . Ping Shi, Surajit Ray, Qifu Zhu, 2011

机译：机器学习中的特征选择和癌症结果预测应用中的最高得分对
7. Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype [O] . Kushangi Atrey, Yogesh Sharma, Narendra K. Bodhey, 2019

机译：利用基于优势的特征过滤方法的乳腺癌预测：机器学习原型的比较调查

A Comparative Study for Breast Cancer Prediction using Machine Learning and Feature Selection

摘要

著录项

相似文献

相关主题

期刊订阅