首页> 外文会议>International Symposium on Advanced Electrical and Communication Technologies >Feature Selection with Fast Correlation-Based Filter for Breast Cancer Prediction and Classification Using Machine Learning Algorithms
【24h】

Feature Selection with Fast Correlation-Based Filter for Breast Cancer Prediction and Classification Using Machine Learning Algorithms

机译:基于快速相关滤波器的特征选择用于基于机器学习算法的乳腺癌预测和分类

获取原文

摘要

Breast cancer is the first female cancer responsible for high mortality worldwide. Despite the progress that has made it possible to better understand the mechanisms of cancer development, the causes of breast cancer are currently unknown. Nevertheless, studies have identified some risk factors that promote breast cancer and a healthy lifestyle can reduce risk. In Morocco, breast cancer is the first cancer in women. It represents 34.3% of all female cancers. In this work, the Fast Correlation-Based Feature selection (FCBF) method is used to filter irrelevant and redundant characteristics in order to improve the quality of cancer classification, and we will provide an overview of the evolution of key data in the health system and apply five learning algorithms to a breast cancer data set. The purpose of this research work is to predict breast cancer, using several machine-learning algorithms that are Random Forest, Naïve Bayes, Support Vector Machines SVM, K-Nearest Neighbors K-NN, and Multilayer Perception MLP, in order to select the most effective algorithm with and without FCBF. The experimental results show that SVM gives the highest accuracy of 97.9% without FCBF but if we apply this method we find that the SVM and MLP show the best results in comparison with other algorithms. The results will help to choose the best learning algorithm machine classification for breast cancer prediction.
机译:乳腺癌是全球首例导致高死亡率的女性癌症。尽管取得了进展,可以更好地了解癌症发展的机制,但目前尚不清楚乳腺癌的病因。尽管如此,研究已经确定了一些会导致乳腺癌的风险因素,健康的生活方式可以降低风险。在摩洛哥,乳腺癌是女性中的首例癌症。它占所有女性癌症的34.3%。在这项工作中,基于快速相关性的特征选择(FCBF)方法用于过滤不相关和冗余的特征,以提高癌症分类的质量,并且我们将概述卫生系统中关键数据的演变以及将五种学习算法应用于乳腺癌数据集。这项研究工作的目的是使用几种机器学习算法来预测乳腺癌,这些算法是随机森林,朴素贝叶斯,支持向量机SVM,K最近邻K-NN和多层感知MLP,以便选择最多的有和没有FCBF的有效算法。实验结果表明,在不使用FCBF的情况下,SVM的准确度最高,为97.9%,但如果应用此方法,则发现与其他算法相比,SVM和MLP表现出最好的结果。结果将有助于选择最佳的学习算法机器分类来预测乳腺癌。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号