首页> 外文会议>International Symposium on Advanced Electrical and Communication Technologies >Feature Selection with Fast Correlation-Based Filter for Breast Cancer Prediction and Classification Using Machine Learning Algorithms
【24h】

Feature Selection with Fast Correlation-Based Filter for Breast Cancer Prediction and Classification Using Machine Learning Algorithms

机译:采用机器学习算法的快速相关性基于快速相关的乳腺癌预测和分类的特征选择

获取原文

摘要

Breast cancer is the first female cancer responsible for high mortality worldwide. Despite the progress that has made it possible to better understand the mechanisms of cancer development, the causes of breast cancer are currently unknown. Nevertheless, studies have identified some risk factors that promote breast cancer and a healthy lifestyle can reduce risk. In Morocco, breast cancer is the first cancer in women. It represents 34.3% of all female cancers. In this work, the Fast Correlation-Based Feature selection (FCBF) method is used to filter irrelevant and redundant characteristics in order to improve the quality of cancer classification, and we will provide an overview of the evolution of key data in the health system and apply five learning algorithms to a breast cancer data set. The purpose of this research work is to predict breast cancer, using several machine-learning algorithms that are Random Forest, Na?ve Bayes, Support Vector Machines SVM, K-Nearest Neighbors K-NN, and Multilayer Perception MLP, in order to select the most effective algorithm with and without FCBF. The experimental results show that SVM gives the highest accuracy of 97.9% without FCBF but if we apply this method we find that the SVM and MLP show the best results in comparison with other algorithms. The results will help to choose the best learning algorithm machine classification for breast cancer prediction.
机译:乳腺癌是第一个负责全世界死亡率的女性癌症。尽管取得了进展使得可以更好地了解癌症发展的机制,但目前未知乳腺癌的原因。尽管如此,研究已经确定了促进乳腺癌和健康生活方式的一些危险因素可以降低风险。在摩洛哥,乳腺癌是女性的第一个癌症。它占所有女性癌症的34.3%。在这项工作中,基于快速相关的特征选择(FCBF)方法用于过滤无关和冗余特性,以提高癌症分类的质量,并概述了健康系统中的关键数据的演变将五种学习算法应用于乳腺癌数据集。本研究工作的目的是预测乳腺癌,使用几种机器学习算法,是随机森林,Na?ve贝叶斯,支持向量机SVM,K最近邻居K-NN和多层感知MLP,以便选择最有效的算法,没有fcbf。实验结果表明,没有FCBF,SVM提供了97.9%的最高精度,但如果我们应用这种方法,我们发现SVM和MLP与其他算法相比,SVM和MLP显示了最佳结果。结果将有助于为乳腺癌预测选择最佳学习算法机分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号