...
首页> 外文期刊>Indian Journal of Science and Technology >Performance Analysis of Regression and Classification Models in the Prediction of Breast Cancer
【24h】

Performance Analysis of Regression and Classification Models in the Prediction of Breast Cancer

机译:回归和分类模型在乳腺癌预测中的性能分析

获取原文
           

摘要

Objective: To suggest an automated diagnostic system for the early detection of breast cancer. Methods: This problem has been addressed by making use of machine learning algorithms that can accurately classify a tumor as either malignant or benign by identifying the minimum number of image features. A comparative study on various classification approaches such as Decision Tree, Support Vector Machine, K-Nearest Neighbor and Random Forest have also been conducted with a focus on cross validation to identify the best performing model. Findings: The study shows that Random Forest classifier gives the maximum accuracy. It also highlights that cross validation and fine tuning are necessary to prevent over fitting of data. Improvements: It has been observed that the selection of parameters play a very important role in correct classification as multicollinearity among attributes can render classifier models ineffective.
机译:目的:为乳腺癌的早期诊断提供一种自动诊断系统。方法:已经通过使用机器学习算法解决了这个问题,该算法可以通过识别最少数量的图像特征来将肿瘤准确地分类为恶性或良性。还对各种分类方法(例如决策树,支持向量机,K最近邻和随机森林)进行了比较研究,重点是交叉验证以识别性能最佳的模型。研究结果表明,随机森林分类器可提供最大的准确性。它还强调了交叉验证和微调对于防止数据过度拟合是必要的。改进:已经观察到参数的选择在正确分类中起着非常重要的作用,因为属性之间的多重共线性会使分类器模型无效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号