首页> 外文会议>International Conference on Intelligent Systems Design and Applications >Classifying breast cancer types based on fine needle aspiration biopsy data using random forest classifier
【24h】

Classifying breast cancer types based on fine needle aspiration biopsy data using random forest classifier

机译:使用随机森林分类器根据细针穿刺活检数据对乳腺癌类型进行分类

获取原文

摘要

Breast cancer is a complex and heterogeneous disease due to its diverse morphological features, as well as different clinical outcome. As a result, breast cancer patients may response to different therapeutic options. Currently, difficulties in recognizing the breast cancer types lead to inefficient treatments. Generally, there are two types of breast cancer, known as malignant and benign. Therefore it is necessary to devise a clinically meaningful classification of the disease that can accurately classify breast cancer tissues into relevant classes. This study aims to classify breast cancer lesions which have been obtained from fine needle aspiration (FNA) procedure using random forest. Random forest is a classifier built based on the combination of decision trees and has been identified to perform well in comparison to other machine learning techniques. This method has been tested on approximately 700 data, which consists of 458 instances from benign cases and 241 instances belong to malignant cases. The performance of proposed method is measured based on sensitivity, specificity and accuracy. The experimental results show that, random forest achieved sensitivity of 75%, specificity of 70% and accuracy about 72%. Thus, it can be concluded that random forest can accurately classify breast cancer types given a small number of features and it works as a promising tool to differentiate malignant from benign tumor at early stage.
机译:乳腺癌由于其多样的形态学特征以及不同的临床结果而成为一种复杂的异质性疾病。结果,乳腺癌患者可能对不同的治疗选择有反应。当前,识别乳腺癌类型的困难导致治疗效率低下。通常,有两种类型的乳腺癌,称为恶性和良性。因此,有必要设计一种对该疾病具有临床意义的分类,该分类可以将乳腺癌组织准确地分类为相关分类。这项研究的目的是对使用随机森林从细针穿刺术(FNA)获得的乳腺癌病变进行分类。随机森林是一种基于决策树组合构建的分类器,与其他机器学习技术相比,随机森林的性能较好。该方法已在大约700个数据上进行了测试,其中包括458个良性病例,而241个属于恶性病例。该方法的性能是根据敏感性,特异性和准确性来衡量的。实验结果表明,随机森林的灵敏度为75%,特异性为70%,准确度约为72%。因此,可以得出这样的结论:随机森林可以在少数特征的情况下准确地对乳腺癌类型进行分类,并且可以作为在早期将恶性肿瘤与良性肿瘤区分开的有前途的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号