首页> 外文会议>International Conference on Information and Communication Technology >Implementing Principal Component Analysis and Multinomial Logit for Cancer Detection based on Microarray Data Classification
【24h】

Implementing Principal Component Analysis and Multinomial Logit for Cancer Detection based on Microarray Data Classification

机译:基于微阵列数据分类的癌症检测主成分分析和多项式Lo​​git实现

获取原文

摘要

Cancer is the second largest cause of death in the world; in 2018, a total of 9.6 million mortalities were recorder, due to cancer alone. It is important to detect this deadly disease early. In the medical field, there are many methods that can be used to detect cancer. One of these methods is microarray data technology. Microarray data reads thousands of gene expressions at the same time. However, this method has a major problem; data with high dimensions can affect classification performance and consume a lot of computational time. Therefore, this research used Principal Component Analysis as the dimensional reduction method. This method performed feature extraction based on a Principal Component (PC) obtained from the calculation of eigenvalues and eigenvectors. Moreover, the data reduction was implemented using a Multinomial Logit Classifier by modifying the parameters estimator using Maximum Likelihood Estimation. The cancer data used in this research consists of Colon Cancer, Leukemia, Lung Cancer, and Ovarian Cancer datasets. The test results for the Ovarian Cancer dataset gave an accuracy of 100% using a Proportion of Variance (PPV) of 90%.
机译:癌症是世界上第二大死亡原因;在2018年,仅因癌症就记录了960万例死亡。重要的是及早发现这种致命的疾病。在医学领域,有许多方法可用于检测癌症。这些方法之一是微阵列数据技术。微阵列数据可同时读取数千种基因表达。但是,该方法存在主要问题。高维数据会影响分类性能并消耗大量计算时间。因此,本研究使用主成分分析作为降维方法。该方法基于从特征值和特征向量的计算中获得的主成分(PC)进行特征提取。此外,使用多项式Lo​​git分类器通过使用最大似然估计修改参数估计器来实现数据精简。本研究中使用的癌症数据包括结肠癌,白血病,肺癌和卵巢癌数据集。卵巢癌数据集的测试结果使用90%的方差比(PPV)给出了100%的准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号