首页> 外文会议>Asia-Pacific Bioinformatics Conference(APBC 2003); 200302; Adelaide(AU) >Machine Learning in DNA Microarray Analysis for Cancer Classification
【24h】

Machine Learning in DNA Microarray Analysis for Cancer Classification

机译:DNA微阵列分析中的机器学习用于癌症分类

获取原文
获取原文并翻译 | 示例

摘要

The development of microarray technology has supplied a large volume of data to many fields. In particular, it has been applied to prediction and diagnosis of cancer, so that it expectedly helps us to exactly predict and diagnose cancer. To precisely classify cancer we have to select genes related to cancer because extracted genes from microarray have many noises. In this paper, we attempt to explore many features and classifiers using three benchmark datasets to systematically evaluate the performances of the feature selection methods and machine learning classifiers. Three benchmark datasets are Leukemia cancer dataset, Colon cancer dataset and Lymphoma cancer data set. Pearson's and Spearman's correlation coefficients, Euclidean distance, cosine coefficient, information gain, mutual information and signal to noise ratio have been used for feature selection. Multi-layer perceptron, k-nearest neighbour, support vector machine and structure adaptive self-organizing map have been used for classification. Also, we have combined the classifiers to improve the performance of classification. Experimental results show that the ensemble with several basis classifiers produces the best recognition rate on the benchmark dataset.
机译:微阵列技术的发展为许多领域提供了大量数据。特别是,它已被应用于癌症的预测和诊断,因此有望帮助我们准确地预测和诊断癌症。为了精确分类癌症,我们必须选择与癌症相关的基因,因为从微阵列中提取的基因会产生很多噪音。在本文中,我们尝试使用三个基准数据集探索许多特征和分类器,以系统地评估特征选择方法和机器学习分类器的性能。三个基准数据集是白血病癌症数据集,结肠癌数据集和淋巴瘤癌症数据集。皮尔逊和斯皮尔曼的相关系数,欧式距离,余弦系数,信息增益,互信息以及信噪比已用于特征选择。多层感知器,k最近邻,支持向量机和结构自适应自组织图已用于分类。另外,我们结合了分类器以提高分类性能。实验结果表明,具有多个基础分类器的集成在基准数据集上产生最佳识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号