首页> 外文会议>International Conference on Pattern Recognition >Finding Rule Groups to Classify High Dimensional Gene Expression Datasets
【24h】

Finding Rule Groups to Classify High Dimensional Gene Expression Datasets

机译:查找规则组以对高维基因表达数据集进行分类

获取原文

摘要

Microarray data provides quantitative information about the transcription profile of cells. To analyze microarray datasets, methodology of machine learning has increasingly attracted bioinformatics researchers. Some approaches of machine learning are widely used to classify and mine biological datasets. However, many gene expression datasets are extremely high dimensionality, traditional machine learning methods can not be applied effectively and efficiently. This paper proposes a robust algorithm to find out rule groups to classify gene expression datasets. Unlike the most classification algorithms, which select dimensions (genes) heuristically to form rules groups to identify classes such as cancerous and normal tissues, our algorithm guarantees finding out best-k dimensions (genes), which are most discriminative to classify samples in different classes, to form rule groups for the classification of expression datasets. Our experiments show that the rule groups obtained by our algorithm have higher accuracy than that of other classification approaches.
机译:微阵列数据提供有关细胞转录概况的定量信息。为了分析微阵列数据集,机器学习的方法越来越吸引了生物信息学研究人员。一些机器学习方法广泛用于分类和矿山生物数据集。然而,许多基因表达数据集是极高的维度,传统的机器学习方法不能有效且有效地应用。本文提出了一种强大的算法来查找规则组以对基因表达数据集进行分类。与最多分类的算法不同,它选择尺寸(基因)形成规则组以识别癌症和正常组织等类别,我们的算法保证了发现最佳k尺寸(基因),这些尺寸(基因)是分类不同类别中样本的最佳态度(基因) ,为表达式数据集的分类表格组。我们的实验表明,我们的算法获得的规则组具有比其他分类方法更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号