首页> 外文会议>Asian conference on intelligent information and database systems >Multiple Gene Sets for Cancer Classification Using Gene Range Selection Based on Random Forest
【24h】

Multiple Gene Sets for Cancer Classification Using Gene Range Selection Based on Random Forest

机译:基于随机森林的基因范围选择用于癌症分类的多个基因集

获取原文

摘要

The advancement of microarray technology allows obtaining genetic information from cancer patients, as computational data and cancer classification through computation software, has become possible. Through gene selection, we can identify certain numbers of informative genes that can be grouped into a smaller sets or subset of genes; which are informative genes taken from the initial data for the purpose of classification. In most available methods, the amount of genes selected in gene subsets are dependent on the gene selection technique used and cannot be fine-tuned to suit the requirement for particular number of genes. Hence, a proposed technique known as gene range selection based on a random forest method allows selective subset for better classification of cancer datasets. Our results indicate that various gene sets assist in increasing the overall classification accuracy of the cancer related datasets, as the amount of genes can be further scrutinized to create the best subset of genes. Moreover, it can assist the gene-filtering technique for further analysis of the microarray data in gene network analysis, gene-gene interaction analysis and many other related fields.
机译:微阵列技术的进步使得从癌症患者获得遗传信息成为可能,因为通过计算软件进行计算数据和癌症分类已成为可能。通过基因选择,我们可以确定一定数量的信息基因,这些信息可以分为较小的一组基因或子集。它们是从初始数据中获取的用于分类目的的信息基因。在大多数可用方法中,在基因子集中选择的基因数量取决于所使用的基因选择技术,因此无法进行微调以满足特定数量基因的要求。因此,一种被称为基于随机森林方法的基因范围选择的提议技术允许选择子集以更好地分类癌症数据集。我们的结果表明,各种基因集有助于提高与癌症相关的数据集的整体分类准确性,因为可以进一步检查基因的数量以创建基因的最佳子集。而且,它可以协助基因过滤技术在基因网络分析,基因-基因相互作用分析和许多其他相关领域中进一步分析微阵列数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号