首页> 外文会议>IEEE International Conference on Computational Advances in Bio and Medical Sciences >A new gene subset selection approach based on linearly separating gene pairs
【24h】

A new gene subset selection approach based on linearly separating gene pairs

机译:一种基于线性分离基因对的新基因子集选择方法

获取原文

摘要

The concept of linear separability of gene expression data sets with respect to two classes, has been recently studied in literature. The problem is to efficiently find all pairs of genes which induce a linear separation of the data. It has been suggested that an underlying molecular mechanism relates together the two genes of a separating pair to the phenotype under study, such as a specific cancer. In this paper we study the Containment Angle (CA) defined on the unit circle for a linearly separable gene pair as a better alternative to the paired t-test ranking function for gene selection. Using the CA we also show empirically that a given classifier's error is related to the degree of linear separability of a given data set. Finally we propose a new gene subset selection approach based on the CA ranking function. Our approach gives better results, in terms of subset size and classification accuracy when compared to well-performing methods, on many data sets.
机译:最近在文献中研究了基因表达数据集的线性可分离性的概念。问题是有效地找到所有对数据分离的所有基因。已经提出,潜在的分子机制将分离对的两个基因与在研究中的表型相关联,例如特异性癌症。在本文中,我们研究了在单位圆上定义的容纳角(CA),用于线性可分离基因对作为基因选择的配对T检验排名函数的更好的替代方案。使用CA我们还经验上显示给定的分类器的错误与给定数据集的线性可分离程度有关。最后,我们提出了一种基于CA排名功能的新基因子集选择方法。在与执行良好的方法相比,我们的方法在许多数据集中,我们的方法在子集大小和分类准确性方面提供了更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号