...
首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Feature selection and classification for gene expression data using novel correlation based overlapping score method via Chou's 5-steps rule
【24h】

Feature selection and classification for gene expression data using novel correlation based overlapping score method via Chou's 5-steps rule

机译:通过Chou的5步规则使用基于新型相关性的重叠分数法的基因表达数据的特征选择和分类

获取原文
获取原文并翻译 | 示例
           

摘要

The analysis of omics data together with knowledge-based interpretation can help obtaining important information regarding different biological processes and to reflect the current physiological status of tissue and cells. The main challenge, however, is to analyze high-dimensional gene expression data consisting of a massive amount of redundant genes in extracting disease-related information. To address this problem, gene selection, that eliminates redundant and irrelevant genes, has been a key step. In current article, a feature selection technique is proposed that exploit correlation based overlapping analysis of expression data across classes. The proposed correlation based overlapping score (COS) technique is compared with state-of-the-art gene selection approaches using real-world benchmark microarray datasets. In an experimental evaluation, the COS algorithm outperforms the other methods with minimum misclassification errors obtained via boosting, random forest and k-nearest neighbour (kNN) classifiers. Moreover, the proposed technique is more stable than the other techniques in gene selection.
机译:与知识的解释一起分析OMICS数据可以帮助获得有关不同生物过程的重要信息,并反映组织和细胞的目前的生理状态。然而,主要挑战是分析由大量的冗余基因组成的高尺寸基因表达数据,在提取疾病相关信息中。为了解决这个问题,可以消除冗余和无关基因的基因选择一直是一个关键步骤。在当前文章中,提出了一种特征选择技术,从而利用基于类的相关性数据的相关性分析。将所提出的基于相关性的重叠分数(COS)技术与使用真实基准微阵列数据集进行了最先进的基因选择方法。在实验评估中,COS算法优于通过升压,随机森林和k最近邻(KNN)分类器获得的最小错误分类误差的其他方法。此外,所提出的技术比基因选择中的其他技术更稳定。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号