首页> 美国卫生研究院文献>Cancer Informatics >A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
【2h】

A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification

机译:从微阵列基因表达数据中发现生物标志物的混合方法用于癌症分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Microarrays allow researchers to monitor the gene expression patterns for tens of thousands of genes across a wide range of cellular responses, phenotype and conditions. Selecting a small subset of discriminate genes from thousands of genes is important for accurate classification of diseases and phenotypes. Many methods have been proposed to find subsets of genes with maximum relevance and minimum redundancy, which can distinguish accurately between samples with different labels. To find the minimum subset of relevant genes is often referred as biomarker discovery. Two main approaches, filter and wrapper techniques, have been applied to biomarker discovery. In this paper, we conducted a comparative study of different biomarker discovery methods, including six filter methods and three wrapper methods. We then proposed a hybrid approach, FR-Wrapper, for biomarker discovery. The aim of this approach is to find an optimum balance between the precision of the biomarker discovery and the computation cost, by taking advantages of both filter method’s efficiency and wrapper method’s high accuracy. Our hybrid approach applies Fisher’s ratio, a simple method easy to understand and implement, to filter out most of the irrelevant genes, then a wrapper method is employed to reduce the redundancy. The performance of FR-Wrapper approach is evaluated over four widely used microarray datasets. Analysis of experimental results reveals that the hybrid approach can achieve the goal of maximum relevance with minimum redundancy.
机译:利用微阵列,研究人员可以在广泛的细胞反应,表型和条件下监测成千上万个基因的基因表达模式。从成千上万的基因中选择一小部分可区分的基因对于准确分类疾病和表型非常重要。已经提出了许多方法来寻找具有最大相关性和最小冗余度的基因子集,这些子集可以准确区分具有不同标记的样品。寻找相关基因的最小子集通常被称为生物标记物发现。两种主要方法(过滤器和包装器技术)已应用于生物标记物发现。在本文中,我们对不同的生物标记物发现方法进行了比较研究,包括六种过滤方法和三种包装方法。然后,我们提出了一种用于生物标记物发现的混合方法FR-Wrapper。这种方法的目的是通过利用过滤器方法的效率和包装器方法的高精度,在生物标志物发现的精度和计算成本之间找到最佳的平衡。我们的混合方法采用费舍尔比率(一种易于理解和实施的简单方法)来滤除大多数不相关的基因,然后采用包装器方法来减少冗余。 FR-Wrapper方法的性能在四个广泛使用的微阵列数据集中进行了评估。对实验结果的分析表明,混合方法可以以最小的冗余度实现最大的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号