首页> 美国卫生研究院文献>Bioinformation >An improved hybrid of SVM and SCAD for pathway analysis
【2h】

An improved hybrid of SVM and SCAD for pathway analysis

机译:改进的SVM和SCAD混合用于路径分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Pathway analysis has lead to a new era in genomic research by providing further biological process information compared to traditional single gene analysis. Beside the advantage, pathway analysis provides some challenges to the researchers, one of which is the quality of pathway data itself. The pathway data usually defined from biological context free, when it comes to a specific biological context (e.g. lung cancer disease), typically only several genes within pathways are responsible for the corresponding cellular process. It also can be that some pathways may be included with uninformative genes or perhaps informative genes were excluded. Moreover, many algorithms in pathway analysis neglect these limitations by treating all the genes within pathways as significant. In previous study, a hybrid of support vector machines and smoothly clipped absolute deviation with groups-specific tuning parameters (gSVM-SCAD) was proposed in order to identify and select the informative genes before the pathway evaluation process. However, gSVM-SCAD had showed a limitation in terms of the performance of classification accuracy. In order to deal with this limitation, we made an enhancement to the tuning parameter method for gSVM-SCAD by applying the B-Type generalized approximate cross validation (BGACV). Experimental analyses using one simulated data and two gene expression data have shown that the proposed method obtains significant results in identifying biologically significant genes and pathways, and in classification accuracy.
机译:与传统的单基因分析相比,通路分析通过提供更多的生物学过程信息,引领了基因组研究的新纪元。除了优势之外,路径分析还给研究人员带来了一些挑战,其中之一就是路径数据本身的质量。当涉及到特定的生物学环境(例如肺癌疾病)时,通常从无生物学环境中定义途径数据,通常仅途径内的几个基因负责相应的细胞过程。也可能是某些途径可能包含非信息性基因,或者可能排除了信息性基因。此外,途径分析中的许多算法通过将途径内的所有基因都视为重要基因来忽略这些限制。在先前的研究中,提出了一种混合的支持向量机和具有特定于组的调整参数(gSVM-SCAD)的平滑剪切绝对偏差,以便在路径评估过程之前识别和选择信息基因。但是,gSVM-SCAD在分类准确度方面表现出局限性。为了解决此限制,我们通过应用B型广义近似交叉验证(BGACV)对gSVM-SCAD的调整参数方法进行了增强。使用一种模拟数据和两种基因表达数据进行的实验分析表明,该方法在鉴定生物学上重要的基因和途径以及分类准确性方面获得了显着的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号