首页> 美国卫生研究院文献>Bioinformatics >Likelihood-based complex trait association testing for arbitrary depth sequencing data
【2h】

Likelihood-based complex trait association testing for arbitrary depth sequencing data

机译:基于似然性的复杂性状关联测试用于任意深度测序数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Summary: In next generation sequencing (NGS)-based genetic studies, researchers typically perform genotype calling first and then apply standard genotype-based methods for association testing. However, such a two-step approach ignores genotype calling uncertainty in the association testing step and may incur power loss and/or inflated type-I error. In the recent literature, a few robust and efficient likelihood based methods including both likelihood ratio test (LRT) and score test have been proposed to carry out association testing without intermediate genotype calling. These methods take genotype calling uncertainty into account by directly incorporating genotype likelihood function (GLF) of NGS data into association analysis. However, existing LRT methods are computationally demanding or do not allow covariate adjustment; while existing score tests are not applicable to markers with low minor allele frequency (MAF). We provide an LRT allowing flexible covariate adjustment, develop a statistically more powerful score test and propose a combination strategy (UNC combo) to leverage the advantages of both tests. We have carried out extensive simulations to evaluate the performance of our proposed LRT and score test. Simulations and real data analysis demonstrate the advantages of our proposed combination strategy: it offers a satisfactory trade-off in terms of computational efficiency, applicability (accommodating both common variants and variants with low MAF) and statistical power, particularly for the analysis of quantitative trait where the power gain can be up to ∼60% when the causal variant is of low frequency (MAF < 0.01).>Availability and implementation: UNC combo and the associated R files, including documentation, examples, are available at >Contact: >Supplementary information: are available at Bioinformatics online.
机译:>摘要::在基于下一代测序(NGS)的遗传研究中,研究人员通常会先进行基因型调用,然后再应用基于标准基因型的方法进行关联测试。但是,这种两步方法忽略了关联测试步骤中基因型调用的不确定性,并可能导致功率损耗和/或I型错误扩大。在最近的文献中,已经提出了一些鲁棒且有效的基于似然性的方法,包括似然比检验(LRT)和得分检验,以进行关联检验而无需中间基因型调用。这些方法通过将NGS数据的基因型似然函数(GLF)直接纳入关联分析来考虑基因型调用不确定性。但是,现有的LRT方法对计算要求很高,或者不允许进行协变量调整。而现有的分数测试不适用于低等位基因频率(MAF)的标记。我们提供了一种LRT,可以灵活地进行协变量调整,开发统计上更强大的得分测试,并提出一种组合策略(UNC组合)以利用两种测试的优势。我们已经进行了广泛的模拟,以评估我们提出的轻快铁和分数测试的性能。仿真和真实数据分析证明了我们提出的组合策略的优点:在计算效率,适用性(适应常见变体和低MAF的变体)和统计能力方面,它提供了令人满意的折衷,特别是对于定量性状的分析当因果变量为低频时(MAF <0.01),功率增益最高可达60%。>可用性和实现:UNC组合和相关的R文件(包括文档,示例)是可在>联系方式: >补充信息:上在线访问生物信息学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号