...
首页> 外文期刊>BMC Genetics >Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19
【24h】

Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19

机译:研究序列数据的最先进方法:遗传分析研讨会上基于群体的关联小组的方法和结果摘要19

获取原文

摘要

This paper summarizes the contributions from the Population-Based Association group at the Genetic Analysis Workshop 19. It provides an overview of the new statistical approaches tried out by group members in order to take best advantage of population-based sequence data. Although contributions were highly heterogeneous regarding the applied quality control criteria and the number of investigated variants, several technical issues were identified, leading to practical recommendations. Preliminary analyses revealed that Hurdle-negative binomial regression is a promising approach to investigate the distribution of allele counts instead of called genotypes from sequence data. Convergence problems, however, limited the use of this approach, creating a technical challenge shared by environment-stratified models used to investigate rare variant-environment interactions, as well as by rare variant haplotype analyses using well-established public software. Estimates of relatedness and population structure strongly depended on the allele frequency of selected variants for inference. Another practical recommendation was that dissenting probability values from standard and small-sample tests of a particular hypothesis may reflect a lack of validity of large-sample approximations. Novel statistical approaches that integrate evolutionary information showed some advantage to detect weak genetic signals, and Bayesian adjustment for confounding was able to efficiently estimate causal genetic effects. Haplotype association methods may constitute a valuable complement of collapsing approaches for sequence data. This paper reports on the experience of members of the Population-Based Association group with several novel, promising approaches to preprocessing and analyzing sequence data, and to following up identified association signals.
机译:本文总结了在遗传分析研讨会19上基于人群的协会小组的贡献。它概述了小组成员为了充分利用基于人群的序列数据而尝试的新统计方法。尽管在所应用的质量控制标准和研究的变体数量方面,贡献是高度不同的,但还是发现了一些技术问题,从而提出了实用建议。初步分析显示,跨栏阴性二项式回归是一种有前途的方法,可用于研究等位基因计数的分布,而不是从序列数据中调查所谓的基因型。但是,收敛性问题限制了这种方法的使用,这给用于研究稀有变异与环境相互作用的环境分层模型以及使用完善的公共软件进行稀有变异单倍型分析带来了技术挑战。相关性和总体结构的估计在很大程度上取决于所选变异的等位基因频率以进行推断。另一项实用建议是,与特定假设的标准和小样本检验不同的概率值可能反映出缺乏大样本近似值的有效性。整合进化信息的新型统计方法显示出检测弱遗传信号的优势,而贝叶斯混淆的调整能够有效地估计因果遗传效应。单体型关联方法可能构成序列数据折叠方法的宝贵补充。本文报告了基于人口的协会小组成员的经验,并提出了几种新颖的,有前途的方法来预处理和分析序列数据,以及跟踪确定的协会信号。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号