首页> 外文期刊>The Annals of applied statistics >GENE-PROXIMITY MODELS FOR GENOME-WIDE ASSOCIATION STUDIES
【24h】

GENE-PROXIMITY MODELS FOR GENOME-WIDE ASSOCIATION STUDIES

机译:用于基因组关联研究的基因邻近模型

获取原文
获取原文并翻译 | 示例
       

摘要

Motivated by the important problem of detecting association between genetic markers and binary traits in genome-wide association studies, we present a novel Bayesian model that establishes a hierarchy between markers and genes by defining weights according to gene lengths and distances from genes to markers. The proposed hierarchical model uses these weights to define unique prior probabilities of association for markers based on their proximities to genes that are believed to be relevant to the trait of interest. We use an expectation-maximization algorithm in a filtering step to first reduce the dimensionality of the data and then sample from the posterior distribution of the model parameters to estimate posterior probabilities of association for the markers. We offer practical and meaningful guidelines for the selection of the model tuning parameters and propose a pipeline that exploits a singular value decomposition on the raw data to make our model run efficiently on large data sets. We demonstrate the performance of the model in simulation studies and conclude by discussing the results of a case study using a realworld data set provided by the Wellcome Trust Case Control Consortium.
机译:由于在全基因组关联研究中检测到遗传标记和二元性状之间关联的重要问题,我们提出了一种新颖的贝叶斯模型,该模型通过根据基因长度和从基因到标记的距离定义权重来建立标记和基因之间的层次结构。提出的层次模型使用这些权重,根据标记与被认为与目标性状相关的基因的接近度,来定义标记的唯一关联先验概率。我们在过滤步骤中使用期望最大化算法来首先降低数据的维数,然后从模型参数的后验分布中采样以估计标记关联的后验概率。我们提供了用于选择模型调整参数的实用且有意义的指南,并提出了利用原始数据的奇异值分解来使我们的模型在大型数据集上高效运行的管道。我们演示了模型在仿真研究中的性能,并通过讨论使用Wellcome Trust案例控制协会提供的真实数据集进行案例研究的结果来得出结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号