首页> 外文期刊>BMC Bioinformatics >A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies
【24h】

A hierarchical Bayesian network approach for linkage disequilibrium modeling and data-dimensionality reduction prior to genome-wide association studies

机译:用于全基因组关联研究之前的连锁不平衡建模和数据维数减少的分层贝叶斯网络方法

获取原文
           

摘要

Background Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1 million genetic markers) and its complexity make the statistical analysis a challenging task. Results We present an accurate modeling of dependences between genetic markers, based on a forest of hierarchical latent class models which is a particular class of probabilistic graphical models. This model offers an adapted framework to deal with the fuzzy nature of linkage disequilibrium blocks. In addition, the data dimensionality can be reduced through the latent variables of the model which synthesize the information borne by genetic markers. In order to tackle the learning of both forest structure and probability distributions, a generic algorithm has been proposed. A first implementation of our algorithm has been shown to be tractable on benchmarks describing 105 variables for 2000 individuals. Conclusions The forest of hierarchical latent class models offers several advantages for genome-wide association studies: accurate modeling of linkage disequilibrium, flexible data dimensionality reduction and biological meaning borne by latent variables.
机译:背景技术发现人类基因组中常见遗传疾病的遗传基础代表着公共卫生问题。但是,遗传数据的维数(最多一百万个遗传标记)及其复杂性使统计分析成为一项艰巨的任务。结果我们基于分层的潜伏类模型的森林提出了精确的遗传标记之间依赖性模型,这是一类特殊的概率图形模型。该模型提供了一个适应框架,以处理连锁不平衡模块的模糊性质。另外,数据的维数可以通过模型的潜在变量来减少,这些变量综合了遗传标记携带的信息。为了解决对森林结构和概率分布的学习,提出了一种通用算法。我们的算法的第一个实现已被证明在描述2000个个体的10 5 变量的基准上很容易处理。结论分层潜在类模型森林为全基因组关联研究提供了多个优势:链接不平衡的精确建模,灵活的数据维数减少以及潜在变量所具有的生物学意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号