首页> 美国卫生研究院文献>American Journal of Human Genetics >Improved Pathogenic Variant Localization via a Hierarchical Model of Sub-regional Intolerance
【2h】

Improved Pathogenic Variant Localization via a Hierarchical Model of Sub-regional Intolerance

机译:通过子区域不宽容的分层模型改善病原体变异定位

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Different parts of a gene can be of differential importance to development and health. This regional heterogeneity is also apparent in the distribution of disease-associated mutations, which often cluster in particular regions of disease-associated genes. The ability to precisely estimate functionally important sub-regions of genes will be key in correctly deciphering relationships between genetic variation and disease. Previous methods have had some success using standing human variation to characterize this variability in importance by measuring sub-regional intolerance, i.e., the depletion in functional variation from expectation within a given region of a gene. However, the ability to precisely estimate local intolerance was restricted by the fact that only information within a given sub-region is used, leading to instability in local estimates, especially for small regions. We show that borrowing information across regions using a Bayesian hierarchical model stabilizes estimates, leading to lower variability and improved predictive utility. Specifically, our approach more effectively identifies regions enriched for ClinVar pathogenic variants. We also identify significant correlations between sub-region intolerance and the distribution of pathogenic variation in disease-associated genes, with AUCs for classifying de novo missense variants in Online Mendelian Inheritance in Man (OMIM) genes of up to 0.86 using exonic sub-regions and 0.91 using sub-regions defined by protein domains. This result immediately suggests that considering the intolerance of regions in which variants are found may improve diagnostic interpretation. We also illustrate the utility of integrating regional intolerance into gene-level disease association tests with a study of known disease-associated genes for epileptic encephalopathy.
机译:基因的不同部分对于发育和健康可能具有不同的重要性。这种区域异质性在疾病相关突变的分布中也很明显,这些突变通常聚集在疾病相关基因的特定区域。准确估计基因的功能重要子区域的能力将是正确破译遗传变异与疾病之间关系的关键。以前的方法已经成功地利用人类的常规变异来通过测量次区域不耐受性来表征这种变异的重要性,即,在给定基因区域内功能变异的缺失。但是,仅使用给定子区域内的信息这一事实限制了精确估计局部不宽容的能力,这导致局部估计的不稳定,尤其是对于小区域。我们显示,使用贝叶斯层次模型跨区域借阅信息可以稳定估计,从而降低可变性并提高预测效用。具体而言,我们的方法可以更有效地识别富含ClinVar致病变异的区域。我们还确定了亚区域不耐受性和疾病相关基因中病原性变异的分布之间的显着相关性,使用外显子区域和AUC对在线孟德尔男性遗传(OMIM)基因中从头错义变体的分类最高可达0.86。 0.91,使用蛋白质结构域定义的子区域。该结果立即表明,考虑发现变异的区域的不耐受性可以改善诊断解释。我们还说明了将癫痫性脑病的已知疾病相关基因研究与区域不耐受性整合到基因水平疾病相关性测试中的效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号