首页> 外文期刊>BMC proceedings. >Identification of functional genetic variation in exome sequence analysis
【24h】

Identification of functional genetic variation in exome sequence analysis

机译:exme序列分析功能遗传变异的鉴定

获取原文
           

摘要

Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant’s functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs.
机译:最近的技术进步使我们能够以碱基对分辨率研究个体基因组,并证明了平均突出的遗骸超过15,000个遗传变异。然而,我们理解鉴定的变体的生物学意义和将这些观察到的变体与表型的能力有限。该方法的第一步是鉴定可能导致蛋白质结构和功能变化的遗传变异,因为对每个已识别的变体的群体或功能的详细研究是不可行的。因此,需要产生有效预测变体功能意义的算法。在过去的十年中,已经开发了几个程序来预测观察到的序列变体对蛋白质功能有害影响的可能性。这些算法范围从经验程序范围内容使用已知的生物化学属性对使用各种数据源进行训练的统计算法,包括序列节约数据,生物化学特性和功能数据。使用来自Pilot3研究的数据通过遗传分析研讨会17可获得的1000个基因组项目,我们比较了用于预测101个基因中变体的功能性相关性的四个程序(Sift,Polyphen,MAPP和VARIOWatch的结果。在没有仿真模型的情况下进行分析。方案之间的协议适度范围从59.4%到71.4%,只有3.5%的变种被归类为所有四个方案的有害和10.9%的宽容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号