Large Scale Reconstruction of Haplotypes from Genotype Data

机译：基因型数据大规模重建单倍型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize an individual's variation, we must determine an individual's haplotype or which nucleotide base occurs at each position of these common SNPs for each chromosome. In this paper, we present results for a highly accurate method for haplotype resolution from genotype data. Our method leverages a new insight into the underlying structure of haplotypes which shows that SNPs are organized in highly correlated "blocks". The majority of individuals have one of about four common haplotypes in each block. Our method partitions the SNPs into blocks and for each block, we predict the common haplotypes and each individual's haplotype. We evaluate our method over biological data. Our method predicts the common haplotypes perfectly and has a very low errorrate (0.47%) when taking into account the predictions for the uncommon haplotypes. Our method is extremely efficient compared to previous methods, (a matter of seconds where previous methods needed hours). Its efficiency allows us to find the block partition of the haplotypes, to cope with missing data and to work with large data sets such as genotypes for thousands of SNPs for hundreds of individuals. The algorithm is available via webserver at http://www.cs.colvunbia.edu/compbio/hap/ .

机译：对复杂疾病的遗传基础的理解至关重要是人类变异的建模。这些变化的大部分可以特征在于单一核苷酸多态性（SNP），其在单个核苷酸位置处于突变。为了表征个人的变异，我们必须确定个体的单倍型或在每种染色体的这些常见SNP的每个位置发生核苷酸碱。在本文中，我们对来自基因型数据的单倍型分辨率的高准确方法提供了结果。我们的方法利用了新的洞察到单倍型的底层结构，表明SNP在高度相关的“块”中组织。大多数人在每个街区中有大约四种常见单倍型。我们的方法将SNP分配成块和每个块，我们预测常见的单倍型和每个单独的单倍型。我们评估我们对生物数据的方法。我们的方法在考虑到罕见单倍型的预测时，我们完美地预测了常见的单倍型并且具有非常低的错误（0.47％）。与以前的方法相比，我们的方法非常有效，（先前方法需要几个小时的几秒钟）。其效率使我们能够找到单倍型的块分区，以应对缺失的数据，并使用大量数据集，诸如数千个SNP的基因型。该算法可通过网络服务器访问http://www.cs.colvunbia.edu/compbio/hap/。

著录项

来源
《Annual International Conference on Research in Computational Molecular Biology》|2003年||共10页
会议地点
作者
Eleazar Eskin; Eran Halperin; Richard M. Karp;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 Q811.4-532;
关键词

相似文献

外文文献
中文文献
专利

1. Haplotype reconstruction from genotype data using Imperfect Phylogeny [J] . Eran Halperin, Eleazar Eskin Bioinformatics . 2004,第12期

机译：利用不完善的系统发育从基因型数据重建单倍型
2. Haplotype reconstruction from genotype data using Imperfect Phylogeny [J] . Eran Halperin, Eleazar Eskin Bioinformatics . 2004,第12期

机译：利用不完善的系统发育从基因型数据重建单倍型
3. Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns [J] . Yun Joo Yoo, Jianming Tang, Richard A. Kaslow, Bioinformatics . 2007,第18期

机译：使用先前确定的单倍型和单倍型模式对当前基因型数据进行单倍型推断
4. Large scale reconstruction of haplotypes from genotype data [C] . Eleazar Eskin, Eran Halperin, Richard M. Karp Annual international conference on Computational molecular biology;International conference on Computational molecular biology . 2003

机译：从基因型数据大规模重建单倍型
5. Algorithms for Inferring Haplotypes from Genotype Data of Pedigrees. [D] . Doan, Duong Dai. 2011

机译：从谱系基因型数据推断单倍型的算法。
6. A Comparison of Bayesian Methods for Haplotype Reconstruction from Population Genotype Data [O] . Matthew Stephens, Peter Donnelly 2003

机译：从人口基因型数据的单倍型重建的贝叶斯方法的比较。
7. Large scale reconstruction of haplotypes from genotype data [O] . Eleazar Eskin, Eran Halperin, Richard M. Karp 2003

机译：从基因型数据大规模重建单倍型

Large Scale Reconstruction of Haplotypes from Genotype Data

摘要

著录项

相似文献

相关主题

期刊订阅