首页> 外文会议>Algorithms in Bioinformatics >Placing Probes along the Genome Using Pairwise Distance Data

【24h】

Placing Probes along the Genome Using Pairwise Distance Data

机译：使用成对距离数据沿基因组放置探针

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe the theoretical basis of an approach using microarrays of probes and libraries of BACs to construct maps of the probes, by assigning relative locations to the probes along the genome. The method depends on several hybridization experiments: in each experiment, we sample (with replacement) a large library of BACs to select a small collection of BACs for hybridization with the probe arrays. The resulting data can be used to assign a local distance metric relating the arrayed probes, and then to position the probes with respect to each other. The method is shown to be capable of achieving surprisingly high accuracy within individual contigs and with less than 100 microarray hybridization experiments even when the probes and clones number about 10~5, thus involving potentially around 10~(10) individual hybridizations. This approach is not dependent upon existing BAC contig information, and so should be particularly useful in the application to previously uncharacterized genomes. Nevertheless, the method may be used to independently validate a BAC contig map or a minimal tiling path obtained by intensive genornic sequence determination. We provide a detailed probabilistic analysis to characterize the outcome of a single hybridization experiment and what information can be garnered about the physical distance between any pair of probes. This analysis then leads to a formulation of a likelihood optimization problem whose solution leads to the relative probe locations. After reformulating the optimization problem in a graph-theoretic setting and by exploiting the underlying probabilistic structure, we develop an efficient approximation algorithm for our original problem. We have implemented the algorithm and conducted several experiments for varied sets of parameters. Our empirical results are highly promising and are reported here as well. We also explore how the probabilistic analysis and algorithmic efficiency issues affect the design of the underlying biochemical experiments.

机译：我们描述了一种方法的理论基础，该方法使用了探针的微阵列和BAC的文库以通过沿基因组分配相对位置给探针来构建探针图。该方法取决于几个杂交实验：在每个实验中，我们取样（并替换）大的BAC库以选择少量的BAC与探针阵列杂交。所得数据可用于分配与阵列探针相关的局部距离度量，然后相对于彼此放置探针。结果表明，该方法即使在探针和克隆数约为10〜5的情况下，也能在单个重叠群内实现惊人的高精度，并且少于100个微阵列杂交实验，因此可能涉及约10〜（10）个单独杂交。该方法不依赖于现有的BAC重叠群信息，因此在应用于以前未表征的基因组中应特别有用。然而，该方法可用于独立地验证通过密集的基因组序列确定获得的BAC重叠群图或最小拼接路径。我们提供了详细的概率分析，以表征单个杂交实验的结果以及可以获取有关任何一对探针之间的物理距离的信息。然后，该分析导致了似然性优化问题的表述，其解决方案导致了相对的探针位置。在图论设置中重新优化问题并通过利用潜在的概率结构后，我们为原始问题开发了一种有效的近似算法。我们已经实现了该算法，并针对各种参数集进行了几次实验。我们的经验结果非常有前途，这里也有报道。我们还将探讨概率分析和算法效率问题如何影响基础生化实验的设计。

著录项

来源
《Algorithms in Bioinformatics》|2001年|p.52-68|共17页
会议地点
作者
Will Casey; Bud Mishra; Mike Wigler;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Computational pan-genome mapping and pairwise SNP-distance improve detection of Mycobacterium tuberculosis transmission clusters [J] . Christine Jandrasits, Stefan Kr?ger, Walter Haas, PLoS Computational Biology . 2019,第12期

机译：计算全基因组图谱和成对SNP距离可提高对结核分枝杆菌传播簇的检测
2. PHYLOGENETIC AND GENOME-WIDE PAIRWISE DISTANCE ANALYSIS OF THE GENUS LUTEOVIRUS [J] . Ali Muhammad, Tahir Muhammad, Hameed Shahid Pakistan Journal of Agricultural Sciences . 2017,第2期

机译：叶黄病毒属的系统发育和基因组对成对分析
3. On pairwise distances and median score of three genomes under DCJ [J] . Sergey Aganezov Jr, Max A Alekseyev BMC Bioinformatics . 2012,第SUPPLEMENTa19期

机译：DCJ下三个基因组的成对距离和中位数
4. Placing Probes along the Genome Using Pairwise Distance Data [C] . Will Casey, Bud Mishra, Mike Wigler Workshop on Bioinformatics . 2001

机译：使用成对距离数据沿着基因组放置探针
5. Pairwise Ranking and Removal Network Analysis of Genome-Wide Gene Expression Data: Theory, a New Algorithm, and Analysis of Data from the Cancer Genome Atlas [D] . Sainath Madduri, Abishek. 2017

机译：全基因组表达数据的成对排序和去除网络分析：理论，新算法和癌症基因组图谱的数据分析
6. Computational pan-genome mapping and pairwise SNP-distance improve detection of Mycobacterium tuberculosis transmission clusters [O] . Christine Jandrasits, Stefan Kröger, Walter Haas, 2019

机译：计算全基因组图谱和成对SNP距离可提高结核分枝杆菌传播簇的检测
7. Figure 5: Dendrogram representation of a Euclidean distance matrix derived from pairwise ANIb distances among Tomejil and Neorhizobium type strain genomes. [O] . -1

机译：图5：从Tomejil和Neorhizobium型应变基因组中的成对腹部距离衍生的欧几里德距离矩阵的树木表示。

Placing Probes along the Genome Using Pairwise Distance Data

摘要

著录项

相似文献

相关主题

期刊订阅