Estimating evolutionary distances between genomic sequences from spaced-word matches

Abstract

Alignment-free methods are increasingly used to calculate evolutionary distances between DNA and protein sequences as a basis of phylogeny reconstruction. Most of these methods, however, use heuristic distance functions that are not based on any explicit model of molecular evolution. Herein, we propose a simple estimator dN of the evolutionary distance between two DNA sequences that is calculated from the number N of (spaced) word matches between them. We show that this distance function is more accurate than other distance measures that are used by alignment-free methods. In addition, we calculate the variance of the normalized number N of (spaced) word matches. We show that the variance of N is smaller for spaced words than for contiguous words, and that the variance is further reduced if our spaced-words approach is used with multiple patterns of ‘match positions’ and ‘don’t care positions’. Our software is available online and as downloadable source code at: .
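
The idea of counting (spaced) word matches and turning the count N into a distance can be illustrated with a short Python sketch. This is not the authors' software and not necessarily their exact estimator. The function names, the pattern encoding ('1' for a match position, '0' for a don't-care position) and the simplified model are assumptions made for illustration: the sequences are treated as indel-free, N is modelled as homologous matches (about L * p^k for a pattern with k match positions) plus random background matches (about L1 * L2 * q^k, with q = 0.25 for uniform base frequencies), and the resulting match probability p is converted with the Jukes-Cantor formula.

```python
import math
from collections import Counter


def spaced_words(seq, pattern):
    """Yield the spaced word found at each position of seq.

    pattern is a string over {'1', '0'}: '1' marks a match position,
    '0' a don't-care position (hypothetical encoding for this sketch).
    """
    match_idx = [i for i, c in enumerate(pattern) if c == "1"]
    for start in range(len(seq) - len(pattern) + 1):
        yield "".join(seq[start + i] for i in match_idx)


def count_matches(seq1, seq2, pattern):
    """Return N, the number of position pairs whose spaced words agree."""
    counts1 = Counter(spaced_words(seq1, pattern))
    counts2 = Counter(spaced_words(seq2, pattern))
    return sum(n * counts2[word] for word, n in counts1.items())


def estimate_distance(seq1, seq2, pattern, q=0.25):
    """Illustrative distance estimate under the simplified model above.

    Assumption (not taken from the source): N ~ L * p**k + L1 * L2 * q**k,
    where k is the number of match positions in the pattern.
    """
    k = pattern.count("1")
    n1 = len(seq1) - len(pattern) + 1  # number of word positions in seq1
    n2 = len(seq2) - len(pattern) + 1  # number of word positions in seq2
    n_matches = count_matches(seq1, seq2, pattern)
    # Subtract the expected number of random background matches.
    homologous = max(n_matches - n1 * n2 * q ** k, 0.0)
    p = (homologous / min(n1, n2)) ** (1.0 / k)
    if p >= 1.0:
        return 0.0
    if p <= 0.25:
        return float("inf")  # saturated: Jukes-Cantor is undefined here
    # Jukes-Cantor correction for a mismatch probability of 1 - p.
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * (1.0 - p))
```

For example, `count_matches("ACGTACGT", "ACCTACGA", "101")` counts the pairs of positions whose first and third characters agree. Averaging the estimate over several patterns of the same weight would be one way to mimic the multiple-pattern variant mentioned in the abstract.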


Bibliographic record

Journal: Algorithms for Molecular Biology : AMB
Authors: Burkhard Morgenstern; Bingyao Zhu; Sebastian Horwege; Chris André Leimeister
Year (volume), issue: 2015(10),-1
Year: 2015
Page: 5
Total pages: 12
Original format: PDF
CLC classification: Applied microbiology; Biochemical genetics; Biochemical pharmacology
Keywords: k-mers; Spaced words; Alignment-free; Phylogeny; Word frequency; Distance estimation; Variance; Genome comparison

Similar literature

1. Estimating evolutionary distances between genomic sequences from spaced-word matches [J]. Burkhard Morgenstern, Bingyao Zhu, Sebastian Horwege, Algorithms for Molecular Biology. 2015, No. 1
2. The influence of selection on the evolutionary distance estimated from the base changes observed between homologous nucleotide sequences [J]. Otsuka J, Kawai Y, Sugaya N, Journal of Theoretical Biology. 2001, No. 2
3. An Evolutionary Distance Based on Maximal Unique Matches [J]. Frederic Guyon, Alain Guenoche, Communications in Statistics. 2010, No. 3a5
4. Estimating Evolutionary Distances from Spaced-Word Matches [C]. Burkhard Morgenstern, Bingyao Zhu, Sebastian Horwege, International workshop on algorithms in bioinformatics. 2014
5. Utilizing Next Generation Sequencing to Generate Bacterial Genomic Sequences for Evolutionary Analysis [D]. Scott, Derrick C. 2014
6. The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances [O]. Sophie Röhling, Alexander Linne, Jendrik Schellhorn, 2020
7. Estimating evolutionary distances between genomic sequences from spaced-word matches [O]. 2015
