Faster Computation of Genome Mappability with one Mismatch

机译：用一个不匹配更快地计算基因组可用性

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Summary form only given. The genome mappability problem refers to cataloging repetitive occurrences of every substring of length m in a genome, and its k-mappability variant extends this to approximate repeats by allowing up to k mismatches. This problem is formulated as follows: Given a sequence S[1, n] of length n over the constant DNA alphabet Σ = {A, C, G, T}, and two integers k and m ≤ n, output an integer array F_k, such that: F_k[i] = |{j ≠ i|d_H(S[i, i + m - 1], S[j, j + m - 1]) ≤ k}| where d_H(·,·) represents the hamming distance. Derrien et al. [PLoS one 2012] represented this problem within the framework of genome analysis. In this work we present a provably efficient algorithm for 1-mappability with O(n log n) worst case run time and O(n) spece. The fundamental technique is the heavy path decomposition on the suffix tree (ST) of S, and the entire work is based on the framework by Thankachan et al. [RECOMB 2018]. The previous best known run time is O(n log n log log n) [Alzamel et al., COCOA 2017].

机译：摘要表格仅给出。基因组涂布性问题是指在基因组中的每一个长度M的亚流量的重复发生，并且其K-易用性变型通过允许高达k不匹配来延伸至近似重复。该问题的制定如下：给定长度N的序列S [1，n]，在恒定的DNA字母σ= {a，c，g，t}和两个整数k和m≤n上，输出整数阵列f_{k ，这样的：f_{k [i] = | {J≠I| D._{h （S [I，I + M-1]，S [J，J + M-1]）≤K} |其中d_{h （·，·）代表汉明距离。 Derrien等人。 [Plos 2012]在基因组分析框架内代表了这个问题。在这项工作中，我们呈现了一种可释放的有效算法，可提供与O（n log n）最坏情况运行时间和O（n）规格的1-oppappity算法。基本技术是S的后缀树（ST）的沉重路径分解，整个工作基于ChranthAn等人的框架。 [Recomb 2018]。以前最着名的运行时间是O（n log n log log n）[Alzamel等，Cocoa 2017]。}}}}

著录项

来源
《IEEE International Conference on Computational Advances in Bio and Medical Sciences》|2018年|54p|共1页
会议地点
作者
Sahar Hooshmand; Paniz Abedin; Daniel Gibney; Srinivas Aluru; Sharma V. Thankachan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP39-53;
关键词
Genomics; Bioinformatics; Computer science; Hamming distance; DNA; Indexes; Optimization;

机译：基因组学;生物信息学;计算机科学;汉明距离;DNA;指数;优化;

相似文献

外文文献
中文文献
专利

1. Faster, safer, and better DNA purification by ultracentrifugation using GelRed stain and development of mismatch oligo DNA for genome walking [J] . Kasajima Ichiro, Ohtsubo Norihiro, Sasaki Katsutomo Bioscience, Biotechnology, and Biochemistry . 2014,第11期

机译：通过使用凝胶污渍和Mismatch Oligo DNA的超速离心，更安全，更安全，更好的DNA纯化，用于基因组步行
2. MapDisto: fast and efficient computation of genetic linkage maps [J] . Lorieux Mathias Molecular Breeding . 2012,第2期

机译：MapDisto：快速有效地计算遗传连锁图
3. Using linkage maps to correct and scaffold de novo genome assemblies: methods, challenges, and computational tools [J] . Janna L. Fierst Frontiers in Genetics . 2015,第1期

机译：使用连锁图校正和支撑 de novo 基因组组装：方法，挑战和计算工具
4. Faster Computation of Genome Mappability with one Mismatch [C] . Sahar Hooshmand, Paniz Abedin, Daniel Gibney, IEEE International Conference on Computational Advances in Bio and Medical Sciences . 2018

机译：一种不匹配的基因组可定位性的更快计算
5. Bioinformatics: Using Computational Techniques (Whole Genome & Rna-Sequencing) to Understand Bacterial Genome Evolution and Biopharmaceutical Development =Bioinformatics: Using Computational Techniques (Whole Genome & RNA-Sequencing) to Understand Bac [D] . Duncan, David Jonith. 2020

机译：生物信息学：使用计算技术（全基因组和RNA测序）了解细菌基因组进化和生物制药发育=生物信息学：使用计算技术（全基因组＆amp; RNA测序）来理解BAC
6. GenMap: ultra-fast computation of genome mappability [O] . Christopher Pockrandt, Mai Alzamel, Costas S Iliopoulos, -1

机译：GenMap：基因组可定位性的超快速计算
7. GenMap: Fast and Exact Computation of Genome Mappability [O] . Christopher Pockrandt, Mai Alzamel, Costas S. Iliopoulos, 2019

机译：Genmap：基因组可用性的快速和精确计算
8. In Silico Genome Mismatch Scanning to Map Breast Cancer Genes in Extended Pedigrees [R] . Thomas, A. 2009

机译：在silico基因组错配扫描中绘制扩展谱系中的乳腺癌基因

Faster Computation of Genome Mappability with one Mismatch

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅