Locating regions in a sequence under density constraints

Burton B.A.; Hiron M.

首页> 外文期刊>SIAM Journal on Computing >Locating regions in a sequence under density constraints

【24h】

Locating regions in a sequence under density constraints

机译：在密度约束下按顺序定位区域

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Several biological problems require the identification of regions in a sequence where some feature occurs within a target density range: examples including the location of GC-rich regions, identification of CpG islands, and sequence matching. Mathematically, this corresponds to searching a string of 0's and 1's for a substring whose relative proportion of 1's lies between given lower and upper bounds. We consider the algorithmic problem of locating the longest such substring, as well as other related problems (such as finding the shortest substring or a maximal set of disjoint substrings). For locating the longest such substring, we develop an algorithm that runs in O(n) time, improving upon the previous best-known O(n log n) result. For the related problems we develop O(n log log n) algorithms, again improving upon the best-known O(n log n) results. Practical testing verifies that our new algorithms enjoy significantly smaller time and memory footprints, and can process sequences that are orders of magnitude longer as a result.

机译：几个生物学问题需要识别序列中某些特征在目标密度范围内发生的区域：示例包括富含GC的区域的位置，CpG岛的识别和序列匹配。从数学上讲，这对应于搜索0和1的字符串以寻找子字符串，该子字符串的1的相对比例位于给定的上下限之间。我们考虑定位最长的此类子字符串的算法问题，以及其他相关问题（例如，找到最短的子字符串或最大的不相交子字符串集）。为了找到最长的此类子字符串，我们开发了一种在O（n）时间内运行的算法，对先前最著名的O（n log n）结果进行了改进。针对相关问题，我们开发了O（n log log n）算法，再次改进了最著名的O（n log log n）结果。实际测试证明，我们的新算法所占用的时间和内存明显更少，并且可以处理的序列长了几个数量级。

著录项

来源
《SIAM Journal on Computing》 |2013年第3期|共15页
作者
Burton B.A.; Hiron M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 51.9;
关键词
Algorithms; Bioinformatics; String processing; Substring density;

机译：算法;生物信息学;字符串处理;子字符串密度;

相似文献

外文文献
中文文献
专利

1. Locating regions in a sequence under density constraints [J] . Burton B.A., Hiron M. SIAM Journal on Computing . 2013,第3期

机译：在密度约束下按顺序定位区域
2. Fine mapping and DNA fiber FISH analysis locates the tobamovirus resistance gene L3 of Capsicum chinense in a 400-kb region of R-like genes cluster embedded in highly repetitive sequences [J] . R. Tomita, J. Murai, Y. Miura, TAG Theoretical and Applied Genetics . 2008,第7期

机译：精细定位和DNA纤维FISH分析将辣椒的烟草花叶病毒抗性基因L 3 定位在高度重复序列中嵌入的R样基因簇的400 kb区域中
3. Multiobjective Optimization Models for Locating Vehicle Inspection Stations Subject to Stochastic Demand, Varying Velocity and Regional Constraints [J] . Guangdong Tian, MengChu Zhou, Peigen Li, IEEE Transactions on Intelligent Transportation Systems . 2016,第7期

机译：随机需求，速度变化和区域约束的车辆检查站定位多目标优化模型
4. Prediction of protein disordered regions in a protein sequence based on gap-constraint subsequence patterns [C] . Meijing Li, Xiuming Yu, Taewook Kim, The 4th International Conference on Awareness Science and Technology. . 2012

机译：基于空位约束子序列模式的蛋白质序列中蛋白质无序区的预测
5. Optimally Locating Level I Trauma Centers and Aeromedical Depots for Rural Regions of the State of Ohio [D] . Pepe, Linda R. 2017

机译：俄亥俄州农村地区的最佳定位级别Itauma中心和航空医疗仓库
6. Locating regions of differential variability in DNA and protein sequences. [O] . H Tang, R C Lewontin 1999

机译：定位DNA和蛋白质序列差异变异的区域。
7. Locating regions in a sequence under density constraints [O] . Burton Benjamin A., Hiron Mathias 2013

机译：在密度约束下按顺序定位区域

Locating regions in a sequence under density constraints

摘要

著录项

相似文献

相关主题

期刊订阅