Discovering regulatory motifs of genetic networks using the indexing-tree based algorithm: a parallel implementation

Almomany Abedalmuhdi; Al-Omari Ahmad M.; Jarrah Amin; Tawalbeh Mohammad

首页> 外文期刊>Engineering Computations >Discovering regulatory motifs of genetic networks using the indexing-tree based algorithm: a parallel implementation

【24h】

Discovering regulatory motifs of genetic networks using the indexing-tree based algorithm: a parallel implementation

机译：使用基于索引树的算法发现遗传网络的监管图案：并行实现

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Purpose The problem of motif discovery has become a significant challenge in the era of big data where there are hundreds of genomes requiring annotations. The importance of motifs has led many researchers to develop different tools and algorithms for finding them. The purpose of this paper is to propose a new algorithm to increase the speed and accuracy of the motif discovering process, which is the main drawback of motif discovery algorithms. Design/methodology/approach All motifs are sorted in a tree-based indexing structure where each motif is created from a combination of nucleotides: 'A', 'C', 'T' and 'G'. The full motif can be discovered by extending the search around 4-mer nucleotides in both directions, left and right. Resultant motifs would be identical or degenerated with various lengths. Findings The developed implementation discovers conserved string motifs in DNA without having prior information about the motifs. Even for a large data set that contains millions of nucleotides and thousands of very long sequences, the entire process is completed in a few seconds. Originality/value Experimental results demonstrate the efficiency of the proposed implementation; as for a real-sequence of 1,270,000 nucleotides spread into 2,000 samples, it takes 5.9 s to complete the overall discovering process when the code ran on an Intel Core i7-6700 @ 3.4 GHz machine and 26.7 s when running on an Intel Xeon x5670 @ 2.93 GHz machine. In addition, the authors have improved computational performance by parallelizing the implementation to run on multi-core machines using the OpenMP framework. The speedup achieved by parallelizing the implementation is scalable and proportional to the number of processors with a high efficiency that is close to 100%.

机译：目的motif发现的问题已成为在有数百个需要注释的基因组的大数据的时代显著的挑战。图案的重要性，导致许多研究人员开发不同的工具和算法寻找他们。本文的目的是提出一种新的算法，以提高主题发现的过程，这是motif发现算法的主要缺点的速度和准确性。设计/方法/接近所有基序在从核苷酸的组合创建的每个基序的基于树的索引结构来分类：“A”，“C”，“T”和“G”。完整的图案可以通过扩展在两个方向上约4个碱基核苷酸搜索发现，左，右。所得图案是相同或不同长度的退化。发现而发达执行发现的保守的DNA串的基序，而无需关于基序之前的信息。即使对于包含数百万个核苷酸，数千很长序列的一个大的数据集，整个过程在几秒钟内完成。创作/值实验结果表明所提出的实现的效率;为的127万个核苷酸传播真正的序列化为2000个样品，需要5.9 s到完成时，英特尔酷睿i7-6700 @ 3.4 GHz的机器上的代码RAN和26.7 S于英特尔至强X5670运行时，整个过程情迷@ 2.93 GHz的机器。另外，作者通过并行实施，使用OpenMP的框架多核机器上运行提升计算性能。通过并行执行所取得的加速是可扩展的和成比例的高效率接近于100％的处理器的数量。

著录项

来源
《Engineering Computations》 |2021年第1期|354-370|共17页
作者
Almomany Abedalmuhdi; Al-Omari Ahmad M.; Jarrah Amin; Tawalbeh Mohammad;
展开▼
作者单位

Yarmouk Univ Dept Comp Engn Irbid Jordan;

Yarmouk Univ Dept Biomed Syst & Bioinformat Engn Irbid Jordan;

Yarmouk Univ Dept Comp Engn Irbid Jordan;

Jordan Univ Sci & Technol Informat Technol Ctr Irbid Jordan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Optimization; OpenMP; Parallel processing; Genetic network; Multi-core; Regulation motif;

机译：优化;OpenMP;并行处理;遗传网络;多核;调节主题;

相似文献

外文文献
中文文献
专利

1. Research on the construction of the gene regulatory network based on the hybrid parallel genetic algorithms [J] . Basic & clinical pharmacology & toxicology. . 2020,第S3期

机译：基于杂交平行遗传算法的基因调节网络建设研究
2. Research on the construction of the gene regulatory network based on the hybrid parallel genetic algorithms [J] . Wang Pengfei, Wang Hongyong Basic & clinical pharmacology & toxicology. . 2019,第S1期

机译：基于杂交平行遗传算法的基因调节网络建设研究
3. Research on the construction of the gene regulatory network based on the hybrid parallel genetic algorithms [J] . Wang Pengfei, Wang Hongyong Basic & clinical pharmacology & toxicology. . 2019,第S1期

机译：基于杂交平行遗传算法的基因调节网络建设研究
4. A parallel algorithm for extracting transcriptional regulatory network motifs [C] . Tie Wang, Touchman, J.W., . 2005

机译：提取转录调控网络基序的并行算法
5. The design, analysis, and implementation of parallel simulated annealing and parallel genetic algorithms for the composite graph coloring problem [D] . Elmer, Brent Scott 1993

机译：复合图着色问题的并行模拟退火与并行遗传算法的设计，分析与实现
6. A Parallel Attractor Finding Algorithm Based on Boolean Satisfiability for Genetic Regulatory Networks [O] . Wensheng Guo, Guowu Yang, Wei Wu, -1

机译：基于布尔可满足性的遗传调节网络并行吸引子查找算法
7. A Parallel Attractor Finding Algorithm Based on Boolean Satisfiability for Genetic Regulatory Networks [O] . Wensheng Guo, Guowu Yang, Wei Wu, 2016

机译：基于布尔可满足性的遗传调控网并行吸引子查找算法
8. Identification of continuous-time dynamical systems: Neural network based algorithms and parallel implementation. [R] . Farber, R. M., Lapedes, A. S., Rico-Martinez, R., 1993

机译：连续时间动态系统的识别：基于神经网络的算法和并行实现。

Discovering regulatory motifs of genetic networks using the indexing-tree based algorithm: a parallel implementation

摘要

著录项

相似文献

相关主题

期刊订阅