Efficient mining gapped sequential patterns for motifs in biological sequences

Vance Chiang-Chi Liao; Ming-Syan Chen

首页> 外文期刊>BMC Veterinary Research >Efficient mining gapped sequential patterns for motifs in biological sequences

【24h】

Efficient mining gapped sequential patterns for motifs in biological sequences

机译：高效挖掘生物序列中基序的缺口连续模式

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

BackgroundPattern mining for biological sequences is an important problem in bioinformatics and computational biology. Biological data mining yield impact in diverse biological fields, such as discovery of co-occurring biosequences, which is important for biological data analyses. The approaches of mining sequential patterns can discover all-length motifs of biological sequences. Nevertheless, traditional approaches of mining sequential patterns inefficiently mine DNA and protein data since the data have fewer letters and lengthy sequences. Furthermore, gap constraints are important in computational biology since they cope with irrelative regions, which are not conserved in evolution of biological sequences.ResultsWe devise an approach to efficiently mine sequential patterns (motifs) with gap constraints in biological sequences. The approach is the Depth-First Spelling algorithm for mining sequential patterns of biological sequences with Gap constraints (termed DFSG).ConclusionsPrefixSpan is one of the most efficient methods in traditional approaches of mining sequential patterns, and it is the basis of GenPrefixSpan. GenPrefixSpan is an approach built on PrefixSpan with gap constraints, and therefore we compare DFSG with GenPrefixSpan. In the experimental results, DFSG mines biological sequences much faster than GenPrefixSpan.

机译：背景技术生物序列的模式挖掘是生物信息学和计算生物学中的重要问题。生物数据挖掘对不同生物学领域的产量产生影响，例如发现共生生物序列，这对于生物学数据分析很重要。挖掘顺序模式的方法可以发现生物序列的全长主题。但是，传统的挖掘顺序模式的方法无法有效地挖掘DNA和蛋白质数据，因为这些数据具有较少的字母和较长的序列。此外，缺口约束在计算生物学中很重要，因为它们可以处理非相关区域，这些序列在生物学序列的进化中是不保守的。该方法是深度优先拼写算法，用于挖掘具有Gap约束的生物序列的顺序模式（称为DFSG）。结论PrefixSpan是传统的挖掘顺序模式方法中最有效的方法之一，它是GenPrefixSpan的基础。 GenPrefixSpan是一种在具有间隙约束的PrefixSpan上构建的方法，因此我们将DFSG与GenPrefixSpan进行了比较。在实验结果中，DFSG挖掘生物序列的速度比GenPrefixSpan快得多。

著录项

来源
《BMC Veterinary Research》 |2013年第4期|共页
作者
Vance Chiang-Chi Liao; Ming-Syan Chen;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类动物医学（兽医学）;
关键词

相似文献

外文文献
中文文献
专利

1. Efficient mining gapped sequential patterns for motifs in biological sequences [J] . Vance Chiang-Chi Liao, Ming-Syan Chen BMC Systems Biology . 2013,第S4期

机译：高效挖掘生物序列中基序的缺口序列模式
2. Efficient mining gapped sequential patterns for motifs in biological sequences [J] . Vance Chiang-Chi Liao, Ming-Syan Chen BMC Veterinary Research . 2013,第SUPPLEMENTa4期

机译：高效挖掘生物序列中基序的缺口顺序模式
3. Efficient Mining of Interesting Patterns in Large Biological Sequences [J] . Md. Mamunur Rashid, Md. Rezaul Karim, Byeong-Soo Jeong, Genomics & Informatics . 2012,第1期

机译：大型生物序列中有趣模式的有效挖掘
4. An efficient sequential pattern mining algorithm for motifs with gap constraints [C] . Liao Vance Chiang-Chi, Chen Ming-Syan 2012 IEEE International Conference on Bioinformatics and Biomedicine. . 2012

机译：带有间隙约束的图案的高效顺序模式挖掘算法
5. Mining High Utility Sequential Patterns from Uncertain Web Access Sequences using the PL-WAP [D] . Vangala, Sravya. 2017

机译：使用PL-WAP从不确定的Web访问序列中挖掘高实用程序顺序模式
6. Efficient mining gapped sequential patterns for motifs in biological sequences [O] . Vance Chiang-Chi Liao, Ming-Syan Chen 2013

机译：高效挖掘生物序列中基序的缺口序列模式
7. Efficient mining gapped sequential patterns for motifs in biological sequences [O] . 2013

机译：高效挖掘生物序列中基序的缺口序列模式

Efficient mining gapped sequential patterns for motifs in biological sequences

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅