...
首页> 外文期刊>BMC Veterinary Research >Efficient mining gapped sequential patterns for motifs in biological sequences
【24h】

Efficient mining gapped sequential patterns for motifs in biological sequences

机译:高效挖掘生物序列中基序的缺口连续模式

获取原文

摘要

BackgroundPattern mining for biological sequences is an important problem in bioinformatics and computational biology. Biological data mining yield impact in diverse biological fields, such as discovery of co-occurring biosequences, which is important for biological data analyses. The approaches of mining sequential patterns can discover all-length motifs of biological sequences. Nevertheless, traditional approaches of mining sequential patterns inefficiently mine DNA and protein data since the data have fewer letters and lengthy sequences. Furthermore, gap constraints are important in computational biology since they cope with irrelative regions, which are not conserved in evolution of biological sequences.ResultsWe devise an approach to efficiently mine sequential patterns (motifs) with gap constraints in biological sequences. The approach is the Depth-First Spelling algorithm for mining sequential patterns of biological sequences with Gap constraints (termed DFSG).ConclusionsPrefixSpan is one of the most efficient methods in traditional approaches of mining sequential patterns, and it is the basis of GenPrefixSpan. GenPrefixSpan is an approach built on PrefixSpan with gap constraints, and therefore we compare DFSG with GenPrefixSpan. In the experimental results, DFSG mines biological sequences much faster than GenPrefixSpan.
机译:背景技术生物序列的模式挖掘是生物信息学和计算生物学中的重要问题。生物数据挖掘对不同生物学领域的产量产生影响,例如发现共生生物序列,这对于生物学数据分析很重要。挖掘顺序模式的方法可以发现生物序列的全长主题。但是,传统的挖掘顺序模式的方法无法有效地挖掘DNA和蛋白质数据,因为这些数据具有较少的字母和较长的序列。此外,缺口约束在计算生物学中很重要,因为它们可以处理非相关区域,这些序列在生物学序列的进化中是不保守的。该方法是深度优先拼写算法,用于挖掘具有Gap约束的生物序列的顺序模式(称为DFSG)。结论PrefixSpan是传统的挖掘顺序模式方法中最有效的方法之一,它是GenPrefixSpan的基础。 GenPrefixSpan是一种在具有间隙约束的PrefixSpan上构建的方法,因此我们将DFSG与GenPrefixSpan进行了比较。在实验结果中,DFSG挖掘生物序列的速度比GenPrefixSpan快得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号