首页> 外文期刊>Journal of Intelligent Systems >Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
【24h】

Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization

机译:采用使用萤火虫杂交和组搜索优化的限制的挖掘DNA序列模式

获取原文
获取原文并翻译 | 示例
           

摘要

DNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA database utilizing constraint measures and group search optimization (GSO). In that study, GSO calculation was utilized to optimize the sequence extraction process from a given DNA database. However, it is apparent that, occasionally, such an arbitrary seeking system does not accompany the optimal solution in the given time. To overcome the problem, we proposed in this work multiple constraints with hybrid firefly and GSO (HFGSO) algorithm. The complete DNA sequence mining process comprised the following three modules: (i) applying prefix span algorithm; (ii) calculating the length, width, and regular expression (RE) constraints; and (iii) optimal mining via HFGSO. First, we apply the concept of prefix span, which detects the frequent DNA sequence pattern using a prefix tree. Based on this prefix tree, length, width, and RE constraints are applied to handle restrictions. Finally, we adopt the HFGSO algorithm for the completeness of the mining result. The experimentation is carried out on the standard DNA sequence dataset, and the evaluation with DNA sequence dataset and the results show that our approach is better than the existing approach.
机译:DNA序列挖掘对于DNA序列的结构和功能研究至关重要。有关信息挖掘任务的序列挖掘的文献中发表了一些探索作品。类似地,在我们过去的纸张中,利用约束测量和组搜索优化(GSO)对DNA数据库进行有效序列挖掘。在该研究中,利用GSO计算从给定的DNA数据库优化序列提取过程。然而,显而易见的是,这种任意寻求系统在给定时间内没有伴随最佳解决方案。为了克服这个问题,我们在这项工作中提出了与混合萤火虫和GSO(HFGSO)算法的多个约束。完整的DNA序列采矿过程包括以下三个模块:(i)应用前缀跨度算法; (ii)计算长度,宽度和正则表达式(RE)约束; (iii)通过HFGSO的最佳开采。首先,我们应用前缀跨度的概念,它使用前缀树检测频繁的DNA序列模式。基于此前缀树,长度,宽度和重新约束以处理限制。最后,我们采用HFGSO算法进行挖掘结果的完整性。实验在标准DNA序列数据集上进行,并使用DNA序列数据集进行评估,结果表明我们的方法优于现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号