首页> 外文会议>ICONIP 2008;International conference on advances in neuro-information processing >An Improved Genetic Algorithm for DNA Motif Discovery with Public Domain Information
【24h】

An Improved Genetic Algorithm for DNA Motif Discovery with Public Domain Information

机译:具有公共领域信息的DNA主题发现的改进遗传算法

获取原文

摘要

Recognition of transcription factor binding sites (TFBSs or DNA motifs) to help with understanding the regulation of gene expression is one of the major challenges in the post-genomics era. Computational approaches have been developed to perform binding sites discovery based on brute-force search techniques or heuristic search algorithms, and numbers of them have achieved some degrees of success. However, the prediction accuracy of the algorithm can be relatively influenced by the natural low signal-to-noise ratio of the DNA sequence. In this paper, a novel DNA motif discovery approach using a genetic algorithm is proposed to explore the ways to improve the algorithm performance. We take account of the publicly available motif models such as Position Frequency Matrix (PFM) to initialize the population. By considering both conservation and complexity of the DNA motifs, a novel fitness function is developed to better evaluate the motif models during the evolution process. A final model refinement process is also introduced for optimizing the motif models. The experimental results demonstrate a comparable (superior) performance of our approach to recently proposed two genetic algorithm motif discovery approaches.
机译:识别转录因子结合位点(TFBS或DNA基序)以帮助理解基因表达的调控是后基因组学时代的主要挑战之一。已经开发出基于蛮力搜索技术或启发式搜索算法的计算方法来执行结合位点发现,并且其中许多方法都取得了一定程度的成功。但是,该算法的预测精度可能会受到DNA序列自然的低信噪比的相对影响。在本文中,提出了一种使用遗传算法的新的DNA基序发现方法,以探索提高算法性能的方法。我们考虑到诸如位置频率矩阵(PFM)之类的公共可用主题模型来初始化总体。通过同时考虑DNA基序的保守性和复杂性,开发了一种新的适应度函数,以便在进化过程中更好地评估基序模型。还引入了最终的模型优化过程,以优化图案模型。实验结果表明,我们的方法与最近提出的两种遗传算法基元发现方法具有可比的(优越)性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号