首页> 外文会议>Annual genetic and evolutionary computation conference;GECCO-2010 >Challenges Rising from Learning Motif Evaluation Functions Using Genetic Programming
【24h】

Challenges Rising from Learning Motif Evaluation Functions Using Genetic Programming

机译:使用遗传编程学习母题评估功能带来的挑战

获取原文

摘要

Motif discovery is an important Bioinformatics problem for deciphering gene regulation. Numerous sequence-based approaches have been proposed employing human specialist motif models (evaluation functions), but performance is so unsatisfactory on benchmarks that the underlying information seems to have already been exploited. However, we have found that even a simple modified representation still achieves considerably high performance on a challenging benchmark, implying potential for sequence-based motif discovery. Thus we raise the problem of learning motif evaluation functions. We employ Genetic programming (GP) which has the potential to evolve human competitive models. We take advantage of the terminal set containing specialist-model-like components and have tried three fitness functions. Results exhibit both great challenges and potentials. No models learnt can perform universally well on the challenging benchmark, where one reason may be the data appropriateness for sequence-based motif discovery. However, when applied on different widely-tested datasets, the same models achieve comparable performance to existing approaches based on specialist models. The study calls for further novel GP to learn different levels of effective evaluation models from strict to loose ones on exploiting sequence information for motif discovery, namely quantitative functions, cardinal rankings, and learning feasibility classifications.
机译:主题发现是破译基因调控的重要生物信息学问题。已经提出了许多基于序列的方法,这些方法都采用了人类专家的主题模型(评估函数),但是在基准上的性能是如此差强人意,以至于潜在的信息似乎已经被利用了。但是,我们发现,即使是简单的修饰表示形式,在具有挑战性的基准测试下仍可实现相当高的性能,这意味着基于序列的基序发现具有潜力。因此,我们提出了学习主题评估功能的问题。我们采用遗传编程(GP),它具有发展人类竞争模型的潜力。我们利用了包含专家模型样组件的终端机,并尝试了三种适应功能。结果既有巨大的挑战,也有潜力。没有一个学习的模型能够在具有挑战性的基准上普遍表现良好,其中一个原因可能是基于序列的基序发现的数据适当性。但是,当将这些模型应用于经过广泛测试的不同数据集时,其性能可与基于专家模型的现有方法相媲美。这项研究呼吁进一步的新型GP来学习不同级别的有效评估模型,从严格模型到宽松模型,都需要利用序列信息进行基序发现,即定量功能,基数排名和学习可行性分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号