首页> 外文会议>2011 IEEE Congress on Evolutionary Computation >An evolutionary-based approach for feature generation: Eukaryotic promoter recognition
【24h】

An evolutionary-based approach for feature generation: Eukaryotic promoter recognition

机译:基于进化的特征生成方法:真核启动子识别

获取原文

摘要

Prediction of promoter regions continues to be a challenging subproblem in mapping out eukaryotic DNA. While this task is key to understanding the regulation of differential transcription, the gene-specific architecture of promoter sequences does not readily lend itself to general strategies. To date, the best approaches are based on Support Vector Machines (SVMs) that employ standard ”spectrum” features and achieve promoter region classification accuracies from a low of 84% to a high of 94% depending on the particular species involved. In this paper, we propose a general and powerful methodology that uses Genetic Programming (GP) techniques to generate more complex and more gene-specific features to be used with a standard SVM for promoter region identification. We evaluate our methodology on three data sets from different species and observe consistent classification accuracies in the 94–95% range. In addition, because the GP-generated features are gene-specific, they can be used by biologists to advance their understanding of the architecture of eukaryotic promoter regions.
机译:在映射出真核性DNA时,启动子区域的预测仍然是一个具有挑战性的亚数。虽然该任务是了解差异转录调节的关键,但启动子序列的基因特异性架构并不容易借给一般策略。迄今为止,最佳方法基于支持向量机(SVM),其使用标准“光谱”特征,并根据所涉及的特定物种,实现从低于84%至高度的高度为94%的启动子区域分类精度。在本文中,我们提出了一种使用遗传编程(GP)技术来产生更复杂和更多基因特征的一般和强大的方法,以用于启动子区域鉴定标准SVM。我们在不同物种的三个数据集上评估我们的方法,并观察在94-95%范围内的一致分类精度。此外,由于GP产生的特征是基因特异性,因此可以通过生物学家使用它们来推进对真核启动子区域的结构的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号