首页> 外文会议>Annual genetic and evolutionary computation conference;GECCO-2010 >Selecting Predictive Features for Recognition of Hypersensitive Sites of Regulatory Genomic Sequences with an Evolutionary Algorithm
【24h】

Selecting Predictive Features for Recognition of Hypersensitive Sites of Regulatory Genomic Sequences with an Evolutionary Algorithm

机译:使用进化算法选择预测特征来识别调控基因组序列的超敏感位点

获取原文

摘要

This paper proposes a method to improve the recognition of regulatory genomic sequences. Annotating sequences that regulate gene transcription is an emerging challenge in ge-nomics research. Identifying regulatory sequences promises to reveal underlying reasons for phenotypic differences among cells and for diseases associated with pathologies in protein expression. Computational approaches have been limited by the scarcity of experimentally-known features specific to regulatory sequences. High-throughput experimental technology is finally revealing a wealth of hypersensitive (HS) sequences that are reliable markers of regulatory sequences and currently the focus of classification methods. The contribution of this paper is a novel method that combines evolutionary computation and SVM classification to improve the recognition of HS sequences. Based on experimental evidence that HS regions employ sequence features to interact with enzymes, the method seeks motifs to discriminate between HS and non-HS sequences. An evolutionary algorithm (EA) searches the space of sequences of different lengths to obtain such motifs. Experiments reveal that these motifs improve recognition of HS sequences by more than 10% compared to state-of-the-art classification methods. Analysis of these motifs reveals interesting insight into features employed by regulatory sequences to interact with DNA-binding enzymes.
机译:本文提出了一种方法,以提高监管基因组序列的识别。注释调控基因转录的序列是基因组学研究中的一个新兴挑战。鉴定调控序列有望揭示细胞之间表型差异以及与蛋白质表达病理学相关疾病的根本原因。计算方法受到缺乏特定于调节序列的实验已知特征的限制。高通量实验技术最终揭示了许多超敏(HS)序列,它们是调节序列的可靠标记,目前是分类方法的重点。本文的贡献是一种将进化计算和SVM分类相结合以提高HS序列识别能力的新颖方法。基于HS区域利用序列特征与酶相互作用的实验证据,该方法寻求可区分HS和非HS序列的基序。进化算法(EA)搜索不同长度序列的空间以获得此类基序。实验表明,与最新的分类方法相比,这些基序可将HS序列的识别率提高10%以上。对这些基序的分析揭示了对调节序列与DNA结合酶相互作用的特征的有趣见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号