首页> 美国卫生研究院文献>Scientific Reports >EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features
【2h】

EnhancerPred: a predictor for discovering enhancers based on the combination and selection of multiple features

机译:EnhancerPred:一种预测器用于基于多种功能的组合和选择来发现增强剂

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Enhancers are cis elements that play an important role in regulating gene expression by enhancing it. Recent study of modifications revealed that enhancers are a large group of functional elements with many different subgroups, which have different biological activities and regulatory effects on target genes. As powerful auxiliary tools, several computational methods have been proposed to distinguish enhancers from other regulatory elements, but only one method has been considered to clustering them into subgroups. In this study, we developed a predictor (called EnhancerPred) to distinguish between enhancers and nonenhancers and to determine enhancers’ strength. A two-step wrapper-based feature selection method was applied in high dimension feature vector from bi-profile Bayes and pseudo-nucleotide composition. Finally, the combination of 104 features from bi-profile Bayes, 1 feature from nucleotide composition and 9 features from pseudo-nucleotide composition yielded the best performance for identifying enhancers and nonenhancers, with overall Acc of 77.39%. The combination of 89 features from bi-profile Bayes and 10 features from pseudo-nucleotide composition yielded the best performance for identifying strong and weak enhancers, with overall Acc of 68.19%. The process and steps of feature optimization illustrated that it is necessary to construct a particular model for identifying strong enhancers and weak enhancers.
机译:增强子是顺式元件,在通过增强基因表达来调节基因表达中起重要作用。最近对修饰的研究表明,增强子是具有许多不同亚组的一大类功能元件,它们对靶基因具有不同的生物学活性和调节作用。作为强大的辅助工具,已经提出了几种计算方法来将增强子与其他调控元素区分开,但是仅考虑了一种将其聚类为子组的方法。在这项研究中,我们开发了一种预测因子(称为EnhancerPred),以区分增强剂和非增强剂,并确定增强剂的强度。基于两步包装的特征选择方法被应用于来自双谱贝叶斯和伪核苷酸组成的高维特征向量。最后,双谱Bayes的104个特征,核苷酸组成的1个特征和假核苷酸组成的9个特征的组合产生了识别增强子和非增强子的最佳性能,总Acc为77.39%。来自双谱贝叶斯的89个特征和来自伪核苷酸组成的10个特征的组合产生了识别强和弱增强子的最佳性能,总Acc率为68.19%。特征优化的过程和步骤说明,有必要构建一个用于识别强增强剂和弱增强剂的特定模型。

著录项

  • 期刊名称 Scientific Reports
  • 作者

    Cangzhi Jia; Wenying He;

  • 作者单位
  • 年(卷),期 -1(6),-1
  • 年度 -1
  • 页码 38741
  • 总页数 7
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号