首页> 外文会议>ICIC 2013 >Machine Learning-Based Approaches Identify a Key Physicochemical Property for Accurately Predicting Polyadenlylation Signals in Genomic Sequences
【24h】

Machine Learning-Based Approaches Identify a Key Physicochemical Property for Accurately Predicting Polyadenlylation Signals in Genomic Sequences

机译:基于机器学习的方法鉴定了用于准确预测基因组序列中的聚腺苷酸化信号的关键物理化学性质

获取原文

摘要

Accurately predicting poly(A) signals (PASs) is one of important topics in bioinformatics for high-quality genome annotation and transcription regulation mechanism investigation. In this study, we identified a powerful physicochemical property of DNA sequence for computationally predicting PASs using machine learning technologies. On the basis of this feature, we built a PAS prediction model by capturing the position-specific information from the region surrounding PASs. The cross-validation results demonstrated that the prediction accuracies of our constructed model on 12 categories of human PASs are comparable to those of recently published PAS predictor Dragon PolyA Spotter. Further analysis revealed that the region 25 nucleotides downstream of PASs is the most important region for the accurate prediction of PASs.
机译:准确预测聚(a)信号(通过)是用于高质量基因组注释和转录调控机制调查的生物信息学中的重要主题之一。在这项研究中,我们确定了使用机器学习技术计算预测通过的DNA序列的强大物理化学性质。在此特征的基础上,我们通过从周围的区域的区域捕获特定位置信息来构建PAS预测模型。交叉验证结果表明,12类人通行证的构建模型的预测准确性与最近公布的PAS预测龙多亚观察器的预测准确性相当。进一步的分析显示,通过下游的区域25核苷酸是准确预测通过的最重要区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号