首页> 美国卫生研究院文献>International Journal of Biological Sciences >Research on folding diversity in statistical learning methods for RNA secondary structure prediction
【2h】

Research on folding diversity in statistical learning methods for RNA secondary structure prediction

机译:统计学习方法中折叠多样性的RNA二级结构预测研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between folding diversity and prediction accuracy, and puts forward a new method to improve the prediction accuracy of RNA secondary structure. Our research investigates the following: 1. The folding feature based on stochastic context-free grammar is proposed. By using dimension reduction and clustering techniques, some public data sets are analyzed. The results show that there is significant folding diversity among different RNA families. 2. To assign folding rules to RNAs without structural information, a classification method based on production probability is proposed. The experimental results show that the classification method proposed in this paper can effectively classify the RNAs of unknown structure. 3. Based on the existing prediction methods of statistical learning models, an RNA secondary structure prediction framework is proposed, namely “Cluster - Training - Parameter Selection - Prediction”. The results show that, with information on folding diversity, prediction accuracy can be significantly improved.
机译:如何提高RNA二级结构的预测准确性是当前的热门话题。现有的针对单个序列的预测方法并未完全考虑可能在具有不同功能或来源的RNA之间发生的折叠多样性。探讨了折叠多样性与预测精度之间的关系,提出了提高RNA二级结构预测精度的新方法。我们的研究主要包括以下几个方面:1.提出了基于随机上下文无关文法的折叠特征。通过使用降维和聚类技术,分析了一些公共数据集。结果表明,不同RNA家族之间存在显着的折叠多样性。 2.为了给没有结构信息的RNA分配折叠规则,提出了一种基于生产概率的分类方法。实验结果表明,本文提出的分类方法可以有效地对未知结构的RNA进行分类。 3.基于统计学习模型的现有预测方法,提出了RNA二级结构预测框架,即“聚类-训练-参数选择-预测”。结果表明,利用折叠多样性信息,可以显着提高预测准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号