...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >A max-margin training of RNA secondary structure prediction integrated with the thermodynamic model
【24h】

A max-margin training of RNA secondary structure prediction integrated with the thermodynamic model

机译:与热力学模型相结合的RNA二级结构预测的最大边缘训练

获取原文
获取原文并翻译 | 示例

摘要

A popular approach for predicting RNA secondary structure is the thermodynamic nearest neighbor model that finds a thermodynamically most stable secondary structure with the minimum free energy (MFE). For further improvement, an alternative approach that is based on machine learning techniques has been developed. The machine learning based approach can employ a fine-grained model that includes much richer feature representations with the ability to fit the training data. Although a machine learning based fine-grained model achieved extremely high performance in prediction accuracy, a possibility of the risk of overfitting for such model has been reported. Results: In this paper, we propose a novel algorithm for RNA secondary structure prediction that integrates the thermodynamic approach and the machine learning based weighted approach. Our fine-grained model combines the experimentally determined thermodynamic parameters with a large number of scoring parameters for detailed contexts of features that are trained by the structured support vector machine (SSVM) with the l(1) regularization to avoid overfitting. Our benchmark shows that our algorithm achieves the best prediction accuracy compared with existing methods, and heavy overfitting cannot be observed. Availability: The implementation of our algorithm is available at https://github.com/keio-bioinformatics/mxfold.
机译:用于预测RNA二级结构的流行方法是热力学最近邻模型,其发现具有最小自由能(MFE)的热力学上最稳定的二级结构。为了进一步改进,已经开发了一种基于机器学习技术的替代方法。基于机器的基于机器的方法可以采用细粒度模型,该模型包括具有适合训练数据的能力的更丰富的特征表示。尽管基于机器学习的细粒度模型以预测精度实现了极高的性能,但报道了这种模型的过度装备风险的可能性。结果:在本文中,我们提出了一种用于RNA二级结构预测的新算法,其集成了热力学方法和基于机器的加权方法。我们的细粒度模型将实验确定的热力学参数与大量评分参数结合起来,用于使用由L(1)正则化的结构化支持向量机(SSVM)训练的特征的详细上下文,以避免过度拟合。我们的基准显示,与现有方法相比,我们的算法实现了最佳预测精度,并且无法观察到沉重的过度装备。可用性:我们的算法的实现是在https://github.com/keio-bioinformatics/mxfold中获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号