首页> 外文期刊>Journal of Bioinformatics and Computational Biology >PVsiRNAPred: Prediction of plant exclusive virus-derived small interfering RNAs by deep convolutional neural network
【24h】

PVsiRNAPred: Prediction of plant exclusive virus-derived small interfering RNAs by deep convolutional neural network

机译:PVSIRNAPRED:深度卷积神经网络预测植物专用病毒衍生的小干扰RNA

获取原文
获取原文并翻译 | 示例
           

摘要

Plant exclusive virus-derived small interfering RNAs (vsiRNAs) regulate various biological processes, especially important in antiviral immunity. The identification of plant vsiRNAs is important for understanding the biogenesis and function mechanisms of vsiRNAs and further developing anti-viral plants. In this study, we extracted plant vsiRNA sequences from the PVsiRNAdb database. We then utilized deep convolutional neural network (CNN) to develop a deep learning algorithm for predicting plant vsiRNAs based on vsiRNA sequence composition, known as PVsiRNAPred. The key part of PVsiRNAPred is the CNN module, which automatically learns hierarchical representations of vsiRNA sequences related to vsiRNA profiles in plants. When evaluated using an independent testing dataset, the accuracy of the model was 65.70%, which was higher than those of five conventional machine learning method-based classifiers. In addition, PVsiRNAPred obtained a sensitivity of 67.11%, specificity of 64.26% and Matthews correlation coefficient (MCC) of 0.31, and the area under the receiver operating characteristic (ROC) curve (AUC) of PVsiRNAPred was 0.71 in the independent test. The permutation test with 1000 shuffles resulted in a p value of < 0.001. The above results reveal that PVsiRNAPred has favorable generalization capabilities. We hope PVsiRNAPred, the first bioinformatics algorithm for predicting plant vsiRNAs, will allow efficient discovery of new vsiRNAs.
机译:植物独家病毒衍生的小干扰RNA(Vsirnas)调节各种生物过程,在抗病毒免疫中尤其重要。植物Vsirnas的鉴定对于了解Vsirnas的生物发生和功能机制以及进一步发展抗病毒植物是重要的。在本研究中,我们从PVSIRNADB数据库中提取了植物vsiRNA序列。然后,我们利用了深度卷积神经网络(CNN)来发展基于vsiRNA序列组合物的预测植物vsirnas的深度学习算法,称为pvsirnapred。 PVSIRNAPRED的关键部分是CNN模块,其自动学习与植物中与vsiRNA型材相关的vsiRNA序列的分层表示。当使用独立的测试数据集进行评估时,该模型的准确性为65.70%,高于基于五种基于机器学习方法的分类器的精度。此外,PVSIRNAPRED获得的灵敏度为67.11%,特异性64.26%,马太福德相关系数(MCC)为0.31,以及PVSIRNAPRED的接收器操作特征(ROC)曲线(AUC)的面积在独立测试中为0.71。具有1000个洗涤器的排列测试导致p值<0.001。上述结果表明,PVSIRNAPRED具有良好的泛化能力。我们希望PVSIRNAPRED是预测工厂VSIRNA的第一种生物信息学算法,将允许有效地发现新的VSIRNA。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号