首页> 美国卫生研究院文献>PLoS Clinical Trials >Integrated Strategy Improves the Prediction Accuracy of miRNA in Large Dataset
【2h】

Integrated Strategy Improves the Prediction Accuracy of miRNA in Large Dataset

机译:集成策略可提高大数据集中miRNA的预测准确性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

MiRNAs are short non-coding RNAs of about 22 nucleotides, which play critical roles in gene expression regulation. The biogenesis of miRNAs is largely determined by the sequence and structural features of their parental RNA molecules. Based on these features, multiple computational tools have been developed to predict if RNA transcripts contain miRNAs or not. Although being very successful, these predictors started to face multiple challenges in recent years. Many predictors were optimized using datasets of hundreds of miRNA samples. The sizes of these datasets are much smaller than the number of known miRNAs. Consequently, the prediction accuracy of these predictors in large dataset becomes unknown and needs to be re-tested. In addition, many predictors were optimized for either high sensitivity or high specificity. These optimization strategies may bring in serious limitations in applications. Moreover, to meet continuously raised expectations on these computational tools, improving the prediction accuracy becomes extremely important. In this study, a meta-predictor mirMeta was developed by integrating a set of non-linear transformations with meta-strategy. More specifically, the outputs of five individual predictors were first preprocessed using non-linear transformations, and then fed into an artificial neural network to make the meta-prediction. The prediction accuracy of meta-predictor was validated using both multi-fold cross-validation and independent dataset. The final accuracy of meta-predictor in newly-designed large dataset is improved by 7% to 93%. The meta-predictor is also proved to be less dependent on datasets, as well as has refined balance between sensitivity and specificity. This study has two folds of importance: First, it shows that the combination of non-linear transformations and artificial neural networks improves the prediction accuracy of individual predictors. Second, a new miRNA predictor with significantly improved prediction accuracy is developed for the community for identifying novel miRNAs and the complete set of miRNAs. Source code is available at:
机译:MiRNA是约22个核苷酸的短非编码RNA,在基因表达调控中起关键作用。 miRNA的生物发生很大程度上取决于其亲本RNA分子的序列和结构特征。基于这些功能,开发了多种计算工具来预测RNA转录本是否包含miRNA。尽管非常成功,但这些预测变量近年来已开始面临多重挑战。使用数百个miRNA样本的数据集优化了许多预测因子。这些数据集的大小比已知miRNA的数目小得多。因此,这些预测变量在大型数据集中的预测准确性变得未知,需要重新测试。另外,许多预测因子针对高灵敏度或高特异性进行了优化。这些优化策略可能会对应用程序造成严重限制。而且,为了满足对这些计算工具的不断提高的期望,提高预测精度变得极为重要。在这项研究中,通过将一组非线性转换与元策略集成在一起,开发了元预测器mirMeta。更具体地说,首先使用非线性变换对五个预测变量的输出进行预处理,然后将其输入到人工神经网络中以进行元预测。使用多重交叉验证和独立数据集验证了元预测器的预测准确性。新设计的大型数据集中的元预测器的最终准确性提高了7%至93%。还证明了元预测变量对数据集的依赖性较小,并且在敏感性和特异性之间具有精确的平衡。这项研究具有两个方面的重要性:首先,它表明非线性变换和人工神经网络的组合提高了单个预测变量的预测准确性。其次,为社区开发了一种新的具有显着提高的预测准确性的miRNA预测因子,用于鉴定新型miRNA和整套miRNA。 可从以下位置获取源代码

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号