首页> 美国卫生研究院文献>Scientific Reports >Improving classification of mature microRNA by solving class imbalance problem
【2h】

Improving classification of mature microRNA by solving class imbalance problem

机译:解决类不平衡问题提高成熟microRNA的分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

MicroRNAs (miRNAs) are ~20–25 nucleotides non-coding RNAs, which regulated gene expression in the post-transcriptional level. The accurate rate of identifying the start sit of mature miRNA from a given pre-miRNA remains lower. It is noting that the mature miRNA prediction is a class-imbalanced problem which also leads to the unsatisfactory performance of these methods. We improved the prediction accuracy of classifier using balanced datasets and presented MatFind which is used for identifying 5′ mature miRNAs candidates from their pre-miRNA based on ensemble SVM classifiers with idea of adaboost. Firstly, the balanced-dataset was extract based on K-nearest neighbor algorithm. Secondly, the multiple SVM classifiers were trained in orderly using the balance datasets base on represented features. At last, all SVM classifiers were combined together to form the ensemble classifier. Our results on independent testing dataset show that the proposed method is more efficient than one without treating class imbalance problem. Moreover, MatFind achieves much higher classification accuracy than other three approaches. The ensemble SVM classifiers and balanced-datasets can solve the class-imbalanced problem, as well as improve performance of classifier for mature miRNA identification. MatFind is an accurate and fast method for 5′ mature miRNA identification.
机译:MicroRNA(miRNA)是〜20–25个核苷酸的非编码RNA,可在转录后水平调控基因表达。从给定的pre-miRNA鉴定成熟miRNA起始位置的准确率仍然较低。值得注意的是,成熟的miRNA预测是类别不平衡的问题,这也导致这些方法的性能无法令人满意。我们使用平衡的数据集提高了分类器的预测准确性,并提出了MatFind,该方法用于基于adaboost的整体SVM分类器从其前miRNA识别5'成熟miRNA候选物。首先,基于K最近邻算法提取平衡数据集。其次,使用基于表示特征的余额数据集有序地训练了多个SVM分类器。最后,将所有SVM分类器组合在一起形成整体分类器。我们在独立测试数据集上的结果表明,所提出的方法比不处理类不平衡问题的方法更为有效。此外,与其他三种方法相比,MatFind实现了更高的分类精度。集成的SVM分类器和平衡数据集可以解决类不平衡问题,并提高分类器用于成熟miRNA识别的性能。 MatFind是一种准确,快速的5'成熟miRNA鉴定方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号