首页> 外文期刊>Intelligent data analysis >A novel density-based ensemble learning algorithm with application to protein structural classification
【24h】

A novel density-based ensemble learning algorithm with application to protein structural classification

机译:一种新的基于密度的集成学习算法及其在蛋白质结构分类中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Ensemble learning is an effective technique in classifying high-dimensional data such as bioinformatics sequences since it combines some learning models to improve the overall prediction accuracy. The key point in success of an ensemble algorithm is to build a set of diverse classifiers. In this regard, a novel density-based lazy stacking algorithm, called DBLS, is proposed in this paper. It takes the advantages of both lazy learning, in finding local optimal solutions, and the stacking method, in achieving classifier diversity, to obtain better performance while keeping the complexity intact. DBLS uses a stacking framework with lazy local learners based on density for building an ensemble of classifiers to predict the structural classes of proteins. To evaluate the performance of DBLS, it is compared against four rival classification methods. For this purpose, some real-world UCI datasets beside to three benchmark protein datasets are used in the experiments. The experimental results confirmed that DBLS significantly (with 95% confidence) outperforms other methods in terms of classification accuracy; over 3% advantage in absolute accuracy.
机译:集成学习是一种对高维数据(例如生物信息学序列)进行分类的有效技术,因为它结合了一些学习模型以提高总体预测准确性。集成算法成功的关键是建立一组多样化的分类器。在这方面,本文提出了一种新的基于密度的惰性堆叠算法,称为DBLS。它既利用了惰性学习的优势,找到了局部最优解,又利用了堆叠方法,实现了分类器的多样性,从而在保持复杂性不变的同时获得了更好的性能。 DBLS使用基于密度的具有惰性本地学习器的堆栈框架来构建分类器的整体,以预测蛋白质的结构类别。为了评估DBLS的性能,将其与四种竞争对手的分类方法进行了比较。为此,在实验中使用了除三个基准蛋白质数据集外的一些实际UCI数据集。实验结果证实,在分类准确性方面,DBLS显着(置信度为95%)优于其他方法。绝对精度超过3%的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号