首页> 外文期刊>Combinatorial chemistry & high throughput screening >Controlling feature selection in random forests of decision trees using a genetic algorithm: classification of class I MHC peptides.
【24h】

Controlling feature selection in random forests of decision trees using a genetic algorithm: classification of class I MHC peptides.

机译:使用遗传算法控制决策树随机森林中的特征选择:I类MHC肽分类。

获取原文
获取原文并翻译 | 示例
           

摘要

Feature selection is an important challenge in many classification problems, especially if the number of features greatly exceeds the number of examples available. We have developed a procedure--GenForest--which controls feature selection in random forests of decision trees by using a genetic algorithm. This approach was tested through our entry into the Comparative Evaluation of Prediction Algorithms 2006 (CoEPrA) competition (accessible online at: http://www.coepra.org). CoEPrA was a modeling competition organized to provide an objective testing for various classification and regression algorithms via the process of blind prediction. In the competition GenForest ranked 10/23, 5/16 and 9/16 on CoEPrA classification problems 1, 3 and 4, respectively, which involved the classification of type I MHC nonapeptides i.e. peptides containing nine amino acids. These problems each involved the classification of different sets of nonapeptides. Associated with each amino acid was a set of 643 features for a total of 5787 features per peptide. The method, its application to the CoEPrA datasets, and its performance in the competition are described.
机译:在许多分类问题中,特征选择是一项重要的挑战,特别是如果特征的数量大大超过可用示例的数量时。我们已经开发了一个过程-GenForest-通过使用遗传算法来控制决策树的随机森林中的特征选择。我们通过参加2006年预测算法比较评估(CoEPrA)竞赛对这种方法进行了测试(可在线访问:http://www.coepra.org)。 CoEPrA是一场建模竞赛,旨在通过盲目预测过程为各种分类和回归算法提供客观的测试。在竞赛中,GenForest在CoEPrA分类问题1、3和4上分别排名10 / 23、5 / 16和9/16,涉及I型MHC九肽,即包含九个氨基酸的肽的分类。这些问题均涉及不同组九肽的分类。与每个氨基酸相关的是一组643个特征,每个肽共有5787个特征。描述了该方法,其在CoEPrA数据集中的应用以及其在比赛中的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号