首页> 外文会议>IEEE Central American and Panama Convention >Classification of peptides using ensembles by applying different strategies that deal with imbalanced data and combination rules
【24h】

Classification of peptides using ensembles by applying different strategies that deal with imbalanced data and combination rules

机译:通过应用处理不平衡数据和组合规则的不同策略,使用集成体对肽进行分类

获取原文
获取外文期刊封面目录资料

摘要

The discovery and synthesis of peptides with antimicrobial properties is a promising alternative to fight against multi-resistant bacteria. There are multiple studies that deal with the classification of peptides according with their probability to possess antimicrobial activity. One of the challenges in these classification processes is related with the amount of available data. For the case of antibacterial peptides classifiers, the size of the positive class is much bigger than the negative class. In this work, we propose two strategies to deal with the imbalance situation of the data by using ensembles. The first one is based on algorithm modifications and the second one with data management. For each strategy we used five combination rules. The performance of the ensembles was calculated using the area under the ROC curve (AUC). Our results suggest that care must be taken with ensembles and that individual classifiers must be studied in-depth.
机译:具有抗菌特性的肽的发现和合成是对抗多抗细菌的一种有前途的替代方法。有许多研究根据肽具有抗菌活性的可能性来对肽进行分类。这些分类过程中的挑战之一与可用数据量有关。对于抗菌肽分类器,阳性类别的大小比阴性类别大得多。在这项工作中,我们提出了两种策略来通过使用集成来处理数据的不平衡情况。第一个基于算法修改,第二个基于数据管理。对于每种策略,我们使用了五个组合规则。使用ROC曲线(AUC)下的面积计算合奏的性能。我们的结果表明,必须谨慎对待合奏,并且必须对单个分类器进行深入研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号