...
首页> 外文期刊>Memetic Computing >A hierarchical multi-label classification ant colony algorithm for protein function prediction
【24h】

A hierarchical multi-label classification ant colony algorithm for protein function prediction

机译:蛋白质功能预测的分层多标签分类蚁群算法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel ant colony optimisation (ACO) algorithm tailored for the hierarchical multi-label classification problem of protein function prediction. This problem is a very active research field, given the large increase in the number of uncharacterised proteins available for analysis and the importance of determining their functions in order to improve the current biological knowledge. Since it is known that a protein can perform more than one function and many protein functional-definition schemes are organised in a hierarchical structure, the classification problem in this case is an instance of a hierarchical multi-label problem. In this type of problem, each example may belong to multiple class labels and class labels are organised in a hierarchical structure—either a tree or a directed acyclic graph structure. It presents a more complex problem than conventional flat classification, given that the classification algorithm has to take into account hierarchical relationships between class labels and be able to predict multiple class labels for the same example. The proposed ACO algorithm discovers an ordered list of hierarchical multi-label classification rules. It is evaluated on sixteen challenging bioinformatics data sets involving hundreds or thousands of class labels to be predicted and compared against state-of-the-art decision tree induction algorithms for hierarchical multi-label classification.
机译:本文针对蛋白质功能预测的分层多标签分类问题提出了一种新颖的蚁群优化算法。鉴于可用于分析的未表征蛋白质的数量大量增加,并且确定其功能以改善当前生物学知识的重要性,这一问题是一个非常活跃的研究领域。由于已知一种蛋白质可以执行多种功能,并且许多蛋白质功能定义方案以分层结构组织,因此在这种情况下,分类问题就是分层多标签问题的一个实例。在这种类型的问题中,每个示例都可能属于多个类别标签,并且类别标签以层次结构(树或有向无环图结构)进行组织。鉴于分类算法必须考虑类别标签之间的层次关系,并且能够预测同一示例的多个类别标签,因此它提出了比常规平面分类更为复杂的问题。提出的ACO算法发现了分层的多标签分类规则的有序列表。在涉及数百个或数千个类别标签的16个具有挑战性的生物信息学数据集上进行评估,并将其与用于分层多标签分类的最新决策树归纳算法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号