首页> 外文会议>2011 11th International Conference on Intelligent Systems Design and Applications >Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks
【24h】

Hierarchical multi-label classification for protein function prediction: A local approach based on neural networks

机译:蛋白质功能预测的分层多标签分类:基于神经网络的局部方法

获取原文

摘要

In Hierarchical Multi-Label Classification problems, each instance can be classified into two or more classes simultaneously, differently from conventional classification. Additionally, the classes are structured in a hierarchy, in the form of either a tree or a directed acyclic graph. Hence, an instance can be assigned to two or more paths from the hierarchical structure, resulting in a complex classification problem with possibly hundreds of classes. Many methods have been proposed to deal with such problems, some of them employing a single classifier to deal with all classes simultaneously (global methods), and others employing many classifiers to decompose the original problem into a set of subproblems (local methods). In this work, we propose a novel local method named HMC-LMLP, which uses one Multi-Layer Perceptron per hierarchical level. The predictions in one level are used as inputs to the network responsible for the predictions in the next level. We make use of two distinct Multi-Layer Perceptron algorithms: Back-propagation and Resilient Back-propagation. In addition, we make use of an error measure specially tailored to multi-label problems for training the networks. Our method is compared to state-of-the-art hierarchical multi-label classification algorithms, in protein function prediction datasets. The experimental results show that our approach presents competitive predictive accuracy, suggesting that artificial neural networks constitute a promising alternative to deal with hierarchical multi-label classification of biological data.
机译:在分层多标签分类问题中,可以将每个实例同时分类为两个或更多类,这与常规分类不同。此外,这些类以树或有向无环图的形式按层次结构进行构造。因此,可以将一个实例分配给该层次结构中的两个或更多路径,从而导致复杂的分类问题,可能包含数百个类。已经提出了许多方法来处理此类问题,其中一些方法使用单个分类器同时处理所有类(全局方法),而另一些方法则使用许多分类器将原始问题分解为一组子问题(局部方法)。在这项工作中,我们提出了一种名为HMC-LMLP的新颖的本地方法,该方法在每个层次级别使用一个多层感知器。上一层的预测被用作负责下一层预测的网络的输入。我们利用两种截然不同的多层感知器算法:反向传播和弹性反向传播。另外,我们利用专门针对多标签问题的错误度量来训练网络。在蛋白质功能预测数据集中,将我们的方法与最新的分层多标签分类算法进行了比较。实验结果表明,我们的方法具有竞争性的预测准确性,表明人工神经网络构成了应对生物数据的分层多标签分类的有前途的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号