首页> 外文会议>European conference on machine learning and knowledge discovery in databases >Probabilistic Clustering for Hierarchical Multi-Label Classification of Protein Functions
【24h】

Probabilistic Clustering for Hierarchical Multi-Label Classification of Protein Functions

机译:蛋白质功能的分层多标签分类的概率聚类

获取原文

摘要

Hierarchical Multi-Label Classification is a complex classification problem where the classes are hierarchically structured. This task is very common in protein function prediction, where each protein can have more than one function, which in turn can have more than one sub-function. In this paper, we propose a novel hierarchical multi-label classification algorithm for protein function prediction, namely HMC-PC. It is based on probabilistic clustering, and it makes use of cluster membership probabilities in order to generate the predicted class vector. We perform an extensive empirical analysis in which we compare our new approach to four different hierarchical multi-label classification algorithms, in protein function datasets structured both as trees and directed acyclic graphs. We show that HMC-PC achieves superior or comparable results compared to the state-of-the-art method for hierarchical multi-label classification.
机译:分层多标签分类是一个复杂的分类问题,其中类是分层结构的。这项功能在蛋白质功能预测中非常常见,其中每种蛋白质可以具有多个功能,而蛋白质又可以具有多个子功能。在本文中,我们提出了一种新的用于蛋白质功能预测的分层多标签分类算法,即HMC-PC。它基于概率聚类,并且利用聚类成员资格概率来生成预测的类向量。我们进行了广泛的经验分析,在此过程中,我们将新方法与以树和有向无环图构成的蛋白质功能数据集中的四种不同的分层多标签分类算法进行了比较。我们显示,与用于分层多标签分类的最新方法相比,HMC-PC可获得更好或可比的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号