【24h】

Equivalence Learning in Protein Classification

机译:蛋白质分类中的等价学习

获取原文
获取原文并翻译 | 示例

摘要

We present a method, called equivalence learning, which applies a two-class classification approach to object-pairs defined within a multi-class scenario. The underlying idea is that instead of classifying objects into their respective classes, we classify object pairs either as equivalent (belonging to the same class) or non-equivalent (belonging to different classes). The method is based on a vectorisation of the similarity between the objects and the application of a machine learning algorithm (SVM, ANN, LogReg, Random Forests) to learn the differences between equivalent and non-equivalent object pairs, and define a unique kernel function that can be obtained via equivalence learning. Using a small dataset of archaeal, bacterial and eukaryotic 3-phosphoglycerate-kinase sequences we found that the classification performance of equivalence learning slightly exceeds those of several simple machine learning algorithms at the price of a minimal increase in time and space requirements.
机译:我们提出一种称为对等学习的方法,该方法将两类分类方法应用于在多类场景中定义的对象对。基本思想是,不是将对象分类为各自的类,而是将对象对分类为等效的(属于同一类)或非等效的(属于不同的类)。该方法基于对象之间相似度的矢量化和机器学习算法(SVM,ANN,LogReg,随机森林)的应用,以了解等效对象对和非等效对象对之间的差异,并定义唯一的内核函数可以通过对等学习获得。使用古细菌,细菌和真核生物3-磷酸甘油酸激酶序列的小型数据集,我们发现等价学习的分类性能略微超过了几种简单的机器学习算法,但其代价是时间和空间需求的增加最小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号