首页> 外文会议>Asian conference on intelligent information and database systems >Hybrid One-Class Ensemble for High-Dimensional Data Classification
【24h】

Hybrid One-Class Ensemble for High-Dimensional Data Classification

机译:用于高维数据分类的混合一类集合

获取原文

摘要

The advance of high-throughput techniques, such as gene microarrays and protein chips have a major impact on contemporary biology and medicine. Due to the high-dimensionality and complexity of the data, it is impossible to analyze it manually. Therefore machine learning techniques play an important role in dealing with such data. In this paper we propose to use a one-class approach to classifying microar-rays. Unlike canonical classifiers, these models rely only on objects coming from single class distributions. They distinguish observations coming from the given class from any other possible states of the object, that were unseen during the classification step. While having less information to dichotomize between classes, one-class models can easily learn the specific properties of a given dataset and are robust to difficulties embedded in the nature of the data. We show, that using one-class ensembles can give as good results as canonical multi-class classifiers, while allowing to deal with imbalanced distribution and unexpected noise in the data. To cope with high dimensionality of the feature space, we propose a novel hybrid one-class ensemble utilizing combination of weighted Bagging and Random Subspaces. Experimental investigations, carried on public datasets, prove the usefulness of the proposed approach.
机译:基因芯片和蛋白质芯片等高通量技术的发展对当代生物学和医学产生了重大影响。由于数据的高维度和复杂性,因此无法进行手动分析。因此,机器学习技术在处理此类数据中起着重要作用。在本文中,我们建议使用一类方法对微射线进行分类。与规范分类器不同,这些模型仅依赖于来自单个类分布的对象。它们将来自给定类的观察结果与对象的其他任何可能状态(在分类步骤中未发现)区分开来。一类模型虽然没有太多信息可以在类之间进行二分法划分,但可以轻松地学习给定数据集的特定属性,并且对于嵌入数据本质中的困难具有较强的鲁棒性。我们证明,使用一类合奏可以提供与规范的多类分类器一样好的结果,同时允许处理数据中的不平衡分布和意外噪声。为了应对特征空间的高维性,我们提出了一种新颖的混合一类集成方法,它利用加权Bagging和随机子空间的组合。在公共数据集上进行的实验研究证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号