首页> 外文会议>European conference on machine learning and knowledge discovery in databases >Hub Co-occurrence Modeling for Robust High-Dimensional kNN Classification
【24h】

Hub Co-occurrence Modeling for Robust High-Dimensional kNN Classification

机译:鲁棒高维kNN分类的集线器共现建模

获取原文

摘要

The emergence of hubs in k-nearest neighbor (kNN) topologies of intrinsically high dimensional data has recently been shown to be quite detrimental to many standard machine learning tasks, including classification. Robust hubness-aware learning methods are required in order to overcome the impact of the highly uneven distribution of influence. In this paper, we have adapted the Hidden Naive Bayes (HNB) model to the problem of modeling neighbor occurrences and co-occurrences in high-dimensional data. Hidden nodes are used to aggregate all pairwise occurrence dependencies. The result is a novel kNN classification method tailored specifically for intrinsically high-dimensional data, the Augmented Naive Hubness Bayesian k nearest Neighbor (ANHBNN). Neighbor co-occurrence information forms an important part of the model and our analysis reveals some surprising results regarding the influence of hubness on the shape of the co-occurrence distribution in high-dimensional data. The proposed approach was tested in the context of object recognition from images in class imbalanced data and the results show that it offers clear benefits when compared to the other hubness-aware kNN baselines.
机译:最近显示,本征高维数据的k最近邻(kNN)拓扑中的集线器的出现对许多标准的机器学习任务(包括分类)非常不利。为了克服影响分布高度不均的影响,需要鲁棒的了解中心的学习方法。在本文中,我们将隐藏的朴素贝叶斯(HNB)模型改编为对高维数据中的邻居出现和共现进行建模的问题。隐藏节点用于汇总所有成对出现的依赖关系。结果是一种专门针对固有高维数据量身定制的新颖kNN分类方法,即增强朴素贝叶斯贝叶斯k最近邻(ANHBNN)。邻居共现信息构成该模型的重要组成部分,我们的分析揭示了一些有关中枢对高维数据中共现分布形状的影响的令人惊讶的结果。在从类不平衡数据中的图像进行对象识别的背景下,对所提出的方法进行了测试,结果表明,与其他感知枢纽的kNN基线相比,该方法具有明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号