Hub Co-occurrence Modeling for Robust High-Dimensional kNN Classification

机译：鲁棒高维kNN分类的集线器共现建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The emergence of hubs in k-nearest neighbor (kNN) topologies of intrinsically high dimensional data has recently been shown to be quite detrimental to many standard machine learning tasks, including classification. Robust hubness-aware learning methods are required in order to overcome the impact of the highly uneven distribution of influence. In this paper, we have adapted the Hidden Naive Bayes (HNB) model to the problem of modeling neighbor occurrences and co-occurrences in high-dimensional data. Hidden nodes are used to aggregate all pairwise occurrence dependencies. The result is a novel kNN classification method tailored specifically for intrinsically high-dimensional data, the Augmented Naive Hubness Bayesian k nearest Neighbor (ANHBNN). Neighbor co-occurrence information forms an important part of the model and our analysis reveals some surprising results regarding the influence of hubness on the shape of the co-occurrence distribution in high-dimensional data. The proposed approach was tested in the context of object recognition from images in class imbalanced data and the results show that it offers clear benefits when compared to the other hubness-aware kNN baselines.

机译：最近显示，本征高维数据的k最近邻（kNN）拓扑中的集线器的出现对许多标准的机器学习任务（包括分类）非常不利。为了克服影响分布高度不均的影响，需要鲁棒的了解中心的学习方法。在本文中，我们将隐藏的朴素贝叶斯（HNB）模型改编为对高维数据中的邻居出现和共现进行建模的问题。隐藏节点用于汇总所有成对出现的依赖关系。结果是一种专门针对固有高维数据量身定制的新颖kNN分类方法，即增强朴素贝叶斯贝叶斯k最近邻（ANHBNN）。邻居共现信息构成该模型的重要组成部分，我们的分析揭示了一些有关中枢对高维数据中共现分布形状的影响的令人惊讶的结果。在从类不平衡数据中的图像进行对象识别的背景下，对所提出的方法进行了测试，结果表明，与其他感知枢纽的kNN基线相比，该方法具有明显的优势。

著录项

来源
《European conference on machine learning and knowledge discovery in databases》|2013年|643-659|共17页
会议地点
作者
Nenad Tomasev; Dunja Mladenic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
hubs; k-nearest neighbor; classification; curse of dimensionality; Bayesian; co-occurrences;

机译：集线器k近邻;分类;维度诅咒贝叶斯共现;

相似文献

外文文献
中文文献
专利

1. Hubness-aware kNN classification of high-dimensional data in presence of label noise [J] . Tomasev Nenad, Buza Krisztian Neurocomputing . 2015,第jula21期

机译：存在标签噪声的高维数据的基于感知度的kNN分类
2. HIBoost: A hubness-aware ensemble learning algorithm for high-dimensional imbalanced data classification [J] . Wu Qin, Lin Yaping, Zhu Tuanfei, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第1期

机译：Hiboost：高维不平衡数据分类的载体感知合奏学习算法
3. Hubness-based fuzzy measures for high-dimensional k-nearest neighbor classification [J] . Nenad Tomasev, Milos Radovanovic, Dunja Mladenic, International journal of machine learning and cybernetics . 2014,第3期

机译：高维k最近邻分类的基于中心度的模糊测度
4. Hub Co-occurrence Modeling for Robust High-Dimensional kNN Classification [C] . Nenad Tomasev, Dunja Mladenic European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases . 2013

机译：集线器共同发生建模，用于稳健的高维knn分类
5. Joint-Stochastic Spectral Inference for Robust Co-Occurrence Modeling and Latent Topic Analysis [D] . Lee, Moontae. 2018

机译：鲁棒共同发生建模和潜在谱分析的关节随机谱推断
6. Robust Distance Measures for kNN Classification of Cancer Data [O] . Rezvan Ehsani, Finn Drabløs 2020

机译：癌症数据的核武器分类的鲁棒距离措施
7. Hub Co-occurrence Modeling for Robust High-dimensional kNN Classification [O] . 2015

机译：鲁棒高维kNN分类的Hub共现模型

Hub Co-occurrence Modeling for Robust High-Dimensional kNN Classification

摘要

著录项

相似文献

相关主题

期刊订阅