Robust Bloom Filters for Large Multilabel Classification Tasks

机译：适用于大型多限制任务的强大绽放过滤器

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents an approach to multilabel classification (MLC) with a large number of labels. Our approach is a reduction to binary classification in which label sets are represented by low dimensional binary vectors. This representation follows the principle of Bloom filters, a space-efficient data structure originally designed for approximate membership testing. We show that a naive application of Bloom filters in MLC is not robust to individual binary classifiers' errors. We then present an approach that exploits a specific feature of real-world datasets when the number of labels is large: many labels (almost) never appear together. Our approach is provably robust, has sublinear training and inference complexity with respect to the number of labels, and compares favorably to state-of-the-art algorithms on two large scale multilabel datasets.

机译：本文介绍了具有大量标签的多书分类（MLC）的方法。我们的方法是减少到二进制分类，其中标签集由低维二进制向量表示。此表示遵循盛开过滤器的原理，最初设计用于占隶属测试的空间高效的数据结构。我们表明MLC中的绽放过滤器的天真应用对单个二进制分类器的错误并不稳健。然后，当标签数量大：许多标签（几乎）从未出现在一起时，我们将利用现实世界数据集的特定特征的方法。我们的方法是可怕的，具有级数培训和推理复杂性的标签数量，并对两个大规模多标签数据集上的最先进的算法进行比较。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2013年||共9页
会议地点
作者
Moustapha Cisse; Nicolas Usunier; Thierry Artieres; Patrick Gallinari;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. The robustness of majority voting compared to filtering misclassified instances in supervised classification tasks [J] . Smith Michael R., Martinez Tony Artificial Intelligence Review: An International Science and Engineering Journal . 2018,第1期

机译：与过滤监督分类任务中的错误分类实例相比，多数投票的鲁棒性
2. Label Filters for Large Scale Multilabel Classification [J] . Alexandru Niculescu-Mizil, Ehsan Abbasnejad JMLR: Workshop and Conference Proceedings . 2017,第1期

机译：大规模多标签分类的标签过滤器
3. Label Filters for Large Scale Multilabel Classification [J] . Alexandru Niculescu-Mizil, Ehsan Abbasnejad JMLR: Workshop and Conference Proceedings . 2017,第2009期

机译：大规模多标签分类的标签过滤器
4. Robust Bloom Filters for Large Multilabel Classification Tasks [C] . Moustapha Cisse, Nicolas Usunier, Thierry Artieres, Annual conference on Neural Information Processing Systems . 2013

机译：适用于大型多标签分类任务的鲁棒Bloom过滤器
5. Investigating Noise Robustness of Convolutional Neural Networks for Image Classification Using Gabor Filters [D] . Jeong, Sangwon. 2020

机译：使用Gabor过滤器调查卷积神经网络的噪声稳健性
6. PNAS Plus: Mismatch-tolerant alignment-free sequence classification using multiple spaced seeds and multiindex Bloom filters [O] . Justin Chu, Hamid Mohamadi, Emre Erhan, 2020

机译：PNA加：不匹配使用多个间隔种子和多向盛开滤波器的不匹配序列分类
7. Cuckoo Filters and Bloom Filters: Comparison and Application to Packet Classification [O] . Pedro Reviriego, Jorge Martinez, David Larrabeiti, 2020

机译：Cuckoo滤镜和绽放过滤器：对数据包分类的比较和应用程序

Robust Bloom Filters for Large Multilabel Classification Tasks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅