AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification

Yukihiro Tagami

首页> 外文期刊>SIGKDD explorations >AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification

【24h】

AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification

机译：附件：近似最近的邻权搜索极端多标签分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Extreme multi-label classification methods have been widely used in Web-scale classification tasks such as Web page tagging and product recommendation. In this paper, we present a novel graph embedding method called "AnnexML". At the training step, AnnexML constructs a k-nearest neighbor graph of label vectors and attempts to reproduce the graph structure in the embedding space. The prediction is efficiently performed by using an approximate nearest neighbor search method that efficiently explores the learned k-nearest neighbor graph in the embedding space. We conducted evaluations on several large-scale real-world data sets and compared our method with recent state-of-the-art methods. Experimental results show that our AnnexML can significantly improve prediction accuracy, especially on data sets that have larger a label space. In addition, AnnexML improves the trade-off between prediction time and accuracy. At the same level of accuracy, the prediction time of AnnexML was up to 58 times faster than that of SLEEC, which is a state-of-the-art embedding-based method.

机译：极端的多标签分类方法已广泛应用于Web级分类任务，例如网页标记和产品推荐。在本文中，我们介绍了一种名为“AnnexML”的新型植物嵌入方法。在训练步骤中，Annexml构造了标签向量的K-Collect邻图，并尝试重现嵌入空间中的图形结构。通过使用近似最近的邻居搜索方法有效地执行预测，其有效地探索嵌入空间中的学习k最近邻图。我们对几个大型现实世界数据集进行了评估，并将我们的方法与最近的最先进的方法进行了比较。实验结果表明，我们的附件可以显着提高预测准确性，特别是在具有较大标签空间的数据集上。此外，AnnexML改善了预测时间和准确性之间的权衡。在相同的准确度，附件的预测时间速度快于SELEC的速度快58倍，这是一种基于最先进的嵌入的方法。

著录项

来源
《SIGKDD explorations》 |2017年第cdarom期|共10页
作者
Yukihiro Tagami;
展开▼
作者单位

Yahoo Japan Corporation Department of Intelligence Science and Technology Kyoto University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 TP274.2;
关键词
Extreme multi-label classification; K-nearest neighbor graph; Approximate nearest neighbor search; Learning-to-rank;

机译：极端多标签分类;k-最近邻图;近似最近邻的搜索;学习 - 排名;

相似文献

外文文献
中文文献
专利

1. AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification [J] . Yukihiro Tagami SIGKDD explorations . 2017,第CDaROM期

机译：附件：近似最近的邻权搜索极端多标签分类
2. Recursive Nearest Neighbor Graph Partitioning for Extreme Multi-Label Learning [J] . Yukihiro TAGAMI IEICE transactions on information and systems . 2019,第3期

机译：用于极端多标签学习的递归最近邻图分区
3. A novel multi-label classification algorithm based on K-nearest neighbor and random walk [J] . Zhen-Wu Wang, Si-Kai Wang, Ben-Ting Wan, International Journal of Distributed Sensor Networks . 2020,第3期

机译：一种基于K-Colly Exbank和随机步行的新型多标签分类算法
4. Fast Approximate Nearest Neighbor Search via k-Diverse Nearest Neighbor Graph [C] . Yan Xiao, Jiafeng Guo, Yanyan Lan, AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：快速近似邻近邻近邻近邻居搜索
5. Unsupervised Binary Code Learning for Approximate Nearest Neighbor Search in Large-scale Datasets. [D] . Zhang, Hao. 2016

机译：大规模数据集中近似邻居搜索的无监督二进制代码学习。
6. Approximate Nearest Neighbor Search by Residual Vector Quantization [O] . Yongjian Chen, Tao Guan, Cheng Wang 2010

机译：残差矢量量化的近似最近邻搜索
7. Speeding up Extreme Multi-Label Classifier by Approximate Nearest Neighbor Search [O] . Yukihiro TAGAMI 2018

机译：通过近似邻近搜索加速极端多标签分类器

AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification

摘要

著录项

相似文献

相关主题

期刊订阅