Multi-Label Punitive kNN with Self-Adjusting Memory for Drifting Data Streams

Roseberry Martha; Krawczyk Bartosz; Cano Alberto




Abstract

In multi-label learning, data may simultaneously belong to more than one class. When multi-label data arrives as a stream, the challenges associated with multi-label learning are joined by those of data stream mining, including the need for algorithms that are fast and flexible, able to match both the speed and evolving nature of the stream. This article presents a punitive k nearest neighbors algorithm with a self-adjusting memory (MLSAMPkNN) for multi-label, drifting data streams. The memory adjusts in size to contain only the current concept and a novel punitive system identifies and penalizes errant data examples early, removing them from the window. By retaining and using only data that are both current and beneficial, MLSAMPkNN is able to adapt quickly and efficiently to changes within the data stream while still maintaining a low computational complexity. Additionally, the punitive removal mechanism offers increased robustness to various data-level difficulties present in data streams, such as class imbalance and noise. The experimental study compares the proposal to 24 algorithms using 30 real-world and 15 artificial multi-label data streams on six multi-label metrics, evaluation time, and memory consumption. The superior performance of the proposed method is validated through non-parametric statistical analysis, proving both high accuracy and low time complexity. MLSAMPkNN is a versatile classifier, capable of returning excellent performance in diverse stream scenarios.
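The windowing and penalty ideas described in the abstract can be illustrated with a minimal sketch. This is not the authors' MLSAMPkNN implementation: the class name `PunitiveWindowKNN`, the parameters `k`, `max_window`, and `penalty_limit`, the majority-vote prediction rule, and the penalty rule are all assumptions made for illustration, and the self-adjusting memory is simplified to a fixed window cap.

```python
from math import dist


class PunitiveWindowKNN:
    """Sliding-window multi-label kNN with a simple penalty counter (illustrative)."""

    def __init__(self, k=3, max_window=200, penalty_limit=2):
        self.k = k
        self.max_window = max_window        # fixed cap; stand-in for self-adjusting memory
        self.penalty_limit = penalty_limit  # penalties at which an example is evicted
        self.window = []                    # entries: [features, label_set, penalties]

    def _neighbors(self, x):
        # The k stored examples closest to x (Euclidean distance).
        return sorted(self.window, key=lambda e: dist(e[0], x))[: self.k]

    def predict(self, x):
        neighbors = self._neighbors(x)
        votes = {}
        for _, labels, _ in neighbors:
            for label in labels:
                votes[label] = votes.get(label, 0) + 1
        # A label is predicted when more than half of the neighbors carry it.
        return {label for label, n in votes.items() if n > len(neighbors) / 2}

    def learn(self, x, true_labels):
        true_labels = set(true_labels)
        # Assumed penalty rule: one penalty per label a neighbor carries
        # that the true label set lacks.
        for entry in self._neighbors(x):
            entry[2] += len(entry[1] - true_labels)
        # Evict examples that reached the penalty limit.
        self.window = [e for e in self.window if e[2] < self.penalty_limit]
        self.window.append([tuple(x), true_labels, 0])
        # Drop the oldest examples once the cap is exceeded.
        self.window = self.window[-self.max_window:]


# Toy stream: the same region is first labeled "a", then drifts to "b".
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.05, 0.05)]
model = PunitiveWindowKNN(k=3, max_window=50, penalty_limit=2)
for p in points:
    model.learn(p, {"a"})
before = model.predict((0.05, 0.05))
for _ in range(2):
    for p in points:
        model.learn(p, {"b"})
after = model.predict((0.05, 0.05))
print(before, after)
```

In this toy stream, the examples stored under the old concept accumulate penalties as soon as the labels change and are evicted well before the window cap is reached, so the prediction switches from the old label to the new one. A faithful implementation would also resize the window based on recent performance, which this sketch omits.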


Bibliographic details

Source: ACM Transactions on Knowledge Discovery from Data, 2019, No. 6, pp. 60.1-60.31 (31 pages)
Authors: Roseberry Martha; Krawczyk Bartosz; Cano Alberto
Author affiliation: Virginia Commonwealth Univ 401 W Main St E4251 Richmond VA 23284 USA
Original format: PDF
Language: English
Keywords: Multi-label classification; nearest neighbor; data stream; concept drift

