A drift detection method based on dynamic classifier selection

Pinage Felipe; dos Santos Eulanda M.; Gama Joao

首页> 外文期刊>Data mining and knowledge discovery >A drift detection method based on dynamic classifier selection

【24h】

A drift detection method based on dynamic classifier selection

机译：一种基于动态分类器选择的漂移检测方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Machine learning algorithms can be applied to several practical problems, such as spam, fraud and intrusion detection, and customer preferences, among others. In most of these problems, data come in streams, which mean that data distribution may change over time, leading to concept drift. The literature is abundant on providing supervised methods based on error monitoring for explicit drift detection. However, these methods may become infeasible in some real-world applications-where there is no fully labeled data available, and may depend on a significant decrease in accuracy to be able to detect drifts. There are also methods based on blind approaches, where the decision model is updated constantly. However, this may lead to unnecessary system updates. In order to overcome these drawbacks, we propose in this paper a semi-supervised drift detector that uses an ensemble of classifiers based on self-training online learning and dynamic classifier selection. For each unknown sample, a dynamic selection strategy is used to choose among the ensemble's component members, the classifier most likely to be the correct one for classifying it. The prediction assigned by the chosen classifier is used to compute an estimate of the error produced by the ensemble members. The proposed method monitors such a pseudo-error in order to detect drifts and to update the decision model only after drift detection. The achievement of this method is relevant in that it allows drift detection and reaction and is applicable in several practical problems. The experiments conducted indicate that the proposed method attains high performance and detection rates, while reducing the amount of labeled data used to detect drift.

机译：机器学习算法可以应用于若干实际问题，例如垃圾邮件，欺诈和入侵检测，以及客户偏好等。在大多数这些问题中，数据进入溪流，这意味着数据分布可能随时间变化，导致概念漂移。文献在提供基于出现明确漂移检测的错误监控的监督方法方面是丰富的。然而，这些方法可能在某些现实世界应用中变得不可行 - 如果没有完全标记的数据可用，并且可能取决于能够检测漂移的精度显着降低。还存在基于盲方法的方法，其中决策模型不断更新。但是，这可能导致不必要的系统更新。为了克服这些缺点，我们提出了一个半监督漂移探测器，它使用基于自我训练在线学习和动态分类器选择的分类器的集合。对于每个未知的样本，使用动态选择策略用于在合奏的组件成员中进行选择，分类器最有可能是用于对其进行分类的正确策略。所选择的分类器分配的预测用于计算集合成员产生的错误的估计。所提出的方法监视这种伪误差，以便检测漂移并仅在漂移检测之后更新决策模型。实现该方法的实现是相关的，因为它允许漂移检测和反应，并且适用于若干实际问题。进行的实验表明，该方法达到了高性能和检测率，同时降低了用于检测漂移的标记数据量。

著录项

来源
《Data mining and knowledge discovery》 |2020年第1期|共25页
作者
Pinage Felipe; dos Santos Eulanda M.; Gama Joao;
展开▼
作者单位

Univ Fed Amazonas Inst Comp Manaus Amazonas Brazil;

Univ Fed Amazonas Inst Comp Manaus Amazonas Brazil;

Univ Porto Inst Engn &

Comp Syst Porto Portugal;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Concept drift; Drift detection; Ensemble classifiers; Self-training; Data streams;

机译：概念漂移;漂移检测;合奏分类器;自我训练;数据流;

相似文献

外文文献
中文文献
专利

1. A drift detection method based on dynamic classifier selection [J] . Pinage Felipe, dos Santos Eulanda M., Gama Joao Data mining and knowledge discovery . 2020,第1期

机译：一种基于动态分类器选择的漂移检测方法
2. Performance assessment of a bleeding detection algorithm for endoscopic video based on classifier fusion method and exhaustive feature selection [J] . Farah Deeba, Monzurul Islam, Francis M. Bui, Biomedical signal processing and control . 2018,第feba期

机译：基于分类器融合方法和穷举特征选择的内窥镜视频出血检测算法性能评估
3. A dynamic selection ensemble method for target recognition based on clustering and randomized reference classifier [J] . Fan Xueman, Hu Shengliang, He Jingbo International journal of machine learning and cybernetics . 2019,第3期

机译：基于聚类和随机参考分类器的动态选择集成目标识别方法
4. Performance Analysis Of Fuzzy Rough Set-Based And Correlation-Based Attribute Selection Methods On Detection Of Chronic Kidney Disease With Various Classifiers [C] . Muhammet Sinan Başarslan, Fatih Kayaalp 2019 Scientific Meeting on Electrical-Electronics amp; Biomedical Engineering and Computer Science . 2019

机译：基于模糊粗糙集和基于相关性的属性选择方法在各种分类器检测慢性肾脏病中的性能分析
5. On Concept Drift, Deployability, and Adversarial Selection in Machine Learning-Based Malware Detection. [D] . Singh, Anshuman. 2012

机译：基于机器学习的恶意软件检测中的概念漂移，可部署性和对抗选择。
6. Metal Oxide Gas Sensor Drift Compensation Using a Dynamic Classifier Ensemble Based on Fitting [O] . Hang Liu, Zhenan Tang 2013

机译：基于拟合的动态分类器组件对金属氧化物气体传感器漂移的补偿
7. Concept Drift Detection and Adaption in Big Imbalance Industrial IoT Data Using an Ensemble Learning Method of Offline Classifiers [O] . Chun-Cheng Lin, Der-Jiunn Deng, Chin-Hung Kuo, 2019

机译：使用离线分类器的集合学习方法概念漂移检测和对大不平衡工业物联网数据的适应

A drift detection method based on dynamic classifier selection

摘要

著录项

相似文献

相关主题

期刊订阅