...
首页> 外文期刊>Journal of Artificial Intelligence and Soft Computing Research >A Novel Drift Detection Algorithm Based on Features’ Importance Analysis in a Data Streams Environment
【24h】

A Novel Drift Detection Algorithm Based on Features’ Importance Analysis in a Data Streams Environment

机译:一种新型漂移检测算法基于数据流环境中的特征的重要性分析

获取原文
           

摘要

The training set consists of many features that influence the classifier in different degrees. Choosing the most important features and rejecting those that do not carry relevant information is of great importance to the operating of the learned model. In the case of data streams, the importance of the features may additionally change over time. Such changes affect the performance of the classifier but can also be an important indicator of occurring concept-drift. In this work, we propose a new algorithm for data streams classification, called Random Forest with Features Importance (RFFI), which uses the measure of features importance as a drift detector. The RFFT algorithm implements solutions inspired by the Random Forest algorithm to the data stream scenarios. The proposed algorithm combines the ability of ensemble methods for handling slow changes in a data stream with a new method for detecting concept drift occurrence. The work contains an experimental analysis of the proposed algorithm, carried out on synthetic and real data.
机译:培训集包括许多功能影响不同程度的分类器的功能。选择最重要的特征和拒绝那些不携带相关信息的特征对于学习模型的运营非常重要。在数据流的情况下,特征的重要性可以随着时间的推移而变化。这种变化会影响分类器的性能,但也可以是发生概念漂移的重要指标。在这项工作中,我们提出了一种新的数据流分类算法,称为随机林,具有重要性(rffi),它使用vift探测器的特征重要性度量。 RFFT算法实现了随机林算法激发的解决方案到数据流方案。所提出的算法结合了组合方法来处理数据流中的缓慢变化的能力,以便检测概念漂移发生的新方法。该工作包含对合成和实数据进行的所提出的算法的实验分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号