首页> 外文期刊>Information Sciences: An International Journal >Knowledge-maximized ensemble algorithm for different types of concept drift
【24h】

Knowledge-maximized ensemble algorithm for different types of concept drift

机译:用于不同类型概念漂移的知识最大化的集合算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

AbstractKnowledge extraction from data streams has attracted attention in recent years due to its wide range of applications, including sensor networks, web clickstreams, and user interest analysis. Concept drift is one of the most important research topics in data stream mining. Many algorithms that can adapt to concept drift have been proposed. However, most of them specialize in only one type of concept drift and can rarely be used in the environments with a large number of unavailable sample labels. In this study, we propose a new data stream classifier called knowledge-maximized ensemble (KME). First, supervised and unsupervised knowledge are leveraged to detect concept drift, recognize recurrent concepts, and evaluate the weights of ensemble members. Second, the preserved labelled instances in past blocks can be reused to enhance the recognition ability of the candidate member. The final decision for an incoming observation is derived from all the prediction results of the component classifiers. Accordingly, the maximum utilization of the relevant information in a data stream can be achieved, which is critical to models with limited training data. Third, KME can react to multiple types of concept drift by combining the mechanisms of online and chunk-based ensembles. Finally, we compare KME with eight state-of-the-art classifiers on several synthetic and real-world datasets. The comparison demonstrates the effectiveness of KME in various types of concept drift scenarios.]]>
机译:<![cdata [ 抽象 近年来,由于其广泛的应用,包括传感器网络,Web结局和用户兴趣分析,因此近年来的知识提取引起了注意力。概念漂移是数据流挖掘中最重要的研究主题之一。已经提出了可以适应概念漂移的许多算法。然而,他们中的大多数专门从一种类型的概念漂移专门化,并且很少在具有大量不可用的样本标签的环境中使用。在本研究中,我们提出了一种名为知识最大化的集合(KME)的新数据流分类器。首先,监督和无监督的知识被利用来检测概念漂移,识别经常性概念,并评估集合成员的重量。其次,可以重复使用过去块中的保留标记的实例以增强候选成员的识别能力。输入观察的最终决定是从组件分类器的所有预测结果导出。因此,可以实现数据流中相关信息的最大利用,这对于具有有限训练数据的模型至关重要。第三,通过组合在线和基于块的集合的机制来实现多种类型的概念漂移。最后,我们在几个合成和现实世界数据集中与八个最先进的分类器进行比较。比较展示了KME在各种类型的概念漂移方案中的有效性。 ]]>

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号