首页> 外文会议>International Conference on Computer Communication and Informatics >Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models
【24h】

Attribute Pattern Weights (APW): A Scale to Detect Concept Drift in Data Stream Mining Models

机译:属性模式权重(APW):一种用于检测数据流挖掘模型中概念漂移的量表

获取原文

摘要

Extracting data structures from dynamic real-time data records is gaining prominence across industries. The need for massive mining of data sequences is increasingly observed in a wide range of user applications including social network platforms, banking sector, genomics, telecom sector, e-commerce and other sectors. To analyse multiple streams of data that is, for understanding rapid sequences of data flowing at continuous intervals, researchers are focusing on continuous improvements in data stream mining. Application of data mining models (like classifiers) in data streaming scenario mandates accurate detection of data distribution. Further, the model should adapt quickly to any variations in the distribution patterns to ensure the sustained performance of model predictability. Referred to as drift detection, the process can be gradual or abrupt. Extensive research has been made, designing several algorithms to accurately detect the type of drift and to adapt to shifts drift approaches. However, even the most reputed concept drift models have limited ability to adapt to both types of drift. The relationship between the adaptability and predictor variables is based on data distribution features and its sensitivity to in-built parameters. In this context, concept drift detection using attribute pattern weight (APW) is proposed here in this manuscript. Unlike the many of existing models, the proposed model is not dependent of any of the process targeted to apply on streaming data. The other significance of the proposed model is to detect the both types of concept drift that is gradual or abrupt. The experimental study that carried is evincing the scalability and robustness, and significance of the proposed model.
机译:从动态实时数据记录中提取数据结构在各个行业中都越来越重要。在广泛的用户应用程序中,包括社交网络平台,银行部门,基因组学,电信部门,电子商务和其他部门,越来越多地发现了对大量数据序列进行挖掘的需求。为了分析多个数据流,以便了解连续间隔的快速数据流,研究人员将重点放在数据流挖掘的持续改进上。在数据流场景中数据挖掘模型(如分类器)的应用要求对数据分布进行准确的检测。此外,模型应迅速适应分布模式的任何变化,以确保模型可预测性的持续表现。称为漂移检测,该过程可以是逐渐的或突然的。已经进行了广泛的研究,设计了几种算法来准确检测漂移的类型并适应于漂移的漂移方法。但是,即使是最著名的概念漂移模型,也难以适应两种类型的漂移。适应性和预测变量之间的关系基于数据分布特征及其对内置参数的敏感性。在本文中,本文提出了使用属性模式权重(APW)进行概念漂移检测。与许多现有模型不同,建议的模型不依赖于要应用于流数据的任何过程。提出的模型的另一个重要意义是检测渐进或突变的两种类型的概念漂移。进行的实验研究证明了该模型的可扩展性,鲁棒性和意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号