首页> 外文期刊>Engineering Applications of Artificial Intelligence >On utilizing weak estimators to achieve the online classification of data streams
【24h】

On utilizing weak estimators to achieve the online classification of data streams

机译:关于利用弱估计量实现数据流的在线分类

获取原文
获取原文并翻译 | 示例
           

摘要

Classification, typically, deals with unique and distinct training and testing phases. This paper pioneers the concept when these phases are not so clearly well-defined. More specifically, we consider the case where the testing patterns can subsequently be considered as training patterns. The paradigm is further complicated because we assume that the class-conditional distributions of the features/classes are non-stationary, as in the case of most real-world applications. Specifically, we consider the model where the training phase is non-stationary and that it is, further, interleaved with the testing, and where it can be done online and in a real-time manner.We propose a novel online classifier for complex data streams which are generated from non-stationary stochastic properties. Instead of using a single training model with "counters" that maintain important data statistics, our online classifier scheme provides a real-time self-adjusting learning model. The learning model utilizes the multiplication-based update algorithm of the Stochastic Learning Weak Estimator (SLWE) at each time instant as a new labeled instance arrives. In this way, the data statistics are updated every time a new element is seen, without requiring that we have to rebuild the model when changes occur in the data distributions. Finally, and most importantly, the model operates with the understanding that the correct classes of previously-classified patterns become available at a later juncture subsequent to some time instances. This forces us to update the training set, the training model and the class conditional distributions as the testing proceeds.The results from rigorous empirical analysis on two-dimensional/multi-dimensional and binomial/multinomial distributions are remarkable. We also report some results on two real-life datasets adapted to this model of computation, demonstrating the advantages of the novel scheme for both binomial and multinomial non-stationary distributions.
机译:通常,分类涉及独特且独特的培训和测试阶段。当这些阶段的定义不太明确时,本文将率先提出这一概念。更具体地,我们考虑测试模式随后可以被视为训练模式的情况。范式更加复杂,因为我们假设要素/类的类条件分布是非平稳的,就像在大多数实际应用程序中一样。具体来说,我们考虑了模型,其中训练阶段是不稳定的,并且进一步与测试交错,并且可以在线和实时地进行训练。针对复杂数据,我们提出了一种新颖的在线分类器由非平稳随机特性产生的流。我们的在线分类器方案提供了实时的自我调整学习模型,而不是使用具有“计数器”的单一训练模型来维护重要的数据统计信息。当新的标记实例到达时,学习模型在每个时刻利用随机学习弱估计器(SLWE)的基于乘法的更新算法。这样,每次看到一个新元素时,数据统计信息都会更新,而无需在数据分布发生变化时必须重建模型。最后,最重要的是,该模型在理解以下前提下运行:先前分类的模式的正确类别在某些时间实例之后的稍后时刻可用。随着测试的进行,这迫使我们更新训练集,训练模型和类条件分布。对二维/多维和二项式/多项式分布进行严格的经验分析得出的结果是显着的。我们还报告了适用于此计算模型的两个实际数据集的一些结果,证明了针对二项式和多项式非平稳分布的新颖方案的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号