首页> 外文会议>10th IEEE International Conference on Data Mining >Polishing the Right Apple: Anytime Classification Also Benefits Data Streams with Constant Arrival Times
【24h】

Polishing the Right Apple: Anytime Classification Also Benefits Data Streams with Constant Arrival Times

机译:抛光合适的苹果:随时分类也有益于持续到达时间的数据流

获取原文

摘要

Classification of items taken from data streams requires algorithms that operate in time sensitive and computationally constrained environments. Often, the available time for classification is not known a priori and may change as a consequence of external circumstances. Many traditional algorithms are unable to provide satisfactory performance while supporting the highly variable response times that exemplify such applications. In such contexts, anytime algorithms, which are amenable to trading time for accuracy, have been found to be exceptionally useful and constitute an area of increasing research activity. Previous techniques for improving anytime classification have generally been concerned with optimizing the probability of correctly classifying individual objects. However, as we shall see, serially optimizing the probability of correctly classifying individual objects K times, generally gives inferior results to batch optimizing the probability of correctly classifying K objects. In this work, we show that this simple observation can be exploited to improve overall classification performance by using an anytime framework to allocate resources among a set of objects buffered from a fast arriving stream. Our ideas are independent of object arrival behavior, and, perhaps unintuitively, even in data streams with constant arrival rates our technique exhibits a marked improvement in performance. The utility of our approach is demonstrated with extensive experimental evaluations conducted on a wide range of diverse datasets.
机译:从数据流中获取项目的分类需要在时间敏感和受计算限制的环境中运行的算法。通常,分类的可用时间不是先验的,并且可能会由于外部环境而改变。许多传统算法在支持举例说明此类应用程序的高度可变的响应时间时,无法提供令人满意的性能。在这种情况下,已经发现适合于交易时间以求准确性的随时算法是非常有用的,并且构成了不断增加的研究活动的领域。用于改善随时分类的先前技术通常与优化正确地对单个对象进行分类的可能性有关。但是,正如我们将看到的那样,串行优化正确地对单个对象进行K次分类的概率,对于批量优化正确地对K个对象进行分类的概率,通常给出的结果较差。在这项工作中,我们表明可以通过使用随时框架在从快速到达的流缓冲的一组对象之间分配资源来利用这种简单的观察方法来改善整体分类性能。我们的想法与对象的到达行为无关,并且,即使在恒定到达率的数据流中,我们的技术也表现出明显的性能提升,这是不直观的。我们的方法的实用性通过对各种不同数据集进行的广泛实验评估得到证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号