首页> 外文会议>IEEE International Conference on Data Mining >Polishing the Right Apple: Anytime Classification Also Benefits Data Streams with Constant Arrival Times
【24h】

Polishing the Right Apple: Anytime Classification Also Benefits Data Streams with Constant Arrival Times

机译:抛光右苹果:随时分类也会使数据流具有持续到达时间

获取原文
获取外文期刊封面目录资料

摘要

Classification of items taken from data streams requires algorithms that operate in time sensitive and computationally constrained environments. Often, the available time for classification is not known a priori and may change as a consequence of external circumstances. Many traditional algorithms are unable to provide satisfactory performance while supporting the highly variable response times that exemplify such applications. In such contexts, anytime algorithms, which are amenable to trading time for accuracy, have been found to be exceptionally useful and constitute an area of increasing research activity. Previous techniques for improving anytime classification have generally been concerned with optimizing the probability of correctly classifying individual objects. However, as we shall see, serially optimizing the probability of correctly classifying individual objects K times, generally gives inferior results to batch optimizing the probability of correctly classifying K objects. In this work, we show that this simple observation can be exploited to improve overall classification performance by using an anytime framework to allocate resources among a set of objects buffered from a fast arriving stream. Our ideas are independent of object arrival behavior, and, perhaps unintuitively, even in data streams with constant arrival rates our technique exhibits a marked improvement in performance. The utility of our approach is demonstrated with extensive experimental evaluations conducted on a wide range of diverse datasets.
机译:从数据流中取出的项目的分类需要在时间敏感和计算受限环境中运行的算法。通常,分类的可用时间尚不清楚,并且可能因外部环境而改变。许多传统算法无法提供令人满意的性能,同时支持举例说明此类应用程序的高度可变响应时间。在这种情况下,已经发现,已经针对准确性的交易时间提供了适用于交易时间的任何时间算法都特别有用,并构成了研究活动的增加。以前用于改善随时分类的技术通常涉及优化正确分类单个对象的概率。然而,正如我们所看到的,串行优化正确分类单个物体K次的概率,通常会给批量优化正确分类K对象的概率。在这项工作中,我们表明,可以利用这种简单的观察来通过使用随时框架来提高整体分类性能来在从快速到达流缓冲的一组对象中分配资源。我们的想法与对象到达行为无关,也许是无意中的,即使在持续到达速率的数据流中,我们的技术表现出显着提高性能。我们的方法的效用在广泛的不同数据集中进行了广泛的实验评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号