首页> 外文会议>INNS Conference on Big Data >Don't pay for validation : Detecting drifts from unlabeled data using margin density
【24h】

Don't pay for validation : Detecting drifts from unlabeled data using margin density

机译:不要支付验证:使用边距密度检测从未标记数据的漂移

获取原文

摘要

Validating online stream classifiers has traditionally assumed the availability of labeled samples, which can be monitored over time, to detect concept drift. However, labeling in streaming domains is expensive, time consuming and in certain applications, such as land mine detection, not a possibility at all. In this paper, the Margin Density Drift Detection (MD3) approach is proposed, which can signal change using unlabeled samples and requires labeling only for retraining, in the event of a drift. The MD3 approach when evaluated on 5 synthetic and 5 real world drifting data streams, produced statistically equivalent classification accuracy to that of a fully labeled accuracy tracking drift detector, and required only a third of the samples to be labeled, on average.
机译:验证在线流分类器传统上假设标记样本的可用性,可以随时间监测,以检测概念漂移。然而,在流域中的标记是昂贵的,耗时的,并且在某些应用中,例如土地矿区检测,而不是可能的可能性。在本文中,提出了利润密度漂移检测(MD3)方法,其可以使用未标记的样品来信号变化,并且需要仅在漂移时标记用于再培训。 MD3方法在评估5个合成和5个现实世界漂移数据流时,为完全标记的精度跟踪漂移探测器的分类准确性产生了统计上等效的分类精度,并且仅需三分之一的样品平均被标记。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号