首页> 外文会议>IEEE International Conference on Data Science and Advanced Analytics >Forest of Normalized Trees: Fast and Accurate Density Estimation of Streaming Data
【24h】

Forest of Normalized Trees: Fast and Accurate Density Estimation of Streaming Data

机译:标准化树森林:流数据的快速准确密度估计

获取原文

摘要

Density estimation of streaming data is a relevant task in numerous domains. In this paper, a novel non-parametric density estimator called FRONT (forest of normalized trees) is introduced. It uses a structure of multiple normalized trees, segments the feature space of the data stream through a periodically updated linear transformation and is able to adapt to ever evolving data streams. FRONT provides accurate density estimation and performs favorably compared to existing online density estimators in terms of the average log score on multiple standard data sets. Its low complexity, linear runtime as well as constant memory usage, makes FRONT by design suitable for large data streams. Finally, the paper provides a variation of FRONT called N-FRONT suitable for statistically independent data streams and correction methods for badly initialized trees to further improve performance.
机译:流数据的密度估计是许多领域中的一项相关任务。本文介绍了一种称为FRONT(归一化树的森林)的新型非参数密度估计器。它使用多个标准化树的结构,通过定期更新的线性变换来分割数据流的特征空间,并且能够适应不断发展的数据流。就多个标准数据集的平均对数得分而言,FRONT提供准确的密度估计,并且与现有的在线密度估计器相比,性能要好。它的低复杂度,线性运行时间以及恒定的内存使用率,使FRONT从设计上就适合于大型数据流。最后,本文提供了一种称为N-FRONT的FRONT变体,适用于统计上独立的数据流,以及针对初始化不良的树的校正方法,以进一步提高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号