【24h】

Forecasting High-Dimensional Data

机译:预测高维数据

获取原文

摘要

We propose a method for forecasting high-dimensional data (hundreds of attributes, trillions of attribute combinations) for a duration of several months. Our motivating application is guaranteed display advertising, a multi-billion dollar industry, whereby advertisers can buy targeted (high-dimensional) user visits from publishers many months or even years in advance. Forecasting high-dimensional data is challenging because of the many possible attribute combinations that need to be forecast. To address this issue, we propose a method whereby only a sub-set of attribute combinations are explicitly forecast and stored, while the other combinations are dynamically forecast on-the-fly using high-dimensional attribute correlation models. We evaluate various attribute correlation models, from simple models that assume the independence of attributes to more sophisticated sample-based models that fully capture the correlations in a high-dimensional space. Our evaluation using real-world display advertising data sets shows that fully capturing high-dimensional correlations leads to significant forecast accuracy gains. A variant of the proposed method has been implemented in the context of Yahoo! 's guaranteed display advertising system.
机译:我们提出了一种方法,用于预测高维数据(数百个属性,十亿个属性组合)的时间几个月。我们的激励申请是保证展示广告,一个数十亿美元的行业,广告商可以提前几个月甚至几年从出版商购买目标(高维)用户访问。由于需要预测的许多可能的属性组合,预测高维数据具有具有挑战性。为了解决这个问题,我们提出了一种方法,其中仅明确地预测和存储了一个属性组合的子集,而其他组合使用高维属性相关模型动态预测。我们评估各种属性相关模型,从假设属性的独立性到更复杂的基于样本的模型的简单模型,该模型完全捕获高维空间中的相关性。我们使用真实展示广告数据集的评估显示,完全捕获的高维相关性导致了显着的预测精度增益。在雅虎的背景下实施了所提出的方法的变体。保证展示广告系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号