首页> 外国专利> MULTIPLE IMPUTATION OF MISSING DATA IN MULTI-DIMENSIONAL RETAIL SALES DATA SETS VIA TENSOR FACTORIZATION

MULTIPLE IMPUTATION OF MISSING DATA IN MULTI-DIMENSIONAL RETAIL SALES DATA SETS VIA TENSOR FACTORIZATION

机译:通过张量因子化在多维零售销售数据集中对缺失数据进行多次插补

摘要

A system, method and computer program product provides for multiple imputation of missing data elements in retail data sets used for modeling and decision-support applications based on the multi-dimensional, tensor structure of the data sets, and a fast, scalable scheme is implemented that is suitable for large data sets. The method generates multiple imputations comprising a set of complete data sets each containing one of a plurality of imputed realizations for the missing data values in the original data set, so that the variability in the magnitudes of these missing data values can be captured for subsequent statistical analysis. The method is based on the multi-dimensional structure of the retail data sets incorporating tensor factorization, that in a preferred embodiment can be implemented using fast, scalable imputation methods suitable for large data sets, to obtain multiple complete data sets in which the original missing values are replaced by various imputed values.
机译:一种系统,方法和计算机程序产品,基于数据集的多维,张量结构,为用于建模和决策支持应用程序的零售数据集中提供了缺失数据元素的多个插补,并实现了一种快速,可扩展的方案适用于大型数据集。该方法生成包括一组完整数据集的多个估算,每个完整数据集包含用于原始数据集中的缺失数据值的多个估算实现中的一个,从而可以捕获这些缺失数据值的大小的可变性以用于后续统计分析。该方法基于结合了张量因子分解的零售数据集的多维结构,在优选实施例中,该方法可以使用适用于大型数据集的快速,可扩展的插补方法来实现,以获得多个完整的数据集,其中原始缺失值将替换为各种估算值。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号