The Big Data Newsvendor: Practical Insights from Machine Learning

Ban Gah-Yi; Rudin Cynthia

Abstract

We investigate the data-driven newsvendor problem when one has n observations of p features related to the demand as well as historical demand data. Rather than a two-step process of first estimating a demand distribution and then optimizing for the order quantity, we propose solving the "big data" newsvendor problem via single-step machine-learning algorithms. Specifically, we propose algorithms based on the empirical risk minimization (ERM) principle, with and without regularization, and an algorithm based on kernel-weights optimization (KO). The ERM approaches, equivalent to high-dimensional quantile regression, can be solved by convex optimization problems and the KO approach by a sorting algorithm. We analytically justify the use of features by showing that their omission yields inconsistent decisions. We then derive finite-sample performance bounds on the out-of-sample costs of the feature-based algorithms, which quantify the effects of dimensionality and cost parameters. Our bounds, based on algorithmic stability theory, generalize known analyses for the newsvendor problem without feature information. Finally, we apply the feature-based algorithms to nurse staffing in a hospital emergency room using a data set from a large UK teaching hospital and find that (1) the best ERM and KO algorithms beat the best practice benchmark by 23% and 24%, respectively, in the out-of-sample cost, and (2) the best KO algorithm is faster than the best ERM algorithm by three orders of magnitude and the best practice benchmark by two orders of magnitude.
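The abstract states that the KO approach can be solved by a sorting algorithm. The sketch below illustrates one way such a computation could look, and it is not taken from the paper itself. It assumes a newsvendor cost of b per unit of unmet demand and h per unit of excess order, a Gaussian kernel, and a fixed bandwidth. The names newsvendor_cost, kernel_order_quantity, and bandwidth are hypothetical. Under these assumptions the order quantity is a kernel-weighted b/(b+h) quantile of past demands, found by sorting demands and accumulating weights.

```python
import math


def newsvendor_cost(q, d, b, h):
    """Cost of ordering q when demand is d: b per unit short, h per unit left over."""
    return b * max(d - q, 0.0) + h * max(q - d, 0.0)


def kernel_order_quantity(x_new, features, demands, b, h, bandwidth=1.0):
    """Kernel-weighted b/(b+h) quantile of past demands (illustrative sketch).

    Assumption: Gaussian kernel weights with a user-chosen bandwidth.
    """
    # Weight each past observation by how close its features are to x_new.
    weights = [
        math.exp(-sum((a - c) ** 2 for a, c in zip(x, x_new)) / (2 * bandwidth ** 2))
        for x in features
    ]
    # Threshold on cumulative weight that defines the critical quantile.
    target = b / (b + h) * sum(weights)
    cumulative = 0.0
    # Sort by demand and return the first demand whose cumulative weight reaches the target.
    for d, w in sorted(zip(demands, weights)):
        cumulative += w
        if cumulative >= target:
            return d
    return max(demands)  # fallback against floating-point round-off


if __name__ == "__main__":
    past_features = [[0.0], [0.0], [0.0], [10.0]]
    past_demands = [10.0, 20.0, 30.0, 100.0]
    print(kernel_order_quantity([0.0], past_features, past_demands, b=1.0, h=1.0))
```

In this toy data set, the observation with feature value 10.0 receives almost no weight when the query feature is 0.0, so the order quantity is driven by the three nearby observations. Raising b relative to h shifts the chosen quantile upward.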


Bibliographic information

Source: Operations Research: The Journal of the Operations Research Society of America, 2019, Issue 1, 19 pages
Authors: Ban Gah-Yi; Rudin Cynthia
Affiliations:
London Business School, Management Science & Operations, London NW1 4SA, England
Duke University, Departments of Computer Science, Electrical & Computer Engineering, and Statistical Science, Durham, NC 27708, USA
Original format: PDF
Language: English
Chinese Library Classification: Operations research
Keywords: big data; newsvendor; machine learning; sample average approximation; statistical learning theory; quantile regression
