首页> 外文学位 >Energy Storage Applications of the Knowledge Gradient for Calibrating Continuous Parameters, Approximate Policy Iteration using Bellman Error Minimization with Instrumental Variables, and Covariance Matrix Estimation using an Errors-in-Variables Factor Model.
【24h】

Energy Storage Applications of the Knowledge Gradient for Calibrating Continuous Parameters, Approximate Policy Iteration using Bellman Error Minimization with Instrumental Variables, and Covariance Matrix Estimation using an Errors-in-Variables Factor Model.

机译:知识梯度的能量存储应用,用于校准连续参数,使用带工具变量的Bellman误差最小化进行近似策略迭代以及使用可变误差因子模型进行协方差矩阵估计。

获取原文
获取原文并翻译 | 示例

摘要

We describe an adaptation of the knowledge gradient, originally developed for discrete ranking and selection problems, to the problem of calibrating continuous parameters for the purpose of tuning a simulator. The knowledge gradient for continuous parameters uses a continuous approximation of the expected value of a single measurement to guide the choice of where to collect information next. We show how to find the parameter setting that maximizes the expected value of a measurement by optimizing a continuous but nonconcave surface. We compare the method to sequential kriging for a series of test surfaces, and then demonstrate its performance in the calibration of an expensive industrial simulator.;We next describe an energy storage problem which combines energy from wind and the grid along with a battery to meet a stochastic load. We formulate the problem as an infinite horizon Markov decision process. We first discretize the state space and action space on a simplfied version of the problem to get optimal solutions using exact value iteration. We then evaluate several approximate policy iteration algorithms and evaluate their performance. We show that Bellman error minimization with instrumental variables is equivalent to projected Bellman error minimization, previously believed to be two different policy evaluation algorithms. Furthermore, we provide a convergence proof for Bellman error minimization with instrumental variables under certain assumptions. We compare approximate policy iteration and direct policy search on the simplified benchmark problems along with the full continuous problems.;Finally, we describe a portfolio selection method for choosing virtual electricity contracts in the PJM electricity markets, contracts whose payoffs depend on the difference between the day-ahead and real-time locational marginal electricity prices in PJM. We propose an errors-in-variables factor model which is an extension of the classical capital asset pricing model. We show how the model can be used to estimate the covariance matrix of the returns of assets. For US equities and PJM virtual contracts, we show the benefits of the portfolios produced with the new covariance estimation method.
机译:我们描述了最初为离散排序和选择问题而开发的知识梯度对校准连续参数以优化模拟器的目的的适应。连续参数的知识梯度使用单个测量的期望值的连续近似值来指导下一步在何处收集信息。我们展示了如何通过优化连续但不凹的表面来找到使测量的期望值最大化的参数设置。我们将该方法与一系列测试表面的顺序克里金法进行比较,然后证明其在校准昂贵的工业模拟器中的性能。接下来,我们将描述一个能量存储问题,该问题将风能和电网的能量与电池结合在一起,以满足随机负载。我们将问题表述为无限地平线马尔可夫决策过程。我们首先在问题的简化版本上离散状态空间和动作空间,以使用精确值迭代获得最佳解决方案。然后,我们评估几种近似的策略迭代算法并评估其性能。我们证明了具有工具变量的Bellman错误最小化等效于预计的Bellman错误最小化,以前认为这是两种不同的策略评估算法。此外,在某些假设下,我们为使用工具变量的Bellman误差最小化提供了收敛证明。我们比较了简化基准问题和完全连续问题之间的近似策略迭代和直接策略搜索。最后,我们描述了一种在PJM电力市场中选择虚拟电力合同的投资组合选择方法,其收益取决于合同之间的差异。 PJM的日前和实时位置边际电价。我们提出了一个变量误差因子模型,该模型是经典资本资产定价模型的扩展。我们展示了如何使用该模型来估计资产收益的协方差矩阵。对于美国股票和PJM虚拟合约,我们展示了使用新的协方差估算方法产生的投资组合的好处。

著录项

  • 作者

    Scott, Warren Robert.;

  • 作者单位

    Princeton University.;

  • 授予单位 Princeton University.;
  • 学科 Operations research.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 161 p.
  • 总页数 161
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:42:39
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号