首页> 外文学位 >Quantifying shared information value in a supply chain using decentralized Markov decision processes with restricted observations.
【24h】

Quantifying shared information value in a supply chain using decentralized Markov decision processes with restricted observations.

机译:使用具有受限观察结果的分散马尔可夫决策过程量化供应链中的共享信息价值。

获取原文
获取原文并翻译 | 示例

摘要

Information sharing in a two-stage and three-stage supply chain is studied. Assuming the customer demand distribution is known along the supply chain, the information to be shared is the inventory level of each supply chain member. In order to study the value of shared information, the supply chain is examined under different information sharing schemes. A Markov decision process (MDP) approach is used to model the supply chain, and the optimal policy given each scheme is determined. By comparing these schemes, the value of shared information can be quantified. Since the optimal policy maximizes the total profit within a supply chain, allocation of the profit among supply chain members, or transfer cost/price negotiation, is also discussed.; The information sharing schemes include full information sharing, partial information sharing and no information sharing. In the case of full information sharing, the supply chain problem is modeled as a single agent Markov decision process with complete observations (a traditional MDP) which can be solved based on the policy iteration method of Howard (1960). In the case of partial information sharing or no information sharing, the supply chain problem is modeled as a decentralized Markov decision process with restricted observations (DEC-ROMDP). Each agent may have complete observation of the process, or may have only restricted observation of the process. In order to solve the DEC-ROMDP, an evolutionary coordination algorithm is introduced, which proves to be effective if coupled with policy perturbation and multiple start strategies.
机译:研究了两阶段和三阶段供应链中的信息共享。假设客户需求分布在整个供应链中是已知的,则要共享的信息是每个供应链成员的库存水平。为了研究共享信息的价值,在不同的信息共享方案下检查了供应链。使用马尔可夫决策过程(MDP)方法对供应链进行建模,并根据给定的每个方案确定最佳策略。通过比较这些方案,可以量化共享信息的价值。由于最优政策使供应链中的总利润最大化,因此还讨论了在供应链成员之间分配利润或转移成本/价格谈判。信息共享方案包括完全信息共享,部分信息共享和不信息共享。在完全信息共享的情况下,将供应链问题建模为具有完整观测值的传统智能马尔可夫决策过程(传统的MDP),可以基于霍华德(1960)的策略迭代方法进行求解。在部分信息共享或没有信息共享的情况下,将供应链问题建模为具有受限观察结果的分散式马尔可夫决策过程(DEC-ROMDP)。每个代理程序可能对过程有完整的观察,也可能仅对过程有严格的观察。为了解决DEC-ROMDP,引入了一种进化协调算法,该算法在与策略扰动和多启动策略结合使用时被证明是有效的。

著录项

  • 作者

    Wei, Wenbin.;

  • 作者单位

    North Carolina State University.;

  • 授予单位 North Carolina State University.;
  • 学科 Engineering Industrial.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 79 p.
  • 总页数 79
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 一般工业技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号