Asynchronous action-reward learning for nonstationary serial supply chain inventory control

Kim CO; Kwon IH; Baek JG

Abstract

Action-reward learning is a reinforcement learning method in which an agent interacts with a non-deterministic control domain. The agent selects actions at decision epochs, and the control domain returns rewards that are used to update the performance measures of those actions. The agent's objective is to select the best future actions based on the updated performance measures. In this paper, we develop an asynchronous action-reward learning model that updates the performance measures of actions faster than conventional action-reward learning. This model is well suited to nonstationary control domains, where the rewards for actions vary over time. Based on asynchronous action-reward learning, two situation-reactive inventory control models (centralized and decentralized) are proposed for a two-stage serial supply chain with nonstationary customer demand. A simulation-based experiment was performed to evaluate the performance of the two proposed models.
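
The abstract does not state the update rule, so the following is only a minimal sketch under stated assumptions, not the authors' model. It assumes that a performance measure is an exponentially weighted average of rewards, that the "asynchronous" variant refreshes the measure of every candidate action once demand is observed (rather than only the action actually taken), and that the reward is the negative of a simple holding-plus-shortage cost. All function names, the cost parameters, the candidate order-up-to levels, and the demand process are hypothetical.

```python
import random


def update_selected(values, action, reward, step=0.2):
    """Conventional update: only the chosen action's measure moves."""
    values[action] += step * (reward - values[action])


def update_all(values, rewards, step=0.2):
    """Assumed asynchronous-style update: every action's measure moves."""
    for action, reward in rewards.items():
        values[action] += step * (reward - values[action])


def choose(values, epsilon, rng):
    """Epsilon-greedy selection over the current performance measures."""
    if rng.random() < epsilon:
        return rng.choice(sorted(values))
    return max(values, key=values.get)


def period_reward(order_up_to, demand, holding=1.0, shortage=4.0):
    """Negative one-period cost of an order-up-to level (hypothetical cost model)."""
    leftover = max(order_up_to - demand, 0)
    unmet = max(demand - order_up_to, 0)
    return -(holding * leftover + shortage * unmet)


def simulate(update_everything, periods=400, seed=0):
    """Run one agent against demand whose mean shifts halfway through."""
    rng = random.Random(seed)
    levels = [10, 20, 30, 40]
    values = {level: 0.0 for level in levels}
    total = 0.0
    for t in range(periods):
        mean = 15 if t < periods // 2 else 35  # nonstationary demand
        demand = max(0, round(rng.gauss(mean, 3)))
        action = choose(values, 0.1, rng)
        total += period_reward(action, demand)
        if update_everything:
            # Once demand is known, the reward of each level can be computed.
            update_all(values, {lvl: period_reward(lvl, demand) for lvl in levels})
        else:
            update_selected(values, action, period_reward(action, demand))
    return values, total
```

Calling `simulate(True)` and `simulate(False)` with the same seed gives a rough way to compare how quickly each update scheme tracks the demand shift under these assumptions. A constant step size is used instead of a sample average so that old rewards are gradually forgotten, which is a common choice when rewards drift over time.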

Bibliographic details

Source: Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies, 2008, Issue 1, 16 pages
Authors: Kim CO; Kwon IH; Baek JG
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords: action reward learning; machine learning; asynchronous performance measure update; situation reactive inventory control; two-stage serial supply chain; nonstationary customer demand; MODELS; SYSTEM; POLICIES; DEMAND; TIME

Similar literature

1. Asynchronous action-reward learning for nonstationary serial supply chain inventory control [J]. Chang Ouk Kim, Ick-Hyun Kwon, Jun-Geol Baek. Applied Intelligence. 2009, Issue 2.
2. Asynchronous action-reward learning for nonstationary serial supply chain inventory control [J]. Chang Ouk Kim, Ick-Hyun Kwon, Jun-Geol Baek. Applied Intelligence. 2008, Issue 1.
3. Asynchronous action-reward learning for nonstationary serial supply chain inventory control [J]. Kim CO, Kwon IH, Baek JG. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies. 2008, Issue 1.
4. Optimal Strategic Supply Chain Inventory Positioning For Serial Supply Chains [C]. Ricki G. Ingalls. Institute of Industrial Engineers Annual Conference. 2003.
5. Optimal inventory policies in serial supply chains: Bounds, heuristics, and insights [D]. Shang, Kevin Huei-Min. 2002.
6. Stochastic Inventory Model for Minimizing Blood Shortage and Outdating in a Blood Supply Chain under Supply and Demand Uncertainty [O]. Han Shih, Suchithra Rajendran. 2020.
7. Asynchronous action-reward learning for nonstationary serial supply chain inventory control [O]. Chang Ouk Kim, Ick-Hyun Kwon, Jun-Geol Baek. 2007.
