首页> 外文期刊>Journal of Revenue and Pricing Management >Deep reinforcement learning in seat inventory control problem: an action generation approach
【24h】

Deep reinforcement learning in seat inventory control problem: an action generation approach

机译:座椅库存控制问题的深度加固学习:行动生成方法

获取原文
获取原文并翻译 | 示例
       

摘要

Nowadays, firms intend to use customer choice-based models instead of an independent demand paradigm to generate more revenue. In this paper, we address choice-based seat inventory control problem with stochastic demand using a deep reinforcement learning technique named Deep Q-Network (DQN). DQN can naturally address large state space problems with its integrated function approximation. However, it becomes intractable in the case of large discrete action space. To address this issue, we propose an Action Generation (AGen) algorithm. AGen is a greedy heuristic algorithm designed to be integrated into DQN to overcome the complexity of the original problem. It aims to greedily generate a set of "effective" actions to replace the original action space. This leads to the main achievement of this study which is to dramatically decrease the complexity of the solution method without negatively affecting its performance in a large-scale choice-based seat inventory allocation problem.
机译:如今,公司打算使用基于客户选择的模型而不是独立的需求范例来产生更多收入。 本文使用名为Deep Q-Network(DQN)的深度加强学习技术来解决基于选择的座椅库存控制问题。 DQN自然可以通过集成函数近似来解决大状态空间问题。 然而,在大型离散动作空间的情况下它变得棘手。 要解决此问题,我们提出了一个动作生成(Agen)算法。 Agen是一种贪婪的启发式算法,旨在集成到DQN中以克服原始问题的复杂性。 它旨在贪婪地生成一组“有效的”动作来取代原始动作空间。 这导致本研究的主要成就,这是为了显着降低解决方案方法的复杂性而不会对大规模选择的座位库存分配问题产生负面影响其性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号