首页> 外文学位 >Classification problems for MDPs and optimal customer admission to M/M/k/N queues.
【24h】

Classification problems for MDPs and optimal customer admission to M/M/k/N queues.

机译:MDP的分类问题和M / M / k / N队列的最佳客户准入。

获取原文
获取原文并翻译 | 示例

摘要

My dissertation addresses the unichain classification problem for any finite Markov decision process (MDP) with a recurrent or stopping state and the optimal admission problem for an M/M/k/N queue with holding costs. In the first chapter, we study the unichain classification problem for MDPs. The unichain classification problem is to detect whether an MDP with finite states and actions is unichain or not. This problem has been proven to be NP-hard. We study this problem while an MDP contains a state which is either recurrent under all deterministic policies or absorbing under some action. We introduce the definitions of avoidable and reach- able sets and provide the corresponding polynomial algorithms that finds the states from which a given set is avoidable or reachable. We also provide a polynomial algorithm that detects whether a state is recurrent and solves the unichain classification problem for an MDP with a recurrent state and a polynomial algorithm for finding all recurrent and stopping states and solving the unichain classification problem with recurrent or stopping states. At the end of the first chapter, we discuss detecting all transient states in an MDP in polynomial time.;In the second chapter, we study optimal admission of arrivals to an M/M/k/N queue. The arriving customers are classified into m types, where m ≥ 1. The rewards and holding costs depend on customer types. Upon admitting an arriving customer, the system collects the reward from the admitted customer and pays the holding cost to the admitted customer. We study average reward per unit time for the problem. We prove the existence of an optimal trunk reservation policy and describe the structures of stationary optimal, canonical, bias optimal, and Blackwell optimal policies. If there exist two or more stationary optimal policies, we apply more sensitive optimality criteria to detect the best policy among all stationary optimal policies. We show that bias optimal and Blackwell optimal policies are unique, coincide, and are the trunk reservation policies with the largest optimal control level for each customer type.;Key Words: Unichain classification, Markov decision process, recurrent state, polynomial algorithm, queueing system, trunk reservation policy, stationary optimal policy, canonical policy, bias optimal policy, Blackwell optimal policy.
机译:本文针对具有递归或停止状态的任何有限马尔可夫决策过程(MDP)的单链分类问题,以及具有持有成本的M / M / k / N队列的最优接纳问题。在第一章中,我们研究了MDP的单链分类问题。单链分类问题是检测具有有限状态和动作的MDP是否为单链。已经证明此问题是NP难题。我们研究这个问题,而MDP包含的状态在所有确定性政策下都反复出现,或者在某种行动下吸收。我们介绍了可避免和可到达集合的定义,并提供了相应的多项式算法,该算法可找到可避免或可到达给定集合的状态。我们还提供了一种多项式算法,该算法可检测状态是否为重复状态,并解决具有重复状态的MDP的单链分类问题,以及提供多项式算法,以查找所有重复和停止状态并解决具有重复或停止状态的单链分类问题。在第一章的末尾,我们讨论了在多项式时间内检测MDP中的所有瞬态状态。在第二章中,我们研究了到达M / M / k / N队列的最佳到达接纳。到达的客户分为m个类型,其中m≥1。奖励和持有成本取决于客户类型。在接纳到达的顾客时,系统从接纳的顾客那里收集奖励,并将持有成本支付给接纳的顾客。我们研究该问题每单位时间的平均奖励。我们证明了最优干线预留策略的存在,并描述了静态最优,规范,偏向最优和布莱克韦尔最优策略的结构。如果存在两个或多个固定最优策略,我们将应用更敏感的最优性准则来检测所有固定最优策略中的最佳策略。我们证明了偏向最优策略和布莱克韦尔最优策略是唯一的,一致的,并且是每种客户类型具有最大最优控制水平的中继线保留策略。关键词:单链分类,马尔可夫决策过程,递归状态,多项式算法,排队系统,中继线预留策略,固定最优策略,规范策略,偏差最优策略,Blackwell最优策略。

著录项

  • 作者

    Yang, Fenghsu.;

  • 作者单位

    State University of New York at Stony Brook.;

  • 授予单位 State University of New York at Stony Brook.;
  • 学科 Applied Mathematics.;Operations Research.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 75 p.
  • 总页数 75
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号