首页> 外文学位 >Classification problems for MDPs and optimal customer admission to M/M/k/N queues.

【24h】

Classification problems for MDPs and optimal customer admission to M/M/k/N queues.

机译：MDP的分类问题和M / M / k / N队列的最佳客户准入。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

My dissertation addresses the unichain classification problem for any finite Markov decision process (MDP) with a recurrent or stopping state and the optimal admission problem for an M/M/k/N queue with holding costs. In the first chapter, we study the unichain classification problem for MDPs. The unichain classification problem is to detect whether an MDP with finite states and actions is unichain or not. This problem has been proven to be NP-hard. We study this problem while an MDP contains a state which is either recurrent under all deterministic policies or absorbing under some action. We introduce the definitions of avoidable and reach- able sets and provide the corresponding polynomial algorithms that finds the states from which a given set is avoidable or reachable. We also provide a polynomial algorithm that detects whether a state is recurrent and solves the unichain classification problem for an MDP with a recurrent state and a polynomial algorithm for finding all recurrent and stopping states and solving the unichain classification problem with recurrent or stopping states. At the end of the first chapter, we discuss detecting all transient states in an MDP in polynomial time.;In the second chapter, we study optimal admission of arrivals to an M/M/k/N queue. The arriving customers are classified into m types, where m ≥ 1. The rewards and holding costs depend on customer types. Upon admitting an arriving customer, the system collects the reward from the admitted customer and pays the holding cost to the admitted customer. We study average reward per unit time for the problem. We prove the existence of an optimal trunk reservation policy and describe the structures of stationary optimal, canonical, bias optimal, and Blackwell optimal policies. If there exist two or more stationary optimal policies, we apply more sensitive optimality criteria to detect the best policy among all stationary optimal policies. We show that bias optimal and Blackwell optimal policies are unique, coincide, and are the trunk reservation policies with the largest optimal control level for each customer type.;Key Words: Unichain classification, Markov decision process, recurrent state, polynomial algorithm, queueing system, trunk reservation policy, stationary optimal policy, canonical policy, bias optimal policy, Blackwell optimal policy.

机译：本文针对具有递归或停止状态的任何有限马尔可夫决策过程（MDP）的单链分类问题，以及具有持有成本的M / M / k / N队列的最优接纳问题。在第一章中，我们研究了MDP的单链分类问题。单链分类问题是检测具有有限状态和动作的MDP是否为单链。已经证明此问题是NP难题。我们研究这个问题，而MDP包含的状态在所有确定性政策下都反复出现，或者在某种行动下吸收。我们介绍了可避免和可到达集合的定义，并提供了相应的多项式算法，该算法可找到可避免或可到达给定集合的状态。我们还提供了一种多项式算法，该算法可检测状态是否为重复状态，并解决具有重复状态的MDP的单链分类问题，以及提供多项式算法，以查找所有重复和停止状态并解决具有重复或停止状态的单链分类问题。在第一章的末尾，我们讨论了在多项式时间内检测MDP中的所有瞬态状态。在第二章中，我们研究了到达M / M / k / N队列的最佳到达接纳。到达的客户分为m个类型，其中m≥1。奖励和持有成本取决于客户类型。在接纳到达的顾客时，系统从接纳的顾客那里收集奖励，并将持有成本支付给接纳的顾客。我们研究该问题每单位时间的平均奖励。我们证明了最优干线预留策略的存在，并描述了静态最优，规范，偏向最优和布莱克韦尔最优策略的结构。如果存在两个或多个固定最优策略，我们将应用更敏感的最优性准则来检测所有固定最优策略中的最佳策略。我们证明了偏向最优策略和布莱克韦尔最优策略是唯一的，一致的，并且是每种客户类型具有最大最优控制水平的中继线保留策略。关键词：单链分类，马尔可夫决策过程，递归状态，多项式算法，排队系统，中继线预留策略，固定最优策略，规范策略，偏差最优策略，Blackwell最优策略。

著录项

作者
Yang, Fenghsu.;
展开▼
作者单位

State University of New York at Stony Brook.;

展开▼
授予单位 State University of New York at Stony Brook.;
学科 Applied Mathematics.;Operations Research.
学位 Ph.D.
年度 2010
页码 75 p.
总页数 75
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Optimal SMDP‐Based Connection Admission Control Mechanism in Cognitive Radio Sensor Networks [J] . Elahe Hosseini, Reza Berangi ETRI journal . 2017,第3期

机译：认知无线电传感器网络中基于SMDP的最佳连接准入控制机制
2. Optimal Switching Strategy between Admission Control and Pricing Control Policies with Two Types of Customers and Search Costs [J] . Jae-Dong Son Advances in Operations Research . 2015,第Null期

机译：具有两种类型的顾客和搜索成本的准入控制和定价控制策略之间的最优切换策略
3. Optimal control of queue in the M|G|1|infinity system with possibility of customer admission restriction [J] . Grishunina Yu. B. Automation and Remote Control . 2015,第3期

机译：M | G | 1 |无穷大系统中排队的最优控制，可能会限制客户进入
4. An SMDP-based optimal admission control scheme in cognitive radio networks [C] . Qiang Deng, Dong-liang Xie, Bo Hu, 2010 Global Mobile Congress . 2010

机译：认知无线电网络中基于SMDP的最优准入控制方案
5. OPTIMAL SCHEDULING IN NETWORKS OF QUEUES. [D] . MOUSTAFA, MAGDI SAMI. 1987

机译：排队网络中的最佳调度。
6. The optimal delivery time and order quantity in an oligopoly market with time-sensitive customers [O] . Haijiao Li, Weijin Xu, Kuan Yang 2019

机译：寡头垄断市场中的最佳交货时间和订单数量与时敏客户
7. Optimally Scheduling Admission of N Customers from Low–Price Buffer to High-Price Queue, Part I: Asymptotic Rate [O] . Daniel C. Lee 2013

机译：N个客户从低价缓冲区到高价队列的最优调度安排，第一部分：渐近率

Classification problems for MDPs and optimal customer admission to M/M/k/N queues.

摘要

著录项

相似文献

相关主题

期刊订阅