首页> 外文期刊>Journal of the Franklin Institute >Optimal sensor scheduling for station-keeping in denied environments
【24h】

Optimal sensor scheduling for station-keeping in denied environments

机译:在拒绝的环境中进行站维护的最佳传感器调度

获取原文
获取原文并翻译 | 示例
       

摘要

Consider that a particle-like agent, affected by exogenous disturbances, seeks to remain as close as possible to a reference point. Its state evolves as a Markov decision process in discrete time and the actuation effort is cost-free. A denied environment within which state measurements must be requested and are costly encloses the reference point. Measurements outside the denied region are provided cost-free without the need for a request. No control is applied in the absence of a measurement. At each time step, the agent has the authority to decide whether to wander like a random walk or to request a measurement and use it to move towards the reference point. This paper investigates measurement request policies that minimize an objective function that comprises the expected mean squared deviation of the agent from the reference point and the cost of requesting a measurement inside the denied region. The goal is to characterize the trade-off between paying to access the state immediately and waiting for a free measurement that occurs when the agent is carried outside the denied region by the accrued effect of the disturbances over time. We show that the analysis of this problem simplifies by recasting it as a renewal reward process, for which the maximum wait time between the most recent renewal and a measurement request parametrizes all policies. Our analysis concerning wait-time optimization enabled us to establish conditions under which any local minimum (if it exists) is also global within a pre-specified interval, thus facilitating the search for a minimizer. Our results are discussed for the cases in which the agent's loci are the integers or a finite-dimensional Euclidean space. (C) 2018 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.
机译:考虑到受外源干扰影响的颗粒状物质试图保持尽可能接近参考点。它的状态随着离散时间的马尔可夫决策过程的发展而变化,致动工作是无成本的。在拒绝环境中,必须要求进行状态测量且该环境价格昂贵,因此将参考点封闭起来。无需请求即可免费提供拒绝区域之外的测量。在没有测量的情况下不应用任何控件。在每个时间步长,代理都有权决定是像随机行走一样漫游还是请求测量并使用它来朝参考点移动。本文研究了使目标函数最小化的测量请求策略,该目标函数包括代理与参考点的期望均方差以及在拒绝区域内请求测量的成本。目的是表征在付费以立即访问国家与等待免费测量之间的权衡,该自由测量是随着时间的推移,由于干扰的累积影响而将代理带到拒绝的区域之外时发生的。我们显示,通过将其重铸为续订奖励过程可以简化此问题的分析,为此,最近一次续订与度量请求之间的最大等待时间将所有策略参数化。我们对等待时间优化的分析使我们能够建立条件,在该条件下,任何局部最小值(如果存在)在预定的间隔内也是全局的,从而有利于寻找最小化方法。对于代理位置为整数或有限维欧几里得空间的情况,我们讨论了我们的结果。 (C)2018富兰克林研究所。由Elsevier Ltd.出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号