首页> 外文学位 >The Effects of Sensor Performance as Modeled by Signal Detection Theory on the Performance of Reinforcement Learning in a Target Acquisition Task.
【24h】

The Effects of Sensor Performance as Modeled by Signal Detection Theory on the Performance of Reinforcement Learning in a Target Acquisition Task.

机译:通过信号检测理论建模的传感器性能对目标获取任务中强化学习性能的影响。

获取原文
获取原文并翻译 | 示例

摘要

Unmanned Aerial Systems (UASs) today are fulfilling more roles than ever before. There is a general push to have these systems feature more advanced autonomous capabilities in the near future. To achieve autonomous behavior requires some unique approaches to control and decision making. More advanced versions of these approaches are able to adapt their own behavior and examine their past experiences to increase their future mission performance. To achieve adaptive behavior and decision making capabilities this study used Reinforcement Learning algorithms. In this research the effects of sensor performance, as modeled through Signal Detection Theory (SDT), on the ability of RL algorithms to accomplish a target localization task are examined. Three levels of sensor sensitivity are simulated and compared to the results of the same system using a perfect sensor. To accomplish the target localization task, a hierarchical architecture used two distinct agents. A simulated human operator is assumed to be a perfect decision maker, and is used in the system feedback. An evaluation of the system is performed using multiple metrics, including episodic reward curves and the time taken to locate all targets. Statistical analyses are employed to detect significant differences in the comparison of steady-state behavior of different systems.
机译:今天的无人机系统(UAS)担当着比以往更多的角色。人们普遍要求在不久的将来使这些系统具有更高级的自治功能。要实现自主行为,需要一些独特的方法来进行控制和决策。这些方法的更高级版本能够适应自己的行为并检查其过去的经验,以提高其未来的任务绩效。为了获得适应性行为和决策能力,本研究使用了强化学习算法。在这项研究中,研究了通过信号检测理论(SDT)建模的传感器性能对RL算法完成目标定位任务的能力的影响。模拟了三个级别的传感器灵敏度,并将它们与使用完美传感器的同一系统的结果进行比较。为了完成目标本地化任务,分层体系结构使用了两个不同的代理。假定模拟的人工操作员是完美的决策者,并将其用于系统反馈中。使用多种指标对系统进行评估,包括情节奖励曲线和定位所有目标所花费的时间。统计分析用于检测不同系统稳态行为比较中的显着差异。

著录项

  • 作者

    Quirion, Nate.;

  • 作者单位

    Embry-Riddle Aeronautical University.;

  • 授予单位 Embry-Riddle Aeronautical University.;
  • 学科 Psychology Industrial.;Sociology Organizational.;Engineering Aerospace.
  • 学位 M.S.H.F.S.
  • 年度 2013
  • 页码 128 p.
  • 总页数 128
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:41:44

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号