
Machine Learning for Intelligent Control: Application of Reinforcement Learning Techniques to the Development of Flight Control Systems for Miniature UAV Rotorcraft


Abstract

This thesis investigates the possibility of using reinforcement learning (RL) techniques to create a flight controller for a quadrotor Micro Aerial Vehicle (MAV).

A capable flight control system is a core requirement of any unmanned aerial vehicle. The challenging and diverse applications in which MAVs are destined to be used mean that considerable time and effort need to be put into designing and commissioning suitable flight controllers. It is proposed that reinforcement learning, a subset of machine learning, could be used to address some of these practical difficulties.

While much research has delved into RL in unmanned aerial vehicle applications, this work has tended to ignore low-level motion control, or been concerned only with off-line learning regimes. This thesis addresses an area in which accessible information is scarce: the performance of RL when used for on-policy motion control.

Trying out a candidate algorithm on a real MAV is a simple but expensive proposition. In place of such an approach, this research details the development of a suitable simulator environment in which a prototype controller might be evaluated. The inquiry then proposes a possible RL-based control system, utilising the Q-learning algorithm, with an adaptive RBF network providing function approximation.

The operation of this prototypical control system is then tested in detail, to determine both the absolute level of performance which can be expected and the effect which tuning critical parameters of the algorithm has on the functioning of the controller. Performance is compared against a conventional PID controller to maximise the usability of the results by a wide audience. Testing considers behaviour in the presence of disturbances, and run-time changes in plant dynamics.

Results show that, given sufficient learning opportunity, an RL-based control system performs as well as a simple PID controller. However, unstable behaviour during learning is an issue for future analysis.

Additionally, preliminary testing is performed to evaluate the feasibility of implementing RL algorithms in an embedded computing environment, as a general requirement for a MAV flight controller. Whilst the algorithm runs successfully in an embedded context, observation reveals that further development would be necessary to reduce computation time to a level where a controller was able to update sufficiently quickly for a real-time motion control application.

In summary, the study provides a critical assessment of the feasibility of using RL algorithms for motion control tasks, such as MAV flight control. Advantages which merit interest are exposed, though practical considerations suggest that, at this stage, such a control system is not a realistic proposition. Avenues which may uncover possibilities to surmount these challenges are discussed. This investigation will prove useful for engineers interested in the opportunities which reinforcement learning techniques represent.
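The abstract names the core ingredients of the proposed controller: the Q-learning algorithm with an RBF network supplying function approximation, evaluated against a PID baseline. The sketch below illustrates how Q-learning with RBF features can be wired together for a continuous-state plant. It is only a minimal illustration: the toy double-integrator plant, the fixed (non-adaptive) grid of Gaussian centres, the three-action set and every parameter value are assumptions made for this sketch, not the configuration used in the thesis, where the RBF network is adaptive.

    # Minimal sketch: Q-learning with Gaussian RBF feature approximation
    # on a toy 1-D double-integrator plant (illustrative assumptions only).
    import numpy as np

    rng = np.random.default_rng(0)

    # Discrete action set: a crude stand-in for thrust increments.
    ACTIONS = np.array([-1.0, 0.0, 1.0])

    # Fixed RBF centres tiling the (position, velocity) state space.
    centres = np.array([[p, v] for p in np.linspace(-2, 2, 7)
                                for v in np.linspace(-2, 2, 7)])
    SIGMA = 0.5

    def features(state):
        """Normalised Gaussian RBF activations for a 2-D state."""
        d2 = np.sum((centres - state) ** 2, axis=1)
        phi = np.exp(-d2 / (2 * SIGMA ** 2))
        return phi / (phi.sum() + 1e-9)

    # One weight vector per action: Q(s, a) = w[a] . phi(s)
    w = np.zeros((len(ACTIONS), len(centres)))

    def q_values(state):
        return w @ features(state)

    def step(state, action, dt=0.05):
        """Toy double-integrator plant with a quadratic tracking cost."""
        pos, vel = state
        vel += ACTIONS[action] * dt
        pos += vel * dt
        reward = -(pos ** 2 + 0.1 * vel ** 2)   # penalise deviation from the set point
        done = abs(pos) > 2.5                   # leaving the region ends the episode
        return np.array([pos, vel]), reward, done

    ALPHA, GAMMA, EPSILON = 0.1, 0.98, 0.1

    for episode in range(200):
        state = rng.uniform(-1, 1, size=2)
        for t in range(400):
            # Epsilon-greedy exploration over the discrete action set.
            if rng.random() < EPSILON:
                action = int(rng.integers(len(ACTIONS)))
            else:
                action = int(np.argmax(q_values(state)))
            next_state, reward, done = step(state, action)
            # Q-learning TD update with linear function approximation.
            target = reward + (0.0 if done else GAMMA * np.max(q_values(next_state)))
            td_error = target - q_values(state)[action]
            w[action] += ALPHA * td_error * features(state)
            state = next_state
            if done:
                break

    print("Greedy action at the set point:",
          ACTIONS[int(np.argmax(q_values(np.zeros(2))))])

With linear features of this kind, each learning step costs one feature evaluation and one weighted update of a single weight vector, which is the kind of per-step cost that the embedded-feasibility testing described in the abstract would need to bound.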

Bibliographic record

  • Author

    Hayes, Edwin Laurie

  • Author affiliation
  • Year 2013
  • Total pages
  • Original format PDF
  • Language en
  • CLC classification
  • Date added 2022-08-20 20:17:15
