首页> 外文期刊>Automation in construction >Complete coverage path planning using reinforcement learning for Tetromino based cleaning and maintenance robot
【24h】

Complete coverage path planning using reinforcement learning for Tetromino based cleaning and maintenance robot

机译:使用基于Tetromino的清洁和维护机器人的强化学习来完成覆盖范围规划

获取原文
获取原文并翻译 | 示例
           

摘要

Tiling robotics have been deployed in autonomous complete area coverage tasks such as floor cleaning, building inspection, and maintenance, surface painting. One class of tiling robotics, polyomino-based reconfigurable robots, overcome the limitation of fixed-form robots in achieving high-efficiency area coverage by adopting different morphologies to suit the needs of the current environment. Since the reconfigurable actions of these robots are produced by real-time intelligent decisions during operations, an optimal path planning algorithm is paramount to maximize the area coverage while minimizing the energy consumed by these robots. This paper proposes a complete coverage path planning (CCPP) model trained using deep blackreinforcement learning (RL) for the tetromino based reconfigurable robot platform called hTetro to simultaneously generate the optimal set of shapes for any pretrained arbitrary environment shape with a trajectory that has the least overall cost. To this end, a Convolutional Neural Network (CNN) with Long Short Term Memory (LSTM) layers is trained using Actor Critic Experience Replay (ACER) reinforcement learning algorithm. The results are compared with existing approaches which are based on the traditional tiling theory model, including zigzag, spiral, and greedy search schemes. The model is also compared with the Travelling salesman problem (TSP) based Genetic Algorithm (GA) and Ant Colony Optimization (ACO) schemes. The proposed scheme generates a path with lower cost while also requiring lesser time to generate it. The model is also highly robust and can generate a path in any pretrained arbitrary environments.
机译:平铺机器人已部署在自主的完整区域覆盖任务中,例如地板清洁,建筑物检查和维护,表面涂装。一类平铺机器人(基于多米诺骨牌的可重构机器人)通过采用不同的形态来适应当前环境的需要,克服了固定形式的机器人在实现高效区域覆盖方面的局限性。由于这些机器人的可重配置动作是在操作过程中通过实时智能决策产生的,因此最佳路径规划算法对于最大化区域覆盖范围同时最小化这些机器人消耗的能量至关重要。本文针对基于tetromino的可重构机器人平台hTetro提出了一种使用深度黑强化学习(RL)训练的完整覆盖路径规划(CCPP)模型,该模型可同时针对任何预训练的任意环境形状(具有最小的轨迹)同时生成最佳形状集总成本。为此,使用“演员评论体验重播”(ACER)强化学习算法对具有长短期记忆(LSTM)层的卷积神经网络(CNN)进行了训练。将结果与基于传统切片理论模型的现有方法(包括锯齿形,螺旋形和贪婪搜索方案)进行了比较。该模型还与基于旅行商问题(TSP)的遗传算法(GA)和蚁群优化(ACO)方案进行了比较。所提出的方案产生具有较低成本的路径,同时还需要较少的时间来产生它。该模型还非常健壮,可以在任何预先训练的任意环境中生成路径。

著录项

  • 来源
    《Automation in construction》 |2020年第4期|103078.1-103078.11|共11页
  • 作者

  • 作者单位

    Singapore Univ Technol & Design ROAR Lab Engn Prod Dev Singapore 487372 Singapore|Birla Inst Technol & Sci Comp Sci Dept Pilani 333031 Rajasthan India;

    Singapore Univ Technol & Design ROAR Lab Engn Prod Dev Singapore 487372 Singapore;

    Ton Duc Thang Univ Fac Elect & Elect Engn Optoelect Res Grp Ho Chi Minh City 700000 Vietnam;

    Birla Inst Technol & Sci Comp Sci Dept Pilani 333031 Rajasthan India;

    Singapore Univ Technol & Design ROAR Lab Engn Prod Dev Singapore 487372 Singapore|UET Lahore Elect Engn Dept NWL Campus Lahore Pakistan;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Tiling robotics; Cleaning and maintenance; Inspection; Path planing; Reinforcement learning;

    机译:平铺机器人;清洁和维护;检查;路径规划;强化学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号