首页> 外文学位 >Collaborative execution of exploration and tracking using move value estimation for robot teams (MVERT).
【24h】

Collaborative execution of exploration and tracking using move value estimation for robot teams (MVERT).

机译:使用机器人团队的移动值估计(MVERT)协作执行探索和跟踪。

获取原文
获取原文并翻译 | 示例

摘要

This work presents Move Value Estimation for Robot Teams (MVERT), an architecture specifically designed for selecting low-level actions for multi-agent teams. The design goal for MVERT is to produce reasonable performance that takes advantage of a heterogeneous team while maintaining computational efficiency. MVERT is fully distributed---each agent selects actions based on its knowledge and knowledge provided it by teammates. Each robot approximates the expected next-step teammate contributions and, given these predictions, each robot can select its action to maximize the team's progress. MVERT represents progress with mathematical value functions that map state and robot task performance models to a numerical value representing mission utility.; Many action selection approaches (optimal trajectory planning, for example) in large state-spaces may be computationally prohibitive, particularly for online mission replanning. However, taking advantage of a team's multi-agent nature to provide efficiency requires consideration of teammate contributions. Thus, in selecting an action with MVERT, each robot approximates the next-step contributions of teammates by applying their sensing models, task capabilities, and current poses in the value functions. The robot then evaluates its candidate actions by applying the value functions and its own sensor models. The action resulting in the overall highest-valued pose is selected and executed.; Performance in each task is described by an individual value function. State includes current locations of teammates and objects in the environment. Performance models include task capabilities and sensor models. Value functions may be any mathematical representation of task performance. To determine an actions' overall value, independent task values are combined by weighted average. Weighting each task's value allows prioritizing tasks in accordance with desired performance. As progress reduces potential for improving value on some tasks, the weights automatically shift focus to the other tasks. Weights can be dynamically adapted as mission needs change.; MVERT has been applied in simulation and on physical robots for mapping, dynamic target tracking, and complex multi-task missions (planetary exploration). MVERT improves team mission performance time and completeness compared to individual action selection and greatly improves computation time compared to a one-step optimal. MVERT produces contextually appropriate actions for successfully performing complex multi-task, multi-robot missions.
机译:这项工作介绍了机器人团队的移动价值估算(MVERT),该体系结构专门设计用于为多主体团队选择低级动作。 MVERT的设计目标是在保持计算效率的同时,利用异构团队的优势产生合理的性能。 MVERT是完全分布式的-每个代理根据其知识和队友提供的知识来选择操作。每个机器人都会估算出预期的下一步队友贡献,并且在给出这些预测的情况下,每个机器人都可以选择其动作来最大化团队的进步。 MVERT用数学值函数表示进度,该数学值函数将状态和机器人任务绩效模型映射到表示任务实用程序的数值。在大型状态空间中,许多动作选择方法(例如,最佳轨迹规划)在计算上可能是禁止的,特别是对于在线任务重新规划。但是,要利用团队的多代理性质来提高效率,需要考虑队友的贡献。因此,在使用MVERT选择动作时,每个机器人都通过在值函数中应用其感知模型,任务能力和当前姿势来估算队友的下一步贡献。然后,机器人通过应用值函数和自己的传感器模型来评估其候选动作。选择并执行导致总体上价值最高的姿势的动作。每个任务的绩效由一个单独的价值函数来描述。状态包括队友和环境中对象的当前位置。性能模型包括任务能力和传感器模型。价值函数可以是任务绩效的任何数学表示。为了确定操作的总体价值,将独立的任务价值与加权平均值相结合。对每个任务的值进行加权可以根据所需性能对任务进行优先级排序。随着进步减少了某些任务的价值提高潜力,权重会自动将重点转移到其他任务上。权重可以根据任务需求的变化而动态调整。 MVERT已应用于仿真和物理机器人中,用于制图,动态目标跟踪和复杂的多任务任务(行星探测)。与单个动作选择相比,MVERT改善了团队任务的执行时间和完整性,与单步优化相比,极大地缩短了计算时间。 MVERT会根据上下文进行适当的操作,以成功执行复杂的多任务,多机器人任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号