
Inverse discounted-based LQR algorithm for learning human movement behaviors



Abstract

Recently, there has been increasing interest in understanding human movement behaviors. One approach is to recover the unknown underlying objective function that a human optimizes while achieving a certain movement behavior. Existing research on behavioral understanding relies solely on predefined optimality criteria, chiefly minimum time, minimum variance, and/or minimum effort. These criteria are assumed constant, i.e., the human is assumed to hold the same preferences throughout the movement. In this paper, by contrast, the optimality criteria underlying the kinematic characteristics of a given human behavior are assumed to be exponentially discounted, to account for changes in the human's preferences that may occur while achieving that behavior. A new Inverse Discounted-based Linear Quadratic Regulator (ID-LQR) algorithm is developed within the Inverse Optimal Control (IOC) framework to recover a discounted cost function that reproduces the measured human behavior. In addition, an incremental version of the ID-LQR algorithm is proposed to continuously refine the cost function learned so far when demonstrations are presented sequentially. Saccadic eye-gaze movement is studied as an example to evaluate both the proposed ID-LQR and incremental ID-LQR approaches. Simulation results are encouraging and show that the saccadic trajectories generated by the ID-LQR approach match the experimental data in many respects, including the position and velocity profiles of saccades. Moreover, when assessed on a subsequent set of scenarios, the incremental ID-LQR algorithm confirms its capability to generalize the retrieved cost function to unseen saccadic demonstrations.
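The abstract does not specify the paper's model or algorithmic details, but the core idea of an exponentially discounted quadratic cost can be sketched. The following is a minimal, hypothetical illustration (not the authors' ID-LQR, which solves the inverse problem): it shows the forward control problem induced by a cost J = Σ_t γ^t (xᵀQx + uᵀRu) and solves it with a backward Riccati recursion, using a toy double-integrator model whose dynamics and weights are placeholders.

```python
import numpy as np

# Illustrative sketch only (not the paper's ID-LQR): the forward problem of a
# finite-horizon LQR whose quadratic cost is exponentially discounted,
#   J = sum_t  gamma**t * (x_t' Q x_t + u_t' R u_t),
# solved by a backward Riccati recursion. Dynamics and weights below are
# hypothetical placeholders, not taken from the paper.

def discounted_lqr_gains(A, B, Q, R, gamma, horizon):
    """Backward Riccati pass; discounting scales the next cost-to-go by gamma."""
    P = Q.copy()
    gains = []
    for _ in range(horizon):
        BtP = B.T @ P
        K = np.linalg.solve(R + gamma * BtP @ B, gamma * BtP @ A)
        P = Q + gamma * A.T @ P @ (A - B @ K)
        gains.append(K)
    return gains[::-1]  # reorder so gains run forward in time

# Toy discrete-time double integrator (placeholder model).
dt = 0.01
A = np.array([[1.0, dt], [0.0, 1.0]])
B = np.array([[0.0], [dt]])
Q = np.diag([1.0, 0.1])   # penalize position error, lightly penalize velocity
R = np.array([[0.01]])    # control-effort weight

gains = discounted_lqr_gains(A, B, Q, R, gamma=0.99, horizon=300)

# Roll the optimal feedback forward from an initial position error.
x = np.array([[1.0], [0.0]])
for K in gains:
    u = -K @ x
    x = A @ x + B @ u
final_pos = float(x[0, 0])  # state is driven toward the origin
```

In the paper's inverse setting, the weights Q, R, and the discount factor γ are the unknowns to be recovered from observed trajectories, rather than given as above.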
