Behavior information learning device, behavior information optimization system, and behavior information learning program

Abstract

The aim is to perform reinforcement learning that makes it possible to select action information that shortens cycle time while avoiding overheating. An action information learning device (300) includes: a state information acquisition means (310) that acquires state information including an operation pattern of a spindle and a combination of parameters related to machining by a machine tool (100); an action information output means (320) that outputs action information including adjustment information for the operation pattern and the combination of parameters included in the state information; a reward calculation means (333) that acquires judgment information, namely the temperature of the machine tool (100) and the machining time of the machining performed by the machine tool (100), and calculates a reward value for reinforcement learning based on that judgment information; and a value function update means (332) that updates a value function by performing reinforcement learning based on the reward value, the state information, and the action information.
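The reward calculation and value function update described above can be illustrated with a minimal sketch. The abstract does not name a specific algorithm or reward formula, so everything here is an assumption made for illustration: tabular Q-learning is used as the update rule, the reward values (-1, 0, +1), the temperature limit, the learning rate `alpha`, the discount factor `gamma`, and the function names `calculate_reward` and `update_value_function` are all hypothetical and are not taken from the patent.

```python
from collections import defaultdict


def calculate_reward(temperature, machining_time, prev_machining_time, temp_limit):
    """Hypothetical reward based on the judgment information.

    Assumption: overheating is penalised first; otherwise the reward
    reflects whether the machining time got shorter, longer, or stayed equal.
    """
    if temperature >= temp_limit:
        return -1.0  # overheating occurred
    if machining_time < prev_machining_time:
        return 1.0   # cycle time shortened without overheating
    if machining_time > prev_machining_time:
        return -1.0  # cycle time got longer
    return 0.0       # no change


def update_value_function(q, state, action, reward, next_state, actions,
                          alpha=0.1, gamma=0.9):
    """One tabular Q-learning step (assumed update rule, not from the source)."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


# Illustrative use: a state is (spindle speed pattern, parameter value),
# an action is an adjustment to apply; all values are made up.
q_table = defaultdict(float)
actions = ("increase_feed", "decrease_feed", "keep")
state = ("pattern_A", 0.5)
next_state = ("pattern_A", 0.6)

reward = calculate_reward(temperature=60.0, machining_time=10.0,
                          prev_machining_time=12.0, temp_limit=70.0)
new_value = update_value_function(q_table, state, "increase_feed",
                                  reward, next_state, actions)
```

In this sketch the overheating check takes priority over the machining-time comparison, so an adjustment that shortens the cycle but overheats the machine is still penalised; the actual device may weight these two factors differently.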