首页>
外国专利>
Behavior information learning device, behavior information optimization system, and behavior information learning program
Behavior information learning device, behavior information optimization system, and behavior information learning program
展开▼
机译:行为信息学习装置,行为信息优化系统和行为信息学习程序
展开▼
页面导航
摘要
著录项
相似文献
摘要
To perform reinforcement learning that enables selecting action information for shortening a cycle time while also avoiding the occurrence of overheating. An action information learning device (300) includes: a state information acquisition means (310) for acquiring state information including an operation pattern of a spindle and a combination of parameters related to machining of a machine tool (100); an action information output means (320) for outputting action information including adjustment information for the operation pattern and the combination of parameters included in the state information; a reward calculation means (333) for acquiring judgment information which is information for temperature of the machine tool (100) and a machining time related to the machining of the machine tool (100), and calculating a value of a reward for reinforcement learning based on the judgment information thus acquired; and a value function update means (332) for updating a value function by performing the reinforcement learning based on the value of the reward, the state information and the action information.
展开▼