METHOD AND APPARATUS FOR REINFORCEMENT LEARNING TRAINING SESSIONS WITH CONSIDERATION OF RESOURCE COSTING AND RESOURCE UTILIZATION

Abstract

Reinforcement learning enables a framework of information technology assets that includes software elements, computational hardware assets, and/or bundled software and computational hardware systems and products. Successive sessions of an inner-loop reinforcement learning process are directed and monitored by an outer-loop reinforcement learning process, which is designed to reduce financial costs and computational asset requirements and/or to optimize learning time across successive instantiations of inner-loop reinforcement learning training sessions. The framework takes into account the license costs of domain-specific simulators, the usage cost of hardware platforms, and the progress of a particular reinforcement learning training run. It further enables these costs to be reduced so that a neural network can be orchestrated and trained under budget constraints, given the hardware and software licenses available at runtime. These improvements and optimizations may be performed using heuristics and neural network algorithms.
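The outer-loop/inner-loop arrangement described in the abstract can be illustrated with a minimal sketch. This is not the patented method: the outer loop is reduced here to an epsilon-greedy bandit, the inner training session is a random stand-in, and every name (`ResourceConfig`, `run_inner_session`, `outer_loop`) and every cost figure is a hypothetical chosen for illustration only.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class ResourceConfig:
    """Hypothetical resource bundle for one inner-loop training session."""
    name: str
    hardware_cost: float  # assumed hardware usage cost per session (must be > 0 in total)
    license_cost: float   # assumed simulator license cost per session
    speedup: float        # assumed mean learning progress per session

    @property
    def session_cost(self) -> float:
        return self.hardware_cost + self.license_cost


def run_inner_session(config, rng):
    """Stand-in for an inner-loop RL training session; returns noisy progress."""
    return max(0.0, config.speedup + rng.gauss(0.0, 0.1))


def outer_loop(configs, budget, epsilon=0.2, seed=0):
    """Pick a resource bundle per session, favouring progress per unit cost."""
    rng = random.Random(seed)
    value = {c.name: 0.0 for c in configs}  # running mean of progress / cost
    count = {c.name: 0 for c in configs}
    spent, progress, history = 0.0, 0.0, []
    while True:
        # Budget constraint: only bundles that still fit the remaining budget.
        affordable = [c for c in configs if spent + c.session_cost <= budget]
        if not affordable:
            break
        if rng.random() < epsilon:
            choice = rng.choice(affordable)                        # explore
        else:
            choice = max(affordable, key=lambda c: value[c.name])  # exploit
        gained = run_inner_session(choice, rng)
        reward = gained / choice.session_cost
        count[choice.name] += 1
        value[choice.name] += (reward - value[choice.name]) / count[choice.name]
        spent += choice.session_cost
        progress += gained
        history.append(choice.name)
    return spent, progress, history


if __name__ == "__main__":
    bundles = [
        ResourceConfig("cpu", hardware_cost=1.0, license_cost=0.5, speedup=0.3),
        ResourceConfig("gpu", hardware_cost=4.0, license_cost=0.5, speedup=1.5),
    ]
    spent, progress, history = outer_loop(bundles, budget=30.0)
    print(f"spent={spent:.1f} progress={progress:.2f} sessions={len(history)}")
```

The reward used here, progress divided by session cost, is only one possible way to combine license cost, hardware cost, and training progress; the abstract does not specify how the outer loop weighs them.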

Bibliographic data

Publication number: US2020265302A1

Patent type:
Publication date: 2020-08-20

Original format: PDF
Applicant/Assignee: SUMIT SANYAL; ANIL HEBBAR; ABDUL PULIYADAN KUNNIL MUNEER; ABHINAV KAUSHIK; BHARAT KUMAR PADI; JEROEN BÉDORF; TIJMEN TIELEMAN

Application number: US201916278699
Inventors: SUMIT SANYAL; ANIL HEBBAR; ABDUL PULIYADAN KUNNIL MUNEER; ABHINAV KAUSHIK; BHARAT KUMAR PADI; JEROEN BÉDORF; TIJMEN TIELEMAN

Filing date: 2019-02-18
Classification: G06N3/08; G06N3/04
Country: US
Database entry time: 2022-08-21 11:24:40
