首页> 外国专利> Method and apparatus for pruning deep neural network based Q-learning empirical memory

Method and apparatus for pruning deep neural network based Q-learning empirical memory

机译:基于Q学习经验记忆的深度神经网络修剪方法及装置

摘要

The present technology discards or adds new experiences based on collecting new experiences by the agent, comparing the new experiences with the experiences stored in the memory of the agent, and the comparison. And using the experience to overwrite the in-memory experience. For example, the agent or associated processor can determine how similar the new experience is to the stored experience. If the new experience is overly similar, the agent discards the new experience, otherwise the agent stores the new experience in memory and discards the previously stored experience instead. Do. Collecting experiences and selectively storing these experiences based on the similarity of these experiences to previously stored experiences addresses technical issues and results in multiple technical improvements. For example, memory size constraints are reduced, opportunities for catastrophic forgetting by neural networks are reduced or eliminated, and neural network performance is improved. [Selected figure] Figure 7
机译:本技术基于由代理收集新的经验,将新的经验与存储在代理的存储器中的经验进行比较以及比较,来丢弃或添加新的经验。并使用该经验来覆盖内存中的经验。例如,代理或关联的处理器可以确定新体验与所存储的体验有多相似。如果新的体验过于相似,则代理将放弃新的体验,否则,代理会将新的体验存储在内存中,而丢弃先前存储的体验。做。收集经验并根据这些经验与先前存储的经验的相似性来选择性地存储这些经验可解决技术问题并带来多项技术改进。例如,减少了内存大小约束,减少或消除了神经网络造成灾难性遗忘的机会,并改善了神经网络性能。 [选定图]图7

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号