首页> 外国专利> Scaling complex models for neural networks

Scaling complex models for neural networks

机译:用于神经网络的缩放复杂模型

摘要

A method for determining the final architecture for a neural network to perform a specific machine learning task is described. The method includes receiving a basic architecture for a neural network, the basic architecture having a network width dimension, a network depth dimension and a resolution dimension; receiving data to define complex coefficients that control additional computational resources used to scale the underlying architecture; performing a search to determine base width, depth, and resolution coefficients, respectively, specifying how to allocate additional computational resources to the network width, depth, and resolution dimensions of the underlying architecture; determining width, depth, and resolution coefficients based on the base width, depth, and resolution coefficients and the composite coefficient; and generating a final architecture that scales the network width, network depth and resolution dimensions of the base architecture based on the corresponding width, depth and resolution coefficients.
机译:描述了一种确定用于执行特定机器学习任务的神经网络的最终架构的方法。该方法包括接收用于神经网络的基本架构,该方法具有网络宽度维度,网络深度维度和分辨率维度的基本架构;接收数据以定义控制用于扩展底层体系结构的附加计算资源的复杂系数;执行搜索以分别确定基础宽度,深度和分辨率系数,指定如何将额外的计算资源分配给底层体系结构的网络宽度,深度和分辨率;基于基础宽度,深度和分辨率系数和复合系数确定宽度,深度和分辨率系数;并生成基于相应的宽度,深度和分辨率系数的基础架构的网络宽度,网络深度和分辨率尺寸的最终架构。

著录项

  • 公开/公告号KR20210105976A

    专利类型

  • 公开/公告日2021-08-27

    原文格式PDF

  • 申请/专利权人 구글 엘엘씨;

    申请/专利号KR20217023377

  • 发明设计人 탄 밍싱;레 꾸억 브이.;

    申请日2020-01-23

  • 分类号G06N3/04;G06N3/063;G06N3/08;

  • 国家 KR

  • 入库时间 2022-08-24 22:19:29

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号