【24h】

Utilizing Previous Weight Estimates for Accelerated Training

机译:利用先前的体重估算值进行加速训练

获取原文

摘要

It is easily seen that if the error function is quadratic anda linear combination of several previos gradients is the zero vector, then the same linear combination of the weights at those points yields the minimizing weight. The former condition can be made highly likely by considering one more gradient than the dimension of the weight vector. From this simple observation, we first generalize to the case of linear combinations of fewer gradients than the dimension of the weight vector, and then to a general algorithm pplicable to non-quadratic error functions which permits us to calculate the coefficients of the linear combination and control their number in a computationally inexpensive manner. A hybrid of this algorithm iwht the traditional backpropagation algorithm is seen to yield substantial reduction in computation time over the backpropagation algorithm alone.
机译:很容易看出,如果误差函数是二次函数,并且几个previos梯度的线性组合是零向量,那么在这些点处的权重的相同线性组合会产生最小的权重。通过考虑比权重向量的维数多的梯度,可以使前一种情况变得很有可能。从这个简单的观察中,我们首先推广到线性组合的情况,该线性组合的梯度小于权重向量的维数,然后推广到适用于非二次误差函数的通用算法,该算法允许我们计算线性组合的系数和以计算便宜的方式控制它们的数量。与单独的反向传播算法相比,该算法与传统反向传播算法的混合可显着减少计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号