首页> 外文会议>International Joint Conference on Neural Networks >Modulation of Activation Function in Triangular Recurrent Neural Networks for Time Series Modeling
【24h】

Modulation of Activation Function in Triangular Recurrent Neural Networks for Time Series Modeling

机译:时间序列建模中三角递归神经网络中激活函数的调制

获取原文

摘要

This paper introduces a novel method to dynamically vary the activation function slopes in recently developed upper and lower triangular recurrent neural networks (ULTRNN) to enhance their modeling capability. The ULTRNN employs a pair of triangular feedback weight matrices with block diagonal elements whose eigenvalues are constrained to lie on or close to the unit circle in the complex z-plane to maintain network and training stability. The activation function slopes of the ULTRNN state variables are dynamically varied by a second modulating network. The inputs to the modulating network are the state variables of the principal ULTRNNs and their inputs. The modulating network is trained simultaneously with the principal ULTRNN to compute the activation function slope for the latter’s each state variable at each time step. Such dynamic variation of the activation function slopes selectively enhances the contribution of certain states while suppressing that of the others. A larger slope results in a longer time contribution of the corresponding state and helps model long-term dependencies. Conversely, a smaller slope results in a shorter time contribution and may be used to model controlled "forgetting". The proposed modulation technique enhances the ULTRNN’s ability to effectively incorporate short-term memory and long-term dependencies. Simulation results show that with activation function modulation the ULTRNNs are able to autonomously replicate the outputs of sample chaotic dynamic system with good accuracy. This capability can be highly effective in modeling or characterizing the inherent process that generates the time series.
机译:本文介绍了一种新方法来动态改变最近开发的上下三角递归神经网络(ULTRNN)中的激活函数斜率,以增强其建模能力。 ULTRNN使用一对带有块对角线元素的三角形反馈权重矩阵,其特征值被约束为位于复杂z平面中的单位圆上或附近,以保持网络和训练稳定性。 ULTRNN状态变量的激活函数斜率通过第二个调制网络动态变化。调制网络的输入是主要ULTRNN及其输入的状态变量。调制网络与主要ULTRNN同时进行训练,以在每个时间步为后者的每个状态变量计算激活函数斜率。激活函数斜率的这种动态变化有选择地增强了某些状态的贡献,同时抑制了其他状态的贡献。较大的斜率会导致相应状态的时间贡献较长,并有助于对长期依赖性进行建模。相反,较小的斜率导致较短的时间贡献,并且可以用于对受控的“遗忘”进行建模。提议的调制技术增强了ULTRNN有效合并短期记忆和长期依赖性的能力。仿真结果表明,通过激活函数调制,ULTRNN能够以良好的精度自动复制样本混沌动态系统的输出。此功能在建模或表征生成时间序列的固有过程方面可能非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号