...
首页> 外文期刊>SIGKDD explorations >Quantifying Long Range Dependence in Language and User Behavior to improve RNNs
【24h】

Quantifying Long Range Dependence in Language and User Behavior to improve RNNs

机译:量化语言和用户行为中的长距离依赖性,以改善RNN

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Characterizing temporal dependence patterns is a critical step in understanding the statistical properties of sequential data. Long Range Dependence (LRD) - referring to long-range correlations decaying as a power law rather than exponentially w.r.t. distance - demands a different set of tools for modeling the underlying dynamics of the sequential data. While it has been widely conjectured that LRD is present in language modeling and sequential recommendation, the amount of LRD in the corresponding sequential datasets has not yet been quantified in a scalable and model-independent manner. We propose a principled estimation procedure of LRD in sequential datasets based on established LRD theory for real-valued time series and apply it to sequences of symbols with million-itemscale dictionaries. In our measurements, the procedure estimates reliably the LRD in the behavior of users as they write Wikipedia articles and as they interact with YouTube. We further show that measuring LRD better informs modeling decisions in particular for RNNs whose ability to capture LRD is still an active area of research. The quantitative measure informs new Evolutive Recurrent Neural Networks (EvolutiveRNNs) designs, leading to state-of-the-art results on language understanding and sequential recommendation tasks at a fraction of the computational cost.
机译:表征时间依赖模式是理解顺序数据的统计特性的关键步骤。远程依赖(LRD) - 参考作为权力律衰减的远程相关性,而不是指数为W.R.T.距离 - 要求为模拟顺序数据的底层动态进行建模不同的工具。虽然已被广泛猜测LRD在语言建模和顺序推荐中,但相应的顺序数据集中的LRD的量尚未以可扩展和独立于模型的方式量化。我们提出了基于已建立的LRD理论的序列数据集的LRD中的LRD的原则性估算程序,并将其应用于百万仪器词典的符号序列。在我们的测量中,程序在用户编写维基百科文章时可靠地估计LRD,并按照与YouTube互动。我们进一步表明,测量LRD更好地通知建模决策,特别是对于捕获LRD的能力仍然是一个有效的研究领域。定量措施通知新的演化经常性神经网络(Evolutivernns)设计,导致最先进的导语言理解和顺序推荐任务,以计算成本的一小部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号