首页> 外文会议>Discovery science >An In-Depth Experimental Comparison of RNTNs and CNNs for Sentence Modeling
【24h】

An In-Depth Experimental Comparison of RNTNs and CNNs for Sentence Modeling

机译:RNTN和CNN句子建模的深度实验比较

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The goal of modeling sentences is to accurately represent their meaning for different tasks. A variety of deep learning architectures have been proposed to model sentences, however, little is known about their comparative performance on a common ground, across a variety of datasets, and on the same level of optimization. In this paper, we provide such a novel comparison for two popular architectures, Recursive Neural Tensor Networks (RNTNs) and Convolutional Neural Networks (CNNs). Although RNTNs have been shown to work well in many cases, they require intensive manual labeling due to the vanishing gradient problem. To enable an extensive comparison of the two architectures, this paper employs two methods to automatically label the internal nodes: a rule-based method and (this time as part of the RNTN method) a convolutional neural network. This enables us to compare these RNTN models to a relatively simple CNN architecture. Experiments conducted on a set of benchmark datasets demonstrate that the CNN outperforms the RNTNs based on automatic phrase labeling, whereas the RNTN based on manual labeling outperforms the CNN. The results corroborate that CNNs already offer good predictive performance and, at the same time, more research on RNTNs is needed to further exploit sentence structure.
机译:为句子建模的目的是准确地表示它们在不同任务中的含义。已经提出了多种深度学习架构来对句子进行建模,但是,基于共同点,跨各种数据集以及在相同的优化级别上,它们的比较性能知之甚少。在本文中,我们对两种流行的体系结构(递归神经张量网络(RNTN)和卷积神经网络(CNN))进行了新颖的比较。尽管已显示RNTN在许多情况下都能很好地工作,但由于梯度消失的问题,它们需要大量的手动标记。为了能够对两种体系结构进行广泛的比较,本文采用了两种方法来自动标记内部节点:基于规则的方法和卷积神经网络(这是RNTN方法的一部分)。这使我们能够将这些RNTN模型与相对简单的CNN架构进行比较。在一组基准数据集上进行的实验表明,基于自动短语标记的CNN优于RNTN,而基于手动标记的RNTN则优于CNN。结果证实了CNN已经提供了良好的预测性能,同时,需要对RNTN进行更多的研究以进一步利用句子结构。

著录项

  • 来源
    《Discovery science》|2017年|144-152|共9页
  • 会议地点 Kyoto(JP)
  • 作者单位

    Institut Fur Informatik, Johannes Gutenberg-Universitat, Mainz, Germany;

    Austrian Research Institute for Artificial Intelligence, Vienna, Austria;

    Institut Fur Informatik, Johannes Gutenberg-Universitat, Mainz, Germany;

    Institut Fur Informatik, Johannes Gutenberg-Universitat, Mainz, Germany;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号