首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Effective Use of Word Order for Text Categorization with Convolutional Neural Networks
【24h】

Effective Use of Word Order for Text Categorization with Convolutional Neural Networks

机译:卷积神经网络有效地利用词序进行文本分类

获取原文

摘要

Convolutional neural network (CNN) is a neural network that can make use of the internal structure of data such as the 2D structure of image data. This paper studies CNN on text categorization to exploit the 1D structure (namely, word order) of text data for accurate prediction. Instead of using low-dimensional word vectors as input as is often done, we directly apply CNN to high-dimensional text data, which leads to directly learning embedding of small text regions for use in classification. In addition to a straightforward adaptation of CNN from image to text, a simple but new variation which employs bag-of-word conversion in the convolution layer is proposed. An extension to combine multiple convolution layers is also explored for higher accuracy. The experiments demonstrate the effectiveness of our approach in comparison with state-of-the-art methods.
机译:卷积神经网络(CNN)是可以利用数据的内部结构(例如图像数据的2D结构)的神经网络。本文研究CNN的文本分类,以利用文本数据的一维结构(即单词顺序)进行准确预测。与其像通常那样将低维单词向量用作输入,不如将CNN直接应用于高维文本数据,这导致直接学习嵌入小文本区域以用于分类。除了将CNN从图像直接转换为文本之外,还提出了一种简单而新颖的变体,该变体在卷积层中采用了词袋转换。还探索了组合多个卷积层的扩展,以实现更高的准确性。实验证明了我们的方法与最新技术方法相比的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号