首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification
【24h】

MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification

机译:MGNC-CNN:一种用于句子分类的利用多个单词嵌入的简单方法

获取原文

摘要

We introduce a novel, simple convolution neural network (CNN) architecture - multi-group norm constraint CNN (MGNC-CNN) - that capitalizes on multiple sets of word embeddings for sentence classification. MGNC-CNN extracts features from input embedding sets independently and then joins these at the penultimate layer in the network to form a final feature vector. We then adopt a group regularization strategy that differentially penalizes weights associated with the subcomponents generated from the respective embedding sets. This model is much simpler than comparable alternative architectures and requires substantially less training time. Furthermore, it is flexible in that it does not require input word embeddings to be of the same dimensionality. We show that MGNC-CNN consistently outperforms baseline models.
机译:我们介绍了一种新颖,简单的卷积神经网络(CNN)架构-多组范式约束CNN(MGNC-CNN)-该架构利用多组单词嵌入进行句子分类。 MGNC-CNN独立地从输入嵌入集中提取特征,然后在网络的倒数第二层将它们结合起来,以形成最终的特征向量。然后,我们采用组正则化策略,该策略对与从各个嵌入集生成的子组件相关联的权重进行差分惩罚。该模型比类似的替代体系结构简单得多,并且所需的培训时间大大减少。此外,它的灵活性在于它不需要输入单词嵌入具有相同的维数。我们显示MGNC-CNN始终优于基线模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号