首页> 外文会议>International conference on agents artificial intelligence >Text Classification and Transfer Learning Based on Character-Level Deep Convolutional Neural Networks
【24h】

Text Classification and Transfer Learning Based on Character-Level Deep Convolutional Neural Networks

机译:基于字符级深度卷积神经网络的文本分类与迁移学习

获取原文

摘要

Temporal (one-dimensional) Convolutional Neural Network (Temporal CNN, ConvNet) is an emergent technology for text understanding. The input for the ConvNets could be either a sequence of words or a sequence of characters. In the latter case there are no needs for natural language processing. Past studies showed that the character-level ConvNets worked well for text classification in English and romanized Chinese corpus. In this article we apply the character-level ConvNets to Japanese corpus. We confirmed that meaningful representations are extracted by the ConvNets in English corpus and Japanese corpus. We attempt to reuse the meaningful representations that are learned in the ConvNets from a large-scale dataset in the form of transfer learning. As for the application to the news categorization and the sentiment analysis tasks in Japanese corpus, the ConvNets outperformed N-gram-based classifiers. In addition, our ConvNets transfer learning frameworks worked well for a task which is similar to one used for pre-training.
机译:时间(一维)卷积神经网络(Temporal CNN,ConvNet)是一种用于文本理解的新兴技术。 ConvNets的输入可以是单词序列或字符序列。在后一种情况下,不需要自然语言处理。过去的研究表明,字符级的ConvNets可以很好地用于英语和罗马化中文语料库的文本分类。在本文中,我们将字符级的ConvNets应用于日语语料库。我们确认,ConvNets提取了英语语料库和日语语料库中有意义的表示形式。我们尝试以转移学习的形式重用在ConvNets中从大规模数据集中学习的有意义的表示形式。至于在日语语料库中的新闻分类和情感分析任务中的应用,ConvNets优于基于N-gram的分类器。此外,我们的ConvNets转移学习框架可以很好地完成一项任务,该任务类似于用于预培训的任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号