首页> 外文会议>IEEE Annual Computer Software and Applications Conference >DLCEncDec: A Fully Character-level Encoder-Decoder Model for Neural Responding Conversation
【24h】

DLCEncDec: A Fully Character-level Encoder-Decoder Model for Neural Responding Conversation

机译:dlcencdec:神经响应对话的完全字符级编码器 - 解码器模型

获取原文

摘要

Recent years have witnessed a surge of interest in building conversation systems such as smart agents or chatbots. The most existing generation-based neural responding conversation systems are implemented by RNN Encoder-Decoder framework relying on word-level modeling with explicit segmentation. A word-level model typically maintains a fixed vocabulary, which correspondingly encounters the unknown words and segmentation issues. In this paper, we proposed a fully character-level Encoder-Decoder model DLCEncDec without explicit segmentation for neural responding conversation. DLCEncDec utilizes both of fine-grained character embedding features and coarse-grained n-gram features. Coarse-grained n-gram features are captured by constructing a convolutional layer and a four-layer highway network on the top of the character embeddings. The appearance of out-of-vocabulary words (i.e. unknown words) can be addressed due to the fully character-level operating. We evaluate the DLCEncDec on a Chinese corpus consisting of 4.44 million message-response pairs from Sina Weibo. Experimental results show that our fully character-level model DLCEncDec significantly outperforms baseline models in terms of BLEU and ROUGE.
机译:近年来目睹了在智能代理或聊天诸如建造谈话系统的兴趣激增。最现有的基于生成的神经响应对话系统由RNN编码器 - 解码器框架实现,依赖于具有显式分割的字级建模。单词级模型通常维护固定的词汇表,这相应地遇到未知的单词和分段问题。在本文中,我们提出了一个完全字符级编码器解码器模型DLCENCDEC,而没有用于神经响应对话的显式分段。 DLCENCDEC利用细粒度嵌入功能和粗粒粒度的N-GRAM功能。通过在角色嵌入的顶部构造卷积层和四层公路网络来捕获粗粒粒度的N-GRAM特征。由于完全字符级操作,可以解决词汇外单词(即未知单词)的外观。我们评估了来自新浪微博的444万条消息回复对组成的中文语料库的DLCENCDEC。实验结果表明,我们完全性格级模型DLCENCDEC在Bleu和Rouge方面显着优于基线模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号