Proceedings: Natural Language Understanding and Intelligent Applications

Cascaded LSTMs Based Deep Reinforcement Learning for Goal-Driven Dialogue


Abstract

This paper proposes a deep neural network model for jointly modeling Natural Language Understanding and Dialogue Management in goal-driven dialogue systems. The model has three parts. A Long Short-Term Memory (LSTM) at the bottom of the network encodes the utterances of each dialogue turn into a turn embedding. Dialogue embeddings are learned by an LSTM in the middle of the network and are updated as the turn embeddings are fed in. The top part is a feed-forward Deep Neural Network that converts dialogue embeddings into Q-values for the different dialogue actions. The cascaded-LSTM-based reinforcement learning network is jointly optimized using the rewards received at each dialogue turn as the only supervision information; there is no explicit NLU module or dialogue state in the network. Experimental results show that our model outperforms both a traditional Markov Decision Process (MDP) model and a single LSTM with a Deep Q-Network on meeting-room booking tasks. Visualization of the dialogue embeddings illustrates that the model can learn a representation of the dialogue state.
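
The abstract describes a three-level architecture: a turn-level LSTM that encodes each utterance, a dialogue-level LSTM that accumulates turn embeddings into a dialogue embedding, and a feed-forward head that outputs Q-values over dialogue actions. The following is a minimal PyTorch sketch of such a network; the class name, layer dimensions, vocabulary size, and action inventory are illustrative assumptions, not details taken from the paper.

# Minimal sketch of a cascaded-LSTM Q-network (hyperparameters are assumptions).
import torch
import torch.nn as nn

class CascadedLSTMQNetwork(nn.Module):
    def __init__(self, vocab_size, embed_dim=64, turn_dim=128,
                 dialogue_dim=128, num_actions=10):
        super().__init__()
        self.word_embedding = nn.Embedding(vocab_size, embed_dim)
        # Bottom LSTM: encodes the word sequence of one turn into a turn embedding.
        self.turn_lstm = nn.LSTM(embed_dim, turn_dim, batch_first=True)
        # Middle LSTM: consumes the sequence of turn embeddings and keeps
        # the dialogue embedding (an implicit dialogue state).
        self.dialogue_lstm = nn.LSTM(turn_dim, dialogue_dim, batch_first=True)
        # Top feed-forward network: maps the dialogue embedding to Q-values
        # over the system's dialogue actions.
        self.q_head = nn.Sequential(
            nn.Linear(dialogue_dim, dialogue_dim),
            nn.ReLU(),
            nn.Linear(dialogue_dim, num_actions),
        )

    def forward(self, turns):
        # turns: list of LongTensors, each of shape (num_words,) for one turn.
        turn_embeddings = []
        for words in turns:
            emb = self.word_embedding(words.unsqueeze(0))  # (1, T, embed_dim)
            _, (h_n, _) = self.turn_lstm(emb)              # h_n: (1, 1, turn_dim)
            turn_embeddings.append(h_n.squeeze(0))         # (1, turn_dim)
        seq = torch.stack(turn_embeddings, dim=1)          # (1, num_turns, turn_dim)
        _, (d_n, _) = self.dialogue_lstm(seq)              # d_n: (1, 1, dialogue_dim)
        return self.q_head(d_n.squeeze(0))                 # (1, num_actions)

if __name__ == "__main__":
    net = CascadedLSTMQNetwork(vocab_size=1000)
    dialogue = [torch.randint(0, 1000, (5,)), torch.randint(0, 1000, (7,))]
    q_values = net(dialogue)
    # A greedy policy would take the argmax over Q-values; training would use
    # DQN-style updates driven by the per-turn rewards described in the abstract.
    print(q_values.argmax(dim=-1))

In this sketch the final hidden state of each LSTM serves as the turn or dialogue embedding, so no explicit NLU output or hand-crafted dialogue state appears anywhere in the pipeline, matching the end-to-end setup the abstract describes.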
