Utterance Censorship of Online Reinforcement Learning Chatbot

机译：在线强化学习聊天聊天的话语审查

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Researchers have applied online deep reinforcement learning in order to enhance the open-domain conversational skills of chatbots. These chatbots have the ability to learn conversations from real users but in practical applications, some users may take advantage of the chatbot's online learning ability to generate offensive responses. In this paper, we introduce an utterance censorship system to check whether the chatbot's utterance is appropriate. If the speech is inappropriate, the censor will block it and give a negative reward to "punish" the chatbot. The censorship system is based on a character-level bidirectional LSTM model, and the chatbot receiving the reward from the censorship system "forgets" the learned offensive utterances. Experimental results show that our proposed architecture enables online learning chatbots to self-purify and that character-level LSTM is more appropriate for the utterance censorship task compared with classical word-level LSTM model.

机译：研究人员应用了在线深度加强学习，以提高聊天域的开放式对话技能。这些Chatbots有能力学习来自真实用户的对话，但在实际应用中，一些用户可能会利用Chatbot的在线学习能力来产生冒犯响应。在本文中，我们介绍了一种话语审查系统来检查Chatbot的话语是否合适。如果演讲是不合适的，审查表将阻止它并给出“惩罚”聊天栏的负面奖励。审查系统基于一个字符级别的双向LSTM模型，以及从审查系统中接收奖励的聊天“忘记”的令人攻击性话语。实验结果表明，我们提出的架构使在线学习聊天聊天，以自我净化，并且与经典单词LSTM模型相比，该字符级LSTM更适合发言权审查任务。

著录项

来源
《IEEE International Conference on Tools with Artificial Intelligence》|2018年|526p|共5页
会议地点
作者
Yixuan Chai; Guohua Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Utterance censorship; Reinforcement learning chatbot; Character-level LSTM;

机译：话语审查;加强学习聊天;字符级LSTM;

相似文献

外文文献
中文文献
专利

1. Learning bi-utterance for multi-turn response selection in retrieval-based chatbots [J] . Shuliang Wang, Dapeng Li, Jing Geng, International Journal of Advanced Robotic Systems . 2019,第2期

机译：在基于检索的聊天机器人中学习双话语以进行多回合响应选择
2. Ensemble-based deep reinforcement learning for chatbots [J] . Cuayahuitl Heriberto, Lee Donghyeon, Ryu Seonghan, Neurocomputing . 2019,第Nova13期

机译：基于集成的聊天机器人深度强化学习
3. Online Multiclass Learning with k-Way Limited Feedback and an Application to Utterance Classification [J] . HIYAN ALSHAWI Machine Learning . 2005,第1a3期

机译：具有k-Way有限反馈的在线多类学习及其在话语分类中的应用
4. Utterance Censorship of Online Reinforcement Learning Chatbot [C] . Yixuan Chai, Guohua Liu IEEE International Conference on Tools with Artificial Intelligence . 2018

机译：在线强化学习聊天聊天的话语审查
5. Data-Driven Online Network Optimization Through Reinforcement Learning [D] . Wang, Yimeng. 2021

机译：数据驱动的在线网络优化通过强化学习
6. Embodied Synaptic Plasticity With Online Reinforcement Learning [O] . Jacques Kaiser, Michael Hoff, Andreas Konle, 2019

机译：在线强化学习实现的突触可塑性
7. Learning bi-utterance for multi-turn response selection in retrieval-based chatbots [O] . Shuliang Wang, Dapeng Li, Jing Geng, 2019

机译：在基于检索的Chatbots中学习用于多转响应选择的双语

Utterance Censorship of Online Reinforcement Learning Chatbot

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅