首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >Utterance Censorship of Online Reinforcement Learning Chatbot
【24h】

Utterance Censorship of Online Reinforcement Learning Chatbot

机译:在线强化学习聊天聊天的话语审查

获取原文
获取外文期刊封面目录资料

摘要

Researchers have applied online deep reinforcement learning in order to enhance the open-domain conversational skills of chatbots. These chatbots have the ability to learn conversations from real users but in practical applications, some users may take advantage of the chatbot's online learning ability to generate offensive responses. In this paper, we introduce an utterance censorship system to check whether the chatbot's utterance is appropriate. If the speech is inappropriate, the censor will block it and give a negative reward to "punish" the chatbot. The censorship system is based on a character-level bidirectional LSTM model, and the chatbot receiving the reward from the censorship system "forgets" the learned offensive utterances. Experimental results show that our proposed architecture enables online learning chatbots to self-purify and that character-level LSTM is more appropriate for the utterance censorship task compared with classical word-level LSTM model.
机译:研究人员应用了在线深度加强学习,以提高聊天域的开放式对话技能。这些Chatbots有能力学习来自真实用户的对话,但在实际应用中,一些用户可能会利用Chatbot的在线学习能力来产生冒犯响应。在本文中,我们介绍了一种话语审查系统来检查Chatbot的话语是否合适。如果演讲是不合适的,审查表将阻止它并给出“惩罚”聊天栏的负面奖励。审查系统基于一个字符级别的双向LSTM模型,以及从审查系统中接收奖励的聊天“忘记”的令人攻击性话语。实验结果表明,我们提出的架构使在线学习聊天聊天,以自我净化,并且与经典单词LSTM模型相比,该字符级LSTM更适合发言权审查任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号