首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation
【24h】

Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

机译:异步会话中语音行为识别的层次结构模型的适应

获取原文

摘要

We address the problem of speech act recognition (SAR) in asynchronous conversations (forums, emails). Unlike synchronous conversations (e.g., meetings, phone), asynchronous domains lack large labeled datasets to train an effective SAR model. In this paper, we propose methods to effectively leverage abundant unlabeled conversational data and the available labeled data from synchronous domains. We carry out our research in three main steps. First, we introduce a neural architecture based on hierarchical LSTMs and conditional random fields (CRF) for SAR, and show that our method outperforms existing methods when trained on in-domain data only. Second, we improve our initial SAR models by semi-supervised learning in the form of pretrained word embeddings learned from a large unla-beled conversational corpus. Finally, we em-ploy adversarial training to improve the results further by leveraging the labeled data from synchronous domains and by explicitly modeling the distributional shift in two domains.
机译:我们解决异步对话(论坛,电子邮件)中的言语行为识别(SAR)问题。与同步对话(例如会议,电话)不同,异步域缺少大型标签数据集来训练有效的SAR模型。在本文中,我们提出了有效利用大量未标记的会话数据和同步域中可用的标记数据的方法。我们通过三个主要步骤进行研究。首先,我们介绍了一种基于层次LSTM和条件随机场(CRF)的SAR神经结构,并证明了仅对域内数据进行训练时,该方法的性能优于现有方法。其次,我们通过从大型无语会话语料库中学习到的预训练词嵌入的形式,通过半监督学习来改进初始SAR模型。最后,我们采用对抗训练来进一步改善结果,方法是利用同步域中的标记数据,并通过对两个域中的分布偏移进行显式建模。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号