Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Abstract

We address the problem of speech act recognition (SAR) in asynchronous conversations (forums, emails). Unlike synchronous conversations (e.g., meetings, phone), asynchronous domains lack large labeled datasets to train an effective SAR model. In this paper, we propose methods to effectively leverage abundant unlabeled conversational data and the available labeled data from synchronous domains. We carry out our research in three main steps. First, we introduce a neural architecture based on hierarchical LSTMs and conditional random fields (CRF) for SAR, and show that our method outperforms existing methods when trained on in-domain data only. Second, we improve our initial SAR models by semi-supervised learning in the form of pretrained word embeddings learned from a large unlabeled conversational corpus. Finally, we employ adversarial training to improve the results further by leveraging the labeled data from synchronous domains and by explicitly modeling the distributional shift in two domains.
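
As a rough illustration of the CRF component named in the abstract, the sketch below shows Viterbi decoding over per-sentence label scores in a linear-chain CRF. It is a generic textbook routine and not the authors' implementation: the function name `viterbi_decode`, the score layout, and the toy numbers are assumptions made for this example, and the per-sentence scores that a hierarchical LSTM encoder would produce are simply passed in as plain lists.

```python
from typing import List, Sequence


def viterbi_decode(emissions: Sequence[Sequence[float]],
                   transitions: Sequence[Sequence[float]]) -> List[int]:
    """Return the highest-scoring label sequence for a linear-chain CRF.

    emissions[t][k]   : score of label k for sentence t (assumed to come
                        from some encoder; given directly here).
    transitions[j][k] : score of moving from label j to label k.
    """
    if not emissions:
        return []
    num_labels = len(emissions[0])
    # best[k] = best score of any path that ends in label k at the current step
    best = list(emissions[0])
    backpointers: List[List[int]] = []
    for t in range(1, len(emissions)):
        new_best = []
        pointers = []
        for k in range(num_labels):
            # Choose the previous label that maximises the score of reaching k.
            prev = max(range(num_labels),
                       key=lambda j: best[j] + transitions[j][k])
            new_best.append(best[prev] + transitions[prev][k] + emissions[t][k])
            pointers.append(prev)
        best = new_best
        backpointers.append(pointers)
    # Trace the best path backwards from the best final label.
    last = max(range(num_labels), key=lambda k: best[k])
    path = [last]
    for pointers in reversed(backpointers):
        last = pointers[last]
        path.append(last)
    path.reverse()
    return path


# Toy example: three sentences, two hypothetical speech act labels (0 and 1).
scores = [[3.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
no_preference = [[0.0, 0.0], [0.0, 0.0]]
sticky = [[0.0, -5.0], [-5.0, 0.0]]  # penalise switching labels
print(viterbi_decode(scores, no_preference))
print(viterbi_decode(scores, sticky))
```

With neutral transition scores the decoder follows the per-sentence scores; with the "sticky" transition matrix it prefers a sequence that keeps one label throughout, which is the kind of inter-sentence dependency a CRF layer is meant to capture.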

Bibliographic record

Source
Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies | 2019 | pp. 1326-1336 | 11 pages
Authors
Tasnim Mohiuddin; Thanh-Tung Nguyen; Shafiq Joty;
Original format: PDF
Date indexed: 2022-08-26 14:35:07

Similar literature

1. Robust Speech Recognition Based on Structured Modeling, Irrelevant Variability Normalization and Unsupervised Online Adaptation [J] . Qiang Huo 電子情報通信学会技術研究報告 . 2008, No. 551

2. Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach [J] . Shafiq Joty, Tasnim Mohiuddin Computational Linguistics . 2018, No. 4

3. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern Recognition Letters . 2017, No. octa15

4. Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation [C] . Tasnim Mohiuddin, Thanh-Tung Nguyen, Shafiq Joty Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2019

5. Acoustic model and adaptation for automatic speech recognition and animal vocalization classification. [D] . Tao, Jidong. 2009

6. Hierarchical temporal structure in music, speech and animal vocalizations: jazz is like a conversation, humpbacks sing like hermit thrushes [O] . Christopher T. Kello, Simone Dalla Bella, Butovens Médé, 2017

7. Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation [O] . Tasnim Mohiuddin, Thanh-Tung Nguyen, Shafiq Joty 2019

