首页> 外文会议>AAAI Workshop >Dialog Act Modeling for Conversational Speech
【24h】

Dialog Act Modeling for Conversational Speech

机译:对话动作建模对话演讲

获取原文
获取外文期刊封面目录资料

摘要

We describe an integrated approach for statistical modeling of discourse structure for natural conversational speech. Our model is based on 42 'dialog acts' (e.g., Statement, Question, Backchannel, Agreement, Disagreement, Apology), which were hand-labeled in 1155 conversations from the Switchboard corpus of spontaneous human-to-human telephone speech. We developed several models and algorithms to automatically detect dialog acts from transcribed or automatically recognized words and from prosodic properties of the speech signal, and by using a statistical discourse grammar. All of these components were probabilistic in nature and estimated from data, employing a variety of techniques (hidden Markov models, N-gram language models, maximum entropy estimation, decision tree classifiers, and neural networks). In preliminary studies, we achieved a dialog act labeling accuracy of 65% based on recognized words and prosody, and an accuracy of 72% based on word transcripts. Since humans achieve 84% on this task (with chance performance at 35%) we find these results encouraging.
机译:我们描述了自然对话语音话语结构的统计建模的综合方法。我们的模型是基于42“对话框作用”(例如,语句,问题,反向信道协议,分歧,道歉),这是手标记在从自发人对人的电话语音的总机语料库1155个对话。我们开发了几个模型和算法,从转录自动检测对话框行为或自动识别单词和从语音信号的韵律性,并采用统计话语语法。所有这些组件在本质上是概率和数据估算,采用多种技术(隐马尔可夫模型,N-gram语言模型,最大熵估计,决策树分类,以及神经网络)。在初步研究中,我们实现了65%的对话行为贴标精度基于公认的文字和韵律,和72%的基础上字成绩单的精度。由于人类对这项任务达到84%(含35%的几率性能),我们发现这些结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号