首页> 外文会议>IEEE Workshop on Spoken Language Technology >Multimodal deep neural nets for detecting humor in TV sitcoms
【24h】

Multimodal deep neural nets for detecting humor in TV sitcoms

机译:用于检测电视情景喜剧幽默的多模式深度神经网络

获取原文

摘要

We propose a novel approach of combining acoustic and language features to predict humor in dialogues with a deep neural network. We analyze data from three popular TV-sitcoms whose canned laughters give an indication of when the audience would react. We model the setup-punchline sequential relation of conversational humor with a Long Short-Term Memory network, with utterance encodings obtained from two Convolutional Neural Networks, one to model word-level language features and the other to model frame-level acoustic and prosodic features. Our neural network framework is able to improve the F-score of over 5% over a Conditional Random Field baseline trained on a similar acoustic and language feature combination, achieving a much higher recall. It is also more effective over a language features-only setting, with a F-score of 10% higher. It also has a good generalization performance, reaching in most cases precision values of over 70% when trained and tested over different sitcoms.
机译:我们提出了一种结合声音和语言特征来预测与深层神经网络对话的幽默的新颖方法。我们分析了来自三个受欢迎的电视连续剧的数据,它们的罐装笑声表明了观众何时会做出反应。我们使用一个长短期记忆网络对会话幽默的建立-点线顺序关系进行建模,并使用从两个卷积神经网络获得的话语编码,一个用于建模单词级语言特征,另一个用于建模帧级声学和韵律特征。我们的神经网络框架能够在以类似的声学和语言特征组合训练的条件随机场基准上,将F分数提高5%以上,从而实现更高的召回率。与仅使用语言功能的设置相比,它的效果也更好,F分数高10%。它还具有良好的泛化性能,在不同情景喜剧中经过培训和测试后,在大多数情况下,其精度值超过70%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号