SITS: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations

Abstract

One of the key tasks for analyzing conversational data is segmenting it into coherent topic segments. However, most models of topic segmentation ignore the social aspect of conversations, focusing only on the words used. We introduce a hierarchical Bayesian nonparametric model, Speaker Identity for Topic Segmentation (SITS), that discovers (1) the topics used in a conversation, (2) how these topics are shared across conversations, (3) when these topics shift, and (4) a person-specific tendency to introduce new topics. We evaluate against current unsupervised segmentation models to show that including person-specific information improves segmentation performance on meeting corpora and on political debates. Moreover, we provide evidence that SITS captures an individual's tendency to introduce new topics in political contexts, via analysis of the 2008 US presidential debates and the television program Crossfire.

Bibliographic details

Source
Annual Meeting of the Association for Computational Linguistics (ACL 2012) | 2012 | pp. 78-87 | 10 pages
Authors
Viet-An Nguyen; Jordan Boyd-Graber; Philip Resnik
Original format
PDF
CLC classification
Programming, software engineering
