首页> 外文会议>ACM international conference on information and knowledge management >Summarizing Web Forum Threads based on a Latent Topic Propagation Process
【24h】

Summarizing Web Forum Threads based on a Latent Topic Propagation Process

机译:总结基于潜在主题传播过程的Web论坛线程

获取原文

摘要

With an increasingly amount of information in web forums, quick comprehension of threads in web forums has become a challenging research problem. To handle this issue, this paper investigates the task of Web Forum Thread Summarization (WFTS), aiming to give a brief statement of each thread that involving multiple dynamic topics. When applied to the task of WFTS. traditional summarization methods are cramped by topic dependencies, topic drifting and text sparseness. Consequently, we explore an unsuper-vised topic propagation model in this paper, the Post Propagation Model (PPM), to burst through these problems by simultaneously modeling the semantics and the reply relationship existing in each thread. Each post in PPM is considered as a mixture of topics, and a product of Dirichlet distributions in previous posts is employed to model each topic dependencies during the asynchronous discussion. Based on this model, the task of WFTS is accomplished by extracting most significant sentences in a thread. The experimental results on two different forum data sets show that WFTS based on the PPM outperforms several state-of-the-art summarization methods in terms of ROUGE metrics.
机译:在Web论坛中越来越多的信息,Web论坛中的线程的快速理解已经成为一个具有挑战性的研究问题。为了处理这个问题,本文调查了Web论坛线程摘要(WFT)的任务,旨在介绍涉及多个动态主题的每个线程的简要说明。应用于WFT的任务时。传统的摘要方法是由主题依赖性,主题漂移和文本稀疏的痉挛。因此,我们在本文中探讨了一个令人难过的主题传播模型,通过同时建模了每个线程中存在的语义和应答关系来突发传播模型(PPM),突发通过这些问题。 PPM中的每个帖子被视为主题的混合,并且在异步讨论期间,使用先前帖子中的Dirichlet分布的产品在异步讨论期间模拟每个主题依赖性。基于该模型,通过提取线程中的大多数重要句子来完成WFT的任务。两种不同论坛数据集的实验结果表明,基于PPM的WFT在胭脂指标方面优于几种最先进的摘要方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号