首页> 外文会议>ACM international conference on information and knowledge management >Summarizing Web Forum Threads based on a Latent Topic Propagation Process
【24h】

Summarizing Web Forum Threads based on a Latent Topic Propagation Process

机译:基于潜在主题传播过程总结Web论坛主题

获取原文

摘要

With an increasingly amount of information in web forums, quick comprehension of threads in web forums has become a challenging research problem. To handle this issue, this paper investigates the task of Web Forum Thread Summarization (WFTS), aiming to give a brief statement of each thread that involving multiple dynamic topics. When applied to the task of WFTS. traditional summarization methods are cramped by topic dependencies, topic drifting and text sparseness. Consequently, we explore an unsuper-vised topic propagation model in this paper, the Post Propagation Model (PPM), to burst through these problems by simultaneously modeling the semantics and the reply relationship existing in each thread. Each post in PPM is considered as a mixture of topics, and a product of Dirichlet distributions in previous posts is employed to model each topic dependencies during the asynchronous discussion. Based on this model, the task of WFTS is accomplished by extracting most significant sentences in a thread. The experimental results on two different forum data sets show that WFTS based on the PPM outperforms several state-of-the-art summarization methods in terms of ROUGE metrics.
机译:随着网络论坛中信息的日益增多,快速理解网络论坛中的话题已成为一个具有挑战性的研究问题。为了解决此问题,本文研究了Web论坛线程摘要(WFTS)的任务,旨在对涉及多个动态主题的每个线程进行简要说明。当应用于WFTS的任务时。传统的摘要方法受限于主题依存关系,主题漂移和文本稀疏性。因此,我们在本文中探索了一个无监督的主题传播模型,即后期传播模型(PPM),通过同时对每个线程中存在的语义和答复关系进行建模来突破这些问题。 PPM中的每个帖子都被认为是主题的混合,并且在异步讨论期间,以前的帖子中的Dirichlet分布产品被用来对每个主题的依赖性进行建模。基于此模型,通过提取线程中最重要的句子来完成WFTS的任务。在两个不同的论坛数据集上的实验结果表明,就ROUGE指标而言,基于PPM的WFTS优于几种最新的汇总方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号