首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Semantic Title Evaluation and Recommendation Based on Topic Models
【24h】

Semantic Title Evaluation and Recommendation Based on Topic Models

机译:基于主题模型的语义标题评价与推荐

获取原文

摘要

To digest tremendous documents efficiently, people often resort to their titles, which normally provide a concise and semantic representation of main text. Some titles however are misleading due to lexical ambiguity or eye-catching intention. The requirement of reference summaries hampers using traditional lexical summarisation evaluation techniques for title evaluation. In this paper we develop semantic title evaluation techniques by comparing a title with other sentences in terms of topic-based similarity with regard to the whole document. We further give a statistical hypothesis test to check whether a title is favourable without any reference summary. As a byproduct, the top similar sentence can be recommended as a candidate for title. Experiments on patents, scientific papers and DUC'04 benchmarks show our Semantic Title Evaluation and Recommendation technique based on a recent Segmented Topic Model (STERSTM), performs substantially better than that based on the canonical model Latent Dirichlet Allocation (STERLDA). It can also recommend titles with quality comparable with the winners of DUC'04 in terms of summarising documents into very short summaries.
机译:为了有效地消化大量文档,人们经常求助于其标题,该标题通常提供主要文本的简洁和语义表示。但是,某些标题由于词汇上的歧义或引人注目的意图而产生误导。参考摘要的要求妨碍了使用传统词汇摘要评估技术进行标题评估。在本文中,我们通过根据整个主题的基于主题的相似性比较标题和其他句子来开发语义标题评估技术。我们进一步给出了统计假设检验,以检查标题是否有利,而没有任何参考摘要。作为副产品,可以推荐最相似的最高句子作为标题的候选者。在专利,科学论文和DUC'04基准上进行的实验表明,我们的语义标题评估和推荐技术基于最近的分段主题模型(STERSTM),其性能明显优于基于规范模型的潜在狄利克雷分配(STERLDA)。在将文档汇总成非常简短的摘要方面,它也可以推荐质量与DUC'04获奖者相当的书名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号