...
首页> 外文期刊>Data mining and knowledge discovery >Micro-review synthesis for multi-entity summarization
【24h】

Micro-review synthesis for multi-entity summarization

机译:多实体摘要的微观审查合成

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Location-based social networks (LBSNs), exemplified by Foursquare, are fast gaining popularity. One important feature of LBSNs is micro-review. Upon check-in at a particular venue, a user may leave a short review (up to 200 characters long), also known as a tip. These tips are an important source of information for others to know more about various aspects of an entity (e.g., restaurant), such as food, waiting time, or service. However, a user is often interested not in one particular entity, but rather in several entities collectively, for instance within a neighborhood or a category. In this paper, we address the problem of summarizing the tips of multiple entities in a collection, by way of synthesizing new micro-reviews that pertain to the collection, rather than to the individual entities per se. We formulate this problem in terms of first finding a representation of the collection, by identifying a number of "aspects" that link common threads across two or more entities within the collection. We express these aspects as dense subgraphs in a graph of sentences derived from the multi-entity corpora. This leads to a formulation of maximal multi-entity quasi-cliques, as well as a heuristic algorithm to find K such quasi-cliques maximizing the coverage over the multi-entity corpora. To synthesize a summary tip for each aspect, we select a small number of sentences from the corresponding quasi-clique, balancing conciseness and representativeness in terms of a facility location problem. Our approach performs well on collections of Foursquare entities based on localities and categories, producing more representative and diverse summaries than the baselines.
机译:基于位置的社交网络(LBSNS),由Foursquare示例,是快速的受欢迎程度。 LBSN的一个重要特征是微观审查。在特定场地登记时,用户可以留下短暂的评论(最多200个字符),也称为尖端。这些提示是其他人更多地了解有关实体(例如,餐厅)的各个方面的重要信息来源,例如食物,等待时间或服务。然而,用户通常不在一个特定实体中感兴趣,而是共同地在邻居或类别内集体中的若干实体。在本文中,我们通过综合与集合的新微型评论来解决集合中多个实体提示的问题,而不是本身的单个实体。我们通过识别在集合中的两个或更多个实体中链接通用线程的数量的“方面”,在首先找到集合的表示来制定此问题。我们将这些方面表达为从多实体语料库派生的句子图中的密集子图。这导致了最大多实体准批变的制定,以及一种启发式算法,用于找到k这样的准批变,从而最大化多实体语料库的覆盖范围。为了为每个方面综合摘要提示,我们从相应的准集团中选择少数句子,在设施位置问题方面从相应的准分支机构,平衡简洁和代表性。我们的方法在基于地方和类别的Foursquare实体的集合中表现出色,生产比基线更具代表性和多样化的摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号