首页> 外文期刊>Neurocomputing >Search engine reinforced semi-supervised classification and graph-based summarization of microblogs
【24h】

Search engine reinforced semi-supervised classification and graph-based summarization of microblogs

机译:搜索引擎增强的半监督分类和基于图的微博摘要

获取原文
获取原文并翻译 | 示例
       

摘要

There is an abundance of information found on microblog services due to their popularity. However the potential of this trove of information is limited by the lack of effective means for users to browse and interpret the numerous messages found on these services. We tackle this problem using a two-step process, first by slicing up the search results of current retrieval systems along multiple possible genres. Then, a summary is generated from the microblog messages attributed to each genre. We believe that this helps users to better understand the possible interpretations of the retrieved results and aid them in finding the information that they need. Our novel approach makes use of automatically acquired information from external search engines in each of these two steps. We first integrate this information with a semi-supervised probabilistic graphical model, and show that this helps us to achieve significantly better classification performance without the need for much training data. Next we incorporate the extra information into graph-based summarization, and demonstrate that superior summaries (up to 30% improvement in ROUGE-1) are obtained.
机译:由于微博服务的普及,因此存在大量信息。但是,由于缺乏有效的方法来使用户浏览和解释在这些服务上找到的大量消息,因此大量信息的潜力受到了限制。我们通过两步过程来解决这个问题,首先是按照多种可能的类型来划分当前检索系统的搜索结果。然后,从归因于每种流派的微博消息中生成摘要。我们认为,这有助于用户更好地理解检索结果的可能解释,并帮助他们找到所需的信息。我们的新颖方法在这两个步骤的每一个步骤中都利用了从外部搜索引擎自动获取的信息。我们首先将此信息与半监督概率图形模型集成在一起,并表明这有助于我们在无需大量训练数据的情况下实现明显更好的分类性能。接下来,我们将额外的信息合并到基于图的摘要中,并证明获得了出色的摘要(ROUGE-1的改进高达30%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号