首页> 外文期刊>International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms >Exploiting User-Generated Content to Enrich Web Document Summarization
【24h】

Exploiting User-Generated Content to Enrich Web Document Summarization

机译:利用用户生成的内容以丰富Web文档摘要

获取原文
获取原文并翻译 | 示例
           

摘要

User-generated content such as comments or tweets (also called by social information) following a Web document provides additional information for enriching the content of an event mentioned in sentences. This paper presents a framework named SoSVMRank, which integrates the user-generated content of a Web document to generate a highquality summarization. In order to do that, the summarization was formulated as a learning to rank task, in which comments or tweets are exploited to support sentences in a mutual reinforcement fashion. To model sentence-comment (or tweet) relation, a set of local and social features are proposed. After ranking, top m ranked sentences and comments (or tweets) are selected as the summarization. To validate the efficiency of our framework, sentence and story highlight extraction tasks were taken as a case study on three datasets in two languages, English and Vietnamese. Experimental results indicate that: (i) our new features improve the summary performance of the framework in term of ROUGE-scores compared to state-of-the-art baselines and (ii) the integration of user-generated content benefits single-document summarization.
机译:用户生成的内容(如评论或推文)之类的内容在Web文档之后提供了用于丰富句子中提到的事件内容的其他信息。本文介绍了一个名为SOSVMRANK的框架,该框架集成了Web文档的用户生成内容以生成高度摘要。为此,将摘要制定为学习对秩任务的学习,其中利用评论或推文以支持相互加强方式的句子。为了模拟句子评论(或推文)关系,提出了一组本地和社会功能。排名后,选择Top M排名句子和注释(或推文)作为摘要。为了验证我们框架的效率,句子和故事突出提取任务是在两种语言,英语和越南语中进行三个数据集的案例研究。实验结果表明:(i)与最先进的基线和(ii)相比,我们的新功能改善了胭脂分数的框架的概要表现,而(ii)用户生成的内容的集成有益于单一文件摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号