首页> 外文期刊>World Wide Web >MARES: multitask learning algorithm for Web-scale real-time event summarization
【24h】

MARES: multitask learning algorithm for Web-scale real-time event summarization

机译:MARES:用于Web规模的实时事件摘要的多任务学习算法

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic real-time summarization of massive document streams on the Web has become an important tool for quickly transforming theoverwhelming documents into a novel, comprehensive and concise overview of an event for users. Significant progresses have been made in static text summarization. However, most previous work does not consider the temporal features of the document streams which are valuable in real-time event summarization. In this paper, we propose a novel M ultitask learning A lgorithm for Web-scale R eal-time E vent S ummarization (MARES), which leverages the benefits of supervised deep neural networks as well as a reinforcement learning algorithm to strengthen the representation learning of documents. Specifically, MARES consists two key components: (i) A relevance prediction classifier, in which a hierarchical LSTM model is used to learn the representations of queries and documents; (ii) A document filtering model learns to maximize the long-term rewards with reinforcement learning algorithm, working on a shared document encoding layer with the relevance prediction component. To verify the effectiveness of the proposed model, extensive experiments are conducted on two real-life document stream datasets: TREC Real-Time Summarization Track data and TREC Temporal Summarization Track data. The experimental results demonstrate that our model can achieve significantly better results than the state-of-the-art baseline methods.
机译:Web上海量文档流的自动实时汇总已成为一种重要工具,可以将大量文档快速转换为用户事件的新颖,全面而简洁的概述。静态文本摘要已取得重大进展。但是,大多数以前的工作没有考虑文档流的时间特征,这些特征对于实时事件摘要很有用。在本文中,我们提出了一种新颖的多任务学习算法,用于Web规模实时学习出口总结(MARES),它利用了监督深度神经网络以及增强学习算法的优势来增强表示学习文件。具体来说,MARES包含两个关键组件:(i)关联性预测分类器,其中使用分层LSTM模型来学习查询和文档的表示形式; (ii)文档过滤模型通过使用增强学习算法来学习最大化长期奖励,该算法在具有相关性预测组件的共享文档编码层上工作。为了验证所提出模型的有效性,对两个真实文档流数据集进行了广泛的实验:TREC实时摘要跟踪数据和TREC时间摘要跟踪数据。实验结果表明,与最新的基线方法相比,我们的模型可以获得更好的结果。

著录项

  • 来源
    《World Wide Web》 |2019年第2期|499-515|共17页
  • 作者单位

    Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China;

    Shanghai Univ Finance & Econ, Dept Comp Sci, Shanghai, Peoples R China;

    Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China;

    Peking Univ, Sch Elect & Comp Engn, Shenzhen, Peoples R China;

    Shenzhen Univ, Sch Comp Sci, Shenzhen, Peoples R China;

    South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China;

    Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multitask learning; Real-time event summarization; Relevance prediction; Document filtering;

    机译:多任务学习;实时事件汇总;相关性预测;文档过滤;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号