Summarizing a Document Stream

机译：总结文档流

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce the task of summarizing a stream of short documents on microblogs such as Twitter. On microblogs, thousands of short documents on a certain topic such as sports matches or TV dramas are posted by users. Noticeable characteristics of microblog data are that documents are often very highly redundant and aligned on timeline. There can be thousands of documents on one event in the topic. Two very similar documents will refer to two distinct events when the documents are temporally distant. We examine the microblog data to gain more understanding of those characteristics, and propose a summarization model for a stream of short documents on timeline, along with an approximate fast algorithm for generating summary. We empirically show that our model generates a good summary on the datasets of microblog documents on sports matches.

机译：我们介绍了总结微博（例如Twitter）上的简短文档流的任务。在微博上，用户发布了关于特定主题（例如体育比赛或电视剧）的数千个简短文档。微博数据的显着特征是文档通常是非常冗余的，并且在时间轴上对齐。一个主题中的一个事件可能有成千上万的文档。当两个文件在时间上遥远时，两个非常相似的文件将引用两个不同的事件。我们研究了微博数据，以更深入地了解这些特征，并提出了时间线上短文档流的汇总模型，以及用于生成摘要的近似快速算法。我们凭经验表明，我们的模型对体育比赛中微博文档的数据集产生了很好的总结。

著录项

来源
《Advances in information retrieval》|2011年|p.177-188|共12页
会议地点 Dublin(IE);Dublin(IE)
作者
Hiroya Takamura; Hikaru Yokono; Manabu Okumura;
展开▼
作者单位

Precision and Intelligence Laboratory, Tokyo Institute of Technology;

Precision and Intelligence Laboratory, Tokyo Institute of Technology;

Precision and Intelligence Laboratory, Tokyo Institute of Technology;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词
入库时间 2022-08-26 13:47:05

相似文献

外文文献
中文文献
专利

1. Encoded summarization: summarizing documents into continuous vector space for legal case retrieval [J] . Vu Tran, Minh Le Nguyen, Satoshi Tojo, Artificial Intelligence and Law . 2020,第4期

机译：编码摘要：将文档归纳为法律案例检索的连续矢量空间
2. Single document summarization using the information from documents with the same topic [J] . Mao Xiangke, Huang Shaobin, Shen Linshan, Knowledge-Based Systems . 2021,第Sepa27期

机译：单一文件摘要使用来自具有相同主题的文档的信息
3. Multi document summarization based on news components using fuzzy cross-document relations [J] . Yogan Jaya Kumar, Naomie Salim, Albaraa Abuobieda, Applied Soft Computing . 2014,第Null期

机译：使用模糊的跨文档关系基于新闻组件的多文档摘要
4. Topic and Subject Detection in News Streams for Multi-document Summarization [C] . Fumiyo Fukumoto, Yoshimi Suzuki, Atsuhiro Takasu International Conference on Knowledge Discovery and Information Retrieval . 2012

机译：多文件摘要的新闻流中的主题和主题检测
5. Multi-document Summarization Based on Document Clustering and Neural Sentence Fusion [D] . Fuad, Tanvir Ahmed. 2018

机译：基于文档聚类和神经句子融合的多文件摘要
6. Extractive single document summarization using binary differential evolution: Optimization of different sentence quality measures [O] . Naveen Saini, Sriparna Saha, Dhiraj Chakraborty, 2019

机译：采用二元差分演进的提取单一文件摘要：不同句子质量措施的优化
7. Temporal Summarization of Time Critical Events - A system for summarizing events over time in a continuous stream of documents [O] . Eidheim Håvard Lund 2015

机译：时间关键事件的时间汇总-用于在连续文档流中随时间推移汇总事件的系统
8. Automatic Summarization with Sloth (Summarizes Lengthy Documents and Outputs The Highlights) [R] . Kaplin, D. B. 2002

机译：树懒自动摘要（总结冗长的文档和输出亮点）

Summarizing a Document Stream

摘要

著录项

相似文献

相关主题

期刊订阅