Sentence Scoring in Multi-document Summarizing under Topic Model LDA

Haimin Shao; Jim Ma

首页> 外文期刊>Journal of information and computational science >Sentence Scoring in Multi-document Summarizing under Topic Model LDA

【24h】

Sentence Scoring in Multi-document Summarizing under Topic Model LDA

机译：主题模型LDA下多文档摘要中的句子评分

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper automatic multi-document summarizing in a greedy framework is studied, where sentences are selected based on their contribution for the theme construction of the summary. The scores of sentences are evaluated based on their topic representations obtained from LDA (Latent Dirichlet Allocation), which is a probabilistic topic model. Consistent probabilistic representations of the relations between texts and topics are first proposed, and then two scoring methods are developed based on these representations. In addition the sentence length as an important factor in document summarizing is also studied. Experimental results show the pertinence of these probabilities, and the effectiveness of our scoring methods.

机译：本文研究贪婪框架中的自动多文档摘要，其中基于句子的贡献选择句子，以实现摘要的主题构建。基于从概率主题模型LDA（潜在Dirichlet分配）获得的主题表示，评估句子的分数。首先提出了文本和主题之间关系的一致概率表示，然后基于这些表示开发了两种评分方法。另外，还研究了句子长度作为文档摘要中的重要因素。实验结果表明了这些概率的相关性，以及我们的评分方法的有效性。

著录项

来源
《Journal of information and computational science》 |2010年第1期|P.285-291|共7页
作者
Haimin Shao; Jim Ma;
展开▼
作者单位

School of Computer Science and Technology, Shandong University, Jinan 250101, China;

rnSchool of Computer Science and Technology, Shandong University, Jinan 250101, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
multi-document summarizing; sentence scor ing; latent dirichlet allocation;

机译：多文档摘要;句子评分潜在狄利克雷分配;

相似文献

外文文献
中文文献
专利

1. Multi-Document Summarization Using K-Means and Latent Dirichlet Allocation (LDA) – Significance Sentences [J] . Shiva Twinandilla, Satriyo Adhy, Bayu Surarso, Procedia Computer Science . 2018,第22期

机译：使用K均值和潜在Dirichlet分配（LDA）的多文档摘要-重要语句
2. Assessing shallow sentence scoring techniques and combinations for single and multi-document summarization [J] . Oliveira Hilario, Ferreira Rafael, Lima Rinaldo, Expert Systems with Application . 2016,第deca期

机译：评估浅句子评分技术及其对单文档和多文档摘要的组合
3. Heterogeneous-Length Text Topic Modeling for Reader-Aware Multi-Document Summarization [J] . Qiang Jipeng, Chen Ping, Ding Wei, ACM transactions on knowledge discovery from data . 2019,第4期

机译：读者感知多文件概述的异构长度文本模型
4. LDA-Based Topic Formation and Topic-Sentence Reinforcement for Graph-Based Multi-document Summarization [C] . Dehong Gao, Wenjie Li, You Ouyang, Asia Information Retrieval Societies Conference . 2012

机译：基于LDA的主题形成和主题句子加固基于图形的多文件摘要
5. Multi-document Summarization Based on Document Clustering and Neural Sentence Fusion [D] . Fuad, Tanvir Ahmed. 2018

机译：基于文档聚类和神经句子融合的多文件摘要
6. Ms2lda.org: web-based topic modelling for substructure discovery in mass spectrometry [O] . Joe Wandy, Yunfeng Zhu, Justin J J van der Hooft, -1

机译：Ms2lda.org：用于质谱分析中子结构发现的基于Web的主题建模
7. LDA-based topic formation and topic-sentence reinforcement for graph-based multi-document summarization [O] . Gao D, Li W, Ouyang Y, 2012

机译：基于LDA的主题形成和主题句增强，用于基于图的多文档摘要

Sentence Scoring in Multi-document Summarizing under Topic Model LDA

摘要

著录项

相似文献

相关主题

期刊订阅