Multi-document Summarization by Creating Synthetic Document Vector Based on Language Model

机译：通过基于语言模型创建综合文档向量的多文档摘要

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-document summarization is to create summaries covering the major information that multiple documents tell in common. For this point, the existing methods are based on hand-crafted features for word and sentence. However, it is difficult to figure out the core contents of each document with the hand-crafted features because they have the limited information presented the given documents. Moreover, there exists a limit to figure out the major information because documents with the same meaning used to be paraphrased depending on their writers. Therefore, it is necessary to represent the semantic meanings of documents as well as sentences through understanding natural language. In this paper, we propose a new multi-document summarization system by creating a synthetic document vector covering the whole documents based on Language Model, whose is well-known for learning the semantic features in text. We experimented with DUC 2004 dataset provided by Document Understanding Conference (DUC) and the results show that our method summarizes multiple documents effectively based on their core contents.

机译：多文档摘要是为了创建摘要，以涵盖多个文档共同讲述的主要信息。为此，现有方法基于单词和句子的手工制作功能。但是，由于手工制作的功能在给定文档中提供的信息有限，因此很难找出每个文档的核心内容。此外，由于主要具有相同含义的文档根据其作者而被释义，因此找出主要信息存在一定的局限性。因此，有必要通过理解自然语言来表达文档和句子的语义含义。本文通过基于语言模型创建覆盖整个文档的合成文档向量，提出了一种新的多文档摘要系统，该系统以学习文本的语义特征而闻名。我们对由文档理解会议（DUC）提供的DUC 2004数据集进行了试验，结果表明，我们的方法有效地总结了基于其核心内容的多个文档。

著录项

来源
《International Conference on Soft Computing and Intelligent Systems;International Symposium on Advanced Intelligent Systems》|2016年|605-609|共5页
会议地点
作者
Dahae Kim; Jee-Hyong Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantics; Context; Hidden Markov models; Computational modeling; Redundancy; Intelligent systems; Natural languages;

机译：语义;上下文;隐马尔可夫模型;计算模型;冗余;智能系统;自然语言;

相似文献

外文文献
中文文献
专利

1. INFORMATION ORDERING WITH AN EVENT-ENRICHED VECTOR SPACE MODEL FOR MULTI-DOCUMENT NEWS SUMMARIZATION [J] . Zhang Renxian, Li Wenjie, Liu Naishi, Computational Intelligence . 2016,第2期

机译：利用多事件新闻摘要的事件向量丰富的空间模型进行信息订购
2. Multi-document Summarization using Probabilistic Topic-based Network Models [J] . Yang Cheng-Zen, Fan Jhih-Shang, Liu Yu-Fan Journal of information science and engineering . 2016,第6期

机译：使用基于概率主题的网络模型进行多文档摘要
3. MHLM Majority Voting Based Hybrid Learning Model for Multi-Document Summarization [J] . Suneetha S, Venugopal Reddy A International journal of artificial life research . 2019,第1期

机译：基于MHLM多数投票的混合学习模型，用于多文档摘要
4. Multi-document Summarization by Creating Synthetic Document Vector Based on Language Model [C] . Dahae Kim, Jee-Hyoung Lee International Conference on Soft Computing and Intelligent Systems . 2016

机译：基于语言模型创建合成文档向量的多文件摘要
5. Multi-document Summarization Based on Document Clustering and Neural Sentence Fusion [D] . Fuad, Tanvir Ahmed. 2018

机译：基于文档聚类和神经句子融合的多文件摘要
6. Free-text medical document retrieval via phrase-based vector space model. [O] . Wenlei Mao, Wesley W. Chu 2002

机译：通过基于短语的向量空间模型检索自由文本医学文献。
7. Cross - Language based Multi-Document Summarization Model using Machine Learning Technique [O] . Ms. P. Mahalakshmi Et.al 2021

机译：基于跨语言的多文件摘要模型使用机器学习技术

Multi-document Summarization by Creating Synthetic Document Vector Based on Language Model

摘要

著录项

相似文献

相关主题

期刊订阅