Journal: Mathematical Problems in Engineering: Theory, Methods and Applications

Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges

Abstract

In recent years, the volume of textual data has rapidly increased, which has generated a valuable resource for extracting and analysing information. To retrieve useful knowledge within a reasonable time period, this information must be summarised. This paper reviews recent approaches for abstractive text summarisation using deep learning models. In addition, existing datasets for training and validating these approaches are reviewed, and their features and limitations are presented. The Gigaword dataset is commonly employed for single-sentence summary approaches, while the Cable News Network (CNN)/Daily Mail dataset is commonly employed for multisentence summary approaches. Furthermore, the measures that are utilised to evaluate the quality of summarisation are investigated, and Recall-Oriented Understudy for Gisting Evaluation 1 (ROUGE1), ROUGE2, and ROUGE-L are determined to be the most commonly applied metrics. The challenges that are encountered during the summarisation process and the solutions proposed in each approach are analysed. The analysis of the several approaches shows that recurrent neural networks with an attention mechanism and long short-term memory (LSTM) are the most prevalent techniques for abstractive text summarisation. The experimental results show that text summarisation with a pretrained encoder model achieved the highest values for ROUGE1, ROUGE2, and ROUGE-L (43.85, 20.34, and 39.9, respectively). Furthermore, it was determined that most abstractive text summarisation models faced challenges such as the unavailability of a golden token at testing time, out-of-vocabulary (OOV) words, summary sentence repetition, inaccurate sentences, and fake facts.
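The ROUGE metrics named above can be made concrete with a small sketch. The official ROUGE toolkit additionally reports precision and F1 and applies tokenisation options such as stemming; the recall-only version below (hypothetical helper names `rouge_n` and `rouge_l`) is a minimal illustration of how ROUGE1, ROUGE2, and ROUGE-L score a candidate summary against a reference.

```python
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def rouge_n(candidate, reference, n):
    """ROUGE-N recall: overlapping n-grams / n-grams in the reference."""
    cand = Counter(ngrams(candidate, n))
    ref = Counter(ngrams(reference, n))
    overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
    total = sum(ref.values())
    return overlap / total if total else 0.0

def lcs_len(a, b):
    """Longest common subsequence length via dynamic programming."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[-1][-1]

def rouge_l(candidate, reference):
    """ROUGE-L recall: LCS length / reference length."""
    return lcs_len(candidate, reference) / len(reference) if reference else 0.0

reference = "the cat sat on the mat".split()
candidate = "the cat is on the mat".split()
print(round(rouge_n(candidate, reference, 1), 3))  # → 0.833 (5 of 6 unigrams)
print(round(rouge_n(candidate, reference, 2), 3))  # → 0.6 (3 of 5 bigrams)
print(round(rouge_l(candidate, reference), 3))     # → 0.833 (LCS length 5)
```

The reported scores (e.g., ROUGE1 = 43.85) are these ratios multiplied by 100 and averaged over a test set.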
