Journal: International Journal of Computer Trends and Technology

Extractive Summarization Method for Arabic Text - ESMAT


Abstract

The huge and rapid growth of online data makes searching such massive collections and finding relevant information a difficult and time-consuming task. For this reason, research on automatic summarization techniques has received much attention from industry and academia. Unlike English text, which has been studied extensively in this field, Arabic text still lacks such serious investigation. This gap motivated the author of this paper to help bring the Arabic language to the attention of automatic text summarization researchers by proposing an extractive summarization method. The proposed method generates a summary of an original document based on a linear combination of text features having different structures. Five summarizers (AQBTSS, Gen–Summ, LSA–Summ, Sakhr and Baseline–1) are used in this study as benchmarks. The proposed method and the benchmarks are evaluated using the Essex Arabic Summaries Corpus (EASC). The results show that the proposed method outperforms the five benchmarks in terms of recall, precision and average scores. This performance indicates that focusing on more complex features, rather than simple ones, can lead to the most important content of a document.
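The abstract does not list the features or weights used, so the following is only a minimal sketch of what scoring sentences by a linear combination of features could look like. The two features (sentence position and relative length), the weights, and all function names are assumptions made for illustration; they are not taken from the paper.

```python
from typing import Callable, List, Sequence

# A feature maps (sentence, index, all sentences) to a score in [0, 1].
Feature = Callable[[str, int, Sequence[str]], float]


def position_feature(sentence: str, index: int, sentences: Sequence[str]) -> float:
    """Hypothetical feature: earlier sentences score higher."""
    return 1.0 - index / len(sentences)


def length_feature(sentence: str, index: int, sentences: Sequence[str]) -> float:
    """Hypothetical feature: word count relative to the longest sentence."""
    longest = max(len(s.split()) for s in sentences)
    return len(sentence.split()) / longest if longest else 0.0


def score_sentences(
    sentences: Sequence[str],
    features: Sequence[Feature],
    weights: Sequence[float],
) -> List[float]:
    """Score each sentence as the weighted sum of its feature values."""
    return [
        sum(w * f(s, i, sentences) for f, w in zip(features, weights))
        for i, s in enumerate(sentences)
    ]


def summarize(
    sentences: Sequence[str],
    features: Sequence[Feature],
    weights: Sequence[float],
    k: int,
) -> List[str]:
    """Return the k highest-scoring sentences in their original order."""
    scores = score_sentences(sentences, features, weights)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(sentences, [position_feature, length_feature], [0.5, 0.5], k=2)` would keep the two sentences with the highest combined score. Changing the weights shifts which features dominate the selection, which is the design freedom a linear combination provides.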
