首页> 美国卫生研究院文献>The Scientific World Journal >An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
【2h】

An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy

机译:基于朴素贝叶斯分类器的时间戳策略自动多文档文本摘要方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Nowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper introduces a new concept of timestamp approach with Naïve Bayesian Classification approach for multidocument text summarization. The timestamp provides the summary an ordered look, which achieves the coherent looking summary. It extracts the more relevant information from the multiple documents. Here, scoring strategy is also used to calculate the score for the words to obtain the word frequency. The higher linguistic quality is estimated in terms of readability and comprehensibility. In order to show the efficiency of the proposed method, this paper presents the comparison between the proposed methods with the existing MEAD algorithm. The timestamp procedure is also applied on the MEAD algorithm and the results are examined with the proposed method. The results show that the proposed method results in lesser time than the existing MEAD algorithm to execute the summarization process. Moreover, the proposed method results in better precision, recall, and F-score than the existing clustering with lexical chaining approach.
机译:如今,自动多文档文本摘要系统可以成功地从输入文档中检索摘要语句。但是,它具有许多局限性,例如对基本句子的提取不正确,覆盖率低,句子之间的连贯性差以及冗余。本文介绍了时间戳方法和朴素贝叶斯分类方法用于多文档文本摘要的新概念。时间戳为摘要提供了有序的外观,从而实现了连贯的外观摘要。它从多个文档中提取更相关的信息。在这里,计分策略还用于计算单词的分数以获得单词频率。从可读性和可理解性方面估计较高的语言质量。为了证明所提方法的有效性,本文对所提方法与现有MEAD算法进行了比较。时间戳过程也应用于MEAD算法,并使用所提出的方法检查结果。结果表明,所提出的方法比现有的MEAD算法执行摘要过程所需的时间更少。此外,与现有的具有词法链接方法的聚类相比,所提出的方法具有更高的精度,召回率和F分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号