Source: 2017 International Conference on Engineering and Technology
A method to generate text summary by accounting pronoun frequency for keywords weightage computation

Abstract

In recent years, large volumes of data have been generated every day from various sources. Text summarization has become increasingly relevant for quick searching, abstract generation, automatic sorting, and similar tasks over large volumes of data. Extractive methods identify the important parts of a text to produce a summary. When a summary is generated by extractive methods, important keywords are identified by eliminating stopwords. As part of stopword removal, pronouns, which are used as placeholders for proper nouns in text, are usually removed. However, the frequency information carried by pronouns is significant for improving the quality of the generated summary. In this research, we propose a method that replaces pronouns with their corresponding proper nouns and then computes the frequency of keywords. Keyword weightage is calculated from this frequency, which in turn is used to extract important sentences to form the summary. Experiments are conducted on a text data collection, and a gain ratio is computed to measure the improvement in summaries generated by the pronoun replacement method.
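
The pipeline described in the abstract (replace pronouns, count keyword frequencies, score sentences) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how pronouns are resolved or how weightage is normalized, so the sketch assumes a crude heuristic (each pronoun is replaced by the most recently seen capitalized non-stopword) and uses raw counts as weights. The word lists and all function names (`replace_pronouns`, `keyword_weights`, `summarize`) are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Tiny illustrative word lists (assumption); a real system would use fuller resources.
PRONOUNS = {"he", "she", "him", "her", "his", "hers",
            "it", "its", "they", "them", "their"}
STOPWORDS = {"a", "an", "the", "is", "was", "are", "were", "in", "on",
             "at", "of", "to", "and", "with", "for", "by"} | PRONOUNS


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Return the alphabetic tokens of a sentence."""
    return re.findall(r"[A-Za-z]+", sentence)


def replace_pronouns(sentences):
    """Naive stand-in for coreference resolution (assumption, not from the paper):
    each pronoun is replaced by the most recently seen capitalized non-stopword."""
    last_proper = None
    resolved = []
    for sentence in sentences:
        tokens = []
        for tok in tokenize(sentence):
            if tok.lower() in PRONOUNS and last_proper is not None:
                tokens.append(last_proper)
            else:
                if tok[0].isupper() and tok.lower() not in STOPWORDS:
                    last_proper = tok
                tokens.append(tok)
        resolved.append(tokens)
    return resolved


def keyword_weights(token_lists):
    """Weight each keyword by its raw frequency after stopword removal."""
    return Counter(t.lower() for toks in token_lists for t in toks
                   if t.lower() not in STOPWORDS)


def summarize(text, n=1):
    """Return the n highest-scoring original sentences, in document order."""
    sentences = split_sentences(text)
    resolved = replace_pronouns(sentences)
    weights = keyword_weights(resolved)
    scores = [sum(weights[t.lower()] for t in toks if t.lower() not in STOPWORDS)
              for toks in resolved]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:n]
    return [sentences[i] for i in sorted(top)]
```

In a text such as "Marie studied physics. She won a prize.", plain stopword removal would count "marie" once, whereas the replacement step counts it twice, which raises the score of both sentences that refer to her. The capitalization heuristic will misfire on ordinary sentence-initial words, so a practical system would substitute a proper coreference resolver.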