...
首页> 外文期刊>International journal of information retrieval research >Hybrid Approach for Single Text Document Summarization Using Statistical and Sentiment Features
【24h】

Hybrid Approach for Single Text Document Summarization Using Statistical and Sentiment Features

机译:使用统计和情感特征的单文本文档摘要混合方法

获取原文
获取原文并翻译 | 示例
           

摘要

Summarization is away to represent same information in concise way with equal sense. This can be categorized in two type ive and Extractive type. Our work is focused around Extractive summarization. A generic approach to extractive summarization is to consider sentence as an entity, score each sentence based on some indicative features to ascertain the quality of sentencefor inclusion in summary. Sort the sentences on the score and consider top n sentences for summarization. Mostly statistical features have been used for scoring the sentences. A hybrid model for a single text document summarization is being proposed This hybrid model is an extraction based approach, which is combination of Statistical and semantic technique. The hybrid model depends on the linear combination of statistical measures: sentence position, TF-IDF, Aggregate similarity, centroid, and semantic measure. The idea to include sentiment analysis for salient sentence extraction is derived from the concept that emotion plays an important role in communication to effectively convey any message hence, it can play a vital role in text document summarization. For comparison, five system summaries have been generated: Proposed Work, MEAD system, Microsoft system, OPINOSIS system, and Human generated summary, and evaluation is done using ROUGE score.
机译:简而言之,以同等的意义来概括表示相同的信息是不可能的。可以将其分为ive类型和Extractive类型。我们的工作集中在提取摘要上。提取摘要的通用方法是将句子视为一个整体,根据一些指示性特征对每个句子评分,以确定要包含在摘要中的句子的质量。对分数上的句子进行排序,并考虑前n个句子进行总结。大多数情况下,已使用统计功能为句子评分。提出了一种用于单一文本文档摘要的混合模型。该混合模型是一种基于提取的方法,是统计和语义技术的结合。混合模型取决于统计量度的线性组合:句子位置,TF-IDF,聚合相似度,质心和语义量度。包含情感分析以进行显着句子提取的思想源于这样的概念,即情感在有效传达任何信息的交流中起着重要作用,因此它在文本文档摘要中起着至关重要的作用。为了进行比较,已生成了五个系统摘要:拟议工作,MEAD系统,Microsoft系统,OPINOSIS系统和人工生成的摘要,并使用ROUGE评分进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号