首页> 外文期刊>Computing and informatics >DOCUMENT SUMMARIZATION USING NMF AND PSEUDO RELEVANCE FEEDBACK BASED ON K-MEANS CLUSTERING
【24h】

DOCUMENT SUMMARIZATION USING NMF AND PSEUDO RELEVANCE FEEDBACK BASED ON K-MEANS CLUSTERING

机译:基于K均值聚类的NMF和伪相关反馈的文档摘要

获取原文
获取原文并翻译 | 示例
           

摘要

According to the increment of accessible text data source on the inter net, it has increased the necessity of the automatic text document summarization. However, the performance of the automatic methods might be poor because the semantic gap between high level user's summary requirement and low level vector representation of machine exists. In this paper, to overcome that problem, we propose a new document summarization method using a pseudo relevance feedback based on clustering method and NMF (non-negative matrix factorization). Relevance feedback is effective technique to minimize the semantic gap of information processing, but the general relevance feedback needs an intervention of a user. Additionally, the refined query without user interference by pseudo relevance feedback may be biased. The proposed method provides an automatic relevance judgment to reformulate query using the clustering method for minimizing a bias of query expansion. The method also can improve the quality of document summarization since the summarized documents are influenced by the semantic features of documents and the expanded query. The experimental results demonstrate that the proposed method achieves better performance than the other document summarization methods.
机译:随着互联网上可访问文本数据源的增加,增加了自动文本文档摘要的必要性。但是,自动方法的性能可能会很差,因为高级用户的摘要要求与机器的低级向量表示之间存在语义上的差距。在本文中,为解决该问题,我们提出了一种新的基于聚类方法和NMF(非负矩阵分解)的伪相关反馈的文档摘要方法。关联反馈是使信息处理的语义差距最小化的有效技术,但是一般的关联反馈需要用户的干预。另外,在没有用户干扰的情况下,经过改进的查询会受到伪相关性反馈的影响。所提出的方法提供了一种自动相关性判断,以使用聚类方法重新构造查询,以最小化查询扩展的偏差。由于摘要文档受到文档语义特征和扩展查询的影响,因此该方法还可以提高文档摘要的质量。实验结果表明,与其他文献总结方法相比,该方法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号