Conference: International Conference on Knowledge, Information and Creativity Support Systems

Investigating Unit Weighting and Unit Selection Factors in Thai Multi-document Summarization

Abstract

Once documents are broken down into small units, unit weighting and unit selection are two important factors in summarizing multiple related documents. This paper investigates the performance of several variants of unit weighting and unit selection schemes for Thai multi-document summarization. Fifty sets of Thai news articles, each with reference summaries, are used to evaluate the weighting and selection methods. In comparison with PageRank and Maximal Marginal Relevance (MMR) under four ROUGE measures, the results show that iterative weighting outperforms traditional TF-IDF, and that iterative node weighting, query relevance, centroid-based selection, and unit redundancy consideration can help improve summary quality.
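To make the interplay of unit weighting and unit selection concrete, the following is a minimal sketch of TF-IDF weighting followed by MMR-style selection. It is not the paper's implementation: the function names (`tfidf_vectors`, `centroid`, `mmr_select`), the smoothed IDF formula, the use of the centroid as a stand-in for the query, the trade-off parameter `lam`, and the toy English tokens are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(units):
    """Return one sparse TF-IDF vector (a dict) per tokenized unit.

    Assumption: smoothed IDF, log((1 + n) / (1 + df)) + 1.
    """
    n = len(units)
    df = Counter(term for unit in units for term in set(unit))
    vectors = []
    for unit in units:
        tf = Counter(unit)
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroid(vectors):
    """Average of all unit vectors, used here in place of a query."""
    total = Counter()
    for vec in vectors:
        for t, w in vec.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}


def mmr_select(vectors, query, k, lam=0.5):
    """Greedily pick k unit indices, trading relevance against redundancy."""
    selected = []
    candidates = list(range(len(vectors)))
    while candidates and len(selected) < k:

        def score(i):
            relevance = cosine(vectors[i], query)
            redundancy = max(
                (cosine(vectors[i], vectors[j]) for j in selected), default=0.0
            )
            return lam * relevance - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Hypothetical toy units: the first two are duplicates of each other.
units = [
    ["flood", "hits", "bangkok"],
    ["flood", "hits", "bangkok"],
    ["government", "sends", "aid"],
]
vectors = tfidf_vectors(units)
picked = mmr_select(vectors, centroid(vectors), k=2, lam=0.5)
```

With `lam=0.5` the redundancy penalty keeps the duplicate unit out of the selection, whereas `lam=1.0` ranks by relevance alone and would admit it. This is the kind of unit redundancy consideration the abstract refers to, shown here only in its simplest form.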
