【24h】

A topic-based sentence representation for extractive text summarization

机译:用于提取文本摘要的基于主题的句子表示

获取原文

摘要

We examine the effect of probabilistic topic model-based word representations, on sentence-based extractive summarization. We formulate the task of sentence selection as a binary classification problem, and we test a variety of machine learning algorithms, exploring a range of different settings for classification and modelling. A preliminary investigation via a wide experimental evaluation on the MultiLing 2015 MSS dataset illustrates that topic-based representations can prove beneficial to the extractive summarization process, compared to a TF-IDF baseline, with Quadratic Discriminant Analysis and Gradient Boosting providing the best results for micro and macro Fl score, respectively.
机译:我们研究了基于概率主题模型的单词表示对基于句子的提取摘要的影响。我们将句子选择的任务表述为二进制分类问题,并且测试各种机器学习算法,探索分类和建模的各种不同设置。通过对MultiLing 2015 MSS数据集进行的广泛实验评估进行的初步调查表明,与TF-IDF基线相比,基于主题的表示形式可以证明对提取摘要过程有益,并且二次判别分析和梯度提升可为微观分析提供最佳结果和宏F1分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号