首页> 外文会议>Automatic Speech Recognition amp; Understanding, 2009. ASRU 2009 >Extractive speech summarization by active learning
【24h】

Extractive speech summarization by active learning

机译:主动学习提取语音摘要

获取原文

摘要

In this paper, we propose an active learning approach for feature-based extractive summarization of lecture speech. Most state-of-the-art speech summarization systems are trained by using a large amount of human reference summaries. Active learning targets to minimize human annotation efforts by automatically selecting a small amount of unlabeled examples for labeling. Our method chooses the unlabeled examples according to a combination of informativeness criterion and robustness criterion. Our summarization results show an increasing learning curve of ROUGE-L F-measure, from 0.44 to 0.54, consistently higher than that of using randomly chosen training samples. We also show that, by following the rhetorical structure in presentation slides, it is possible for humans to produce Ȝgold standardȝ reference summaries with very high inter-labeler agreement.
机译:在本文中,我们提出了一种主动学习方法,用于基于特征的演讲演讲摘要。大多数最新的语音摘要系统都是通过使用大量的人工参考摘要进行训练的。主动学习的目标是通过自动选择少量未标记的示例进行标记,以最大程度地减少人工注释的工作量。我们的方法根据信息性标准和鲁棒性标准的组合选择未标记的示例。我们的总结结果显示,ROUGE-L F-度量的学习曲线从0.44到0.54不断增加,始终高于使用随机选择的训练样本的学习曲线。我们还表明,通过遵循演示幻灯片中的修辞结构,人类有可能产生具有很高标记间一致性的“黄金标准”参考摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号