首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >ILP-based Compressive Speech Summarization with Content Word Coverage Maximization and Its Oracle Performance Analysis
【24h】

ILP-based Compressive Speech Summarization with Content Word Coverage Maximization and Its Oracle Performance Analysis

机译:具有内容词覆盖率最大化的基于ILP的压缩语音汇总及其Oracle性能分析

获取原文
获取外文期刊封面目录资料

摘要

We propose an integer linear programming (ILP)-based compressive speech summarization method that maximizes the coverage of content words in a resultant summary. It is an unsupervised method and, under the designed constraints, it performs a single-step globally optimal summarization of a given long speech recording, which is decoded as a confusion network form of an automatic speech recognition (ASR) hypothesis sequence. It selects as many different content words as possible from the speech input that inevitably includes a high level of redundancy (e.g. the repetition of the same word) under a given length constraint. In experiments using a lecture speech corpus, we obtained higher summarization performance in terms of ROUGE scores than with a baseline extractive summarization method. We further conduct experimental analyses to obtain the oracle (upper bound) performance of the summarization methods. The analysis results show that the oracle performance is very high even though the ASR hypotheses include recognition errors. It is significantly higher than the system performance and, in addition, the oracle performance of the compressive method is significantly higher than that of the extractive method. These results confirm that our method is a promising approach.
机译:我们提出了一种基于整数线性规划(ILP)的压缩语音总结方法,该方法可最大程度地提高结果摘要中内容词的覆盖范围。这是一种无监督的方法,在设计的约束下,它对给定的长语音记录执行单步全局最佳汇总,然后将其解码为自动语音识别(ASR)假设序列的混淆网络形式。在给定的长度约束下,它从语音输入中选择尽可能多的不同内容词,这些词不可避免地包括高水平的冗余度(例如,相同词的重复)。在使用演讲语音语料库的实验中,我们在ROUGE分数方面获得了比基线提取摘要方法更高的摘要性能。我们进一步进行实验分析,以获得摘要方法的预言性(上限)。分析结果表明,即使ASR假设包括识别错误,oracle的性能还是很高的。它显着高于系统性能,此外,压缩方法的预言性能显着高于提取方法的预言性能。这些结果证实了我们的方法是一种有前途的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号