首页> 外文会议>Multimedia, ISM, 2008 10th IEEE International Symposium on >Combining Structural Analysis and Computer Vision Techniques for Automatic Speech Summarization
【24h】

Combining Structural Analysis and Computer Vision Techniques for Automatic Speech Summarization

机译:结合结构分析和计算机视觉技术进行语音自动摘要

获取原文
获取外文期刊封面目录资料

摘要

Similar to verse and chorus sections that appear as repetitive structures in musical audio, key-concept (or topic) of some speech recordings (e.g., presentations, lectures, etc.) may also repeat itself over the time. Hence, accurate detection of these repetitions may be helpful to the success of automatic speech summarization. Based on this motivation, we consider the applicability of music structural analysis methods to speech summary generation. Our method transforms a 1-D time-domain speech signal to a 2-D image representation, namely (dis)similarity matrix and detects possible repetitions within the matrix by using proper computer vision techniques. In addition, the method does not transcribe speech signal into words, phrases, or sentences. Hence, it can be generalized as speech-to-speech summarization method, in which summarization results are presented by speech instead of text. Furthermore, the method does not need a prior knowledge about the language or grammar of speech signal. Experiments show that, our method can capture the main theme of speech signals compared to the ideal transcription sections defined by experts and computational analysis shows our proposed method has a good performance.
机译:与在音乐音频中以重复结构形式出现的诗句和合唱部分相似,某些语音记录(例如演示,演讲等)的关键概念(或主题)也可能会随时间重复。因此,这些重复的准确检测可能有助于自动语音摘要的成功。基于这种动机,我们考虑了音乐结构分析方法在语音摘要生成中的适用性。我们的方法将一维时域语音信号转换为二维图像表示形式(即(非)相似度矩阵),并通过使用适当的计算机视觉技术来检测矩阵内的可能重复。另外,该方法不会将语音信号转录为单词,短语或句子。因此,它可以推广为语音到语音的摘要方法,该方法中的摘要结果是通过语音而不是文本来呈现的。此外,该方法不需要关于语音信号的语言或语法的先验知识。实验表明,与专家定义的理想转录段相比,该方法可以捕获语音信号的主题,计算分析表明该方法具有良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号