Combining Structural Analysis and Computer Vision Techniques for Automatic Speech Summarization

机译：结合结构分析和计算机视觉技术进行语音自动摘要

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Similar to verse and chorus sections that appear as repetitive structures in musical audio, key-concept (or topic) of some speech recordings (e.g., presentations, lectures, etc.) may also repeat itself over the time. Hence, accurate detection of these repetitions may be helpful to the success of automatic speech summarization. Based on this motivation, we consider the applicability of music structural analysis methods to speech summary generation. Our method transforms a 1-D time-domain speech signal to a 2-D image representation, namely (dis)similarity matrix and detects possible repetitions within the matrix by using proper computer vision techniques. In addition, the method does not transcribe speech signal into words, phrases, or sentences. Hence, it can be generalized as speech-to-speech summarization method, in which summarization results are presented by speech instead of text. Furthermore, the method does not need a prior knowledge about the language or grammar of speech signal. Experiments show that, our method can capture the main theme of speech signals compared to the ideal transcription sections defined by experts and computational analysis shows our proposed method has a good performance.

机译：与在音乐音频中以重复结构形式出现的诗句和合唱部分相似，某些语音记录（例如演示，演讲等）的关键概念（或主题）也可能会随时间重复。因此，这些重复的准确检测可能有助于自动语音摘要的成功。基于这种动机，我们考虑了音乐结构分析方法在语音摘要生成中的适用性。我们的方法将一维时域语音信号转换为二维图像表示形式（即（非）相似度矩阵），并通过使用适当的计算机视觉技术来检测矩阵内的可能重复。另外，该方法不会将语音信号转录为单词，短语或句子。因此，它可以推广为语音到语音的摘要方法，该方法中的摘要结果是通过语音而不是文本来呈现的。此外，该方法不需要关于语音信号的语言或语法的先验知识。实验表明，与专家定义的理想转录段相比，该方法可以捕获语音信号的主题，计算分析表明该方法具有良好的性能。

著录项

来源
《Multimedia, ISM, 2008 10th IEEE International Symposium on》||P.515-520|共6页
会议地点
作者
Sert Mustafa; Baykal Buyurman; Yazici Adnan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
Audio content analysis; key-concept detection; speech summarization;

机译：音频内容分析;关键概念检测;语音摘要;

相似文献

外文文献
中文文献
专利

1. Computer Vision Techniques for Automatic Structural Assessment of Underground Pipes [J] . Sunil K. Sinha, Paul W. Fieguth, Maria A. Polak Computer-Aided Civil and Infrastructure Engineering . 2003,第2期

机译：地下管道自动结构评估的计算机视觉技术
2. Sentence-extractive automatic speech summarization and evaluation techniques [J] . Makoto Hirohata, Yosuke Shinnaka, Koji Iwano, Speech Communication . 2006,第9期

机译：提取句子的自动语音摘要和评估技术
3. 3D superimposition of dental casts based on coloured landmark detection using combined computer vision and 3D computer graphics techniques [J] . Computer methods in biomechanics and bio . 2020,第1a2期

机译：基于彩色地标检测的牙科铸件3D叠加，结合了计算机视觉和3D计算机图形技术
4. Combining Structural Analysis and Computer Vision Techniques for Automatic Speech Summarization [C] . Sert Mustafa, Baykal Buyurman, Yazici Adnan IEEE International Symposium on Multimedia . 2008

机译：组合结构分析与计算机视觉技术进行自动语音摘要
5. Automatic spacecraft docking using computer vision and nonlinear control techniques. [D] . Ho, Chi-Chang Johnny. 1991

机译：使用计算机视觉和非线性控制技术的自动航天器对接。
6. Computer Vision System for Welding Inspection of Liquefied Petroleum Gas Pressure Vessels Based on Combined Digital Image Processing and Deep Learning Techniques [O] . Yarens J. Cruz, Marcelino Rivas, Ramón Quiza, 2020

机译：基于组合数字图像处理和深层学习技术的液化石油气压力容器焊接检查计算机视觉系统
7. Computer Vision and Image Analysis based Techniques for Automatic Characterization of Fruits – a Review [O] . Jyoti A Kodagali, S Balaji 2013

机译：基于计算机视觉和图像分析的水果自动表征技术–综述
8. Development of Computer Vision Techniques for Automatic Feature Extraction [R] . Gordon, D. K., Pascucci, R. F. 1987

机译：自动特征提取计算机视觉技术的发展

Combining Structural Analysis and Computer Vision Techniques for Automatic Speech Summarization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅