Video-assisted segmentation of speech and audio track

机译：语音和音轨的视频辅助分割

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Abstract: Video database research is commonly concerned with the storage and retrieval of visual information invovling sequence segmentation, shot representation and video clip retrieval. In multimedia applications, video sequences are usually accompanied by a sound track. The sound track contains potential cues to aid shot segmentation such as different speakers, background music, singing and distinctive sounds. These different acoustic categories can be modeled to allow for an effective database retrieval. In this paper, we address the problem of automatic segmentation of audio track of multimedia material. This audio based segmentation can be combined with video scene shot detection in order to achieve partitioning of the multimedia material into semantically significant segments. !17

机译：摘要：视频数据库研究通常涉及视觉信息的存储和检索，涉及序列分割，镜头表示和视频剪辑检索。在多媒体应用中，视频序列通常带有音轨。音轨包含有助于镜头分割的潜在线索，例如不同的扬声器，背景音乐，唱歌和独特的声音。可以对这些不同的声学类别进行建模，以实现有效的数据库检索。在本文中，我们解决了多媒体材料的音轨自动分割的问题。这种基于音频的分段可以与视频场景镜头检测结合使用，以实现将多媒体材料划分为语义上重要的分段。！17

著录项

来源
《Conference on multimedia storage and archiving systems》|1999年|p.68-77|共10页
会议地点
作者
Medha Pandit; Univ. of Surrey; Guildford; United Kingdom; Yusseri Yusoff; Univ. of Surrey; Guildford; United Kingdom; Josef Kittler; Univ. of Surrey; Guildford Surrey; United Kingdom; William J. Christmas; Univ. of Surrey; Guildford Surrey; United Kingdom; E.H. Chilton; Univ. of Surrey; Guildford; United Kingdom.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/ Speech Video Soundtracks [J] . Robert Mertens, Po-Sen Huang, Luke Gottlieb, International journal of multimedia data engineering & management . 2012,第3期

机译：说话者差异化在非语音和非语音/语音混合视频音轨的音频索引中的适用性
2. From Pitches to Notes: Creation and Segmentation of Pitch Tracks for Melody Detection in Polyphonic Audio [J] . Rui Pedro Paiva, Teresa Mendes, Amlcar Cardoso Journal of New Music Research . 2008,第3期

机译：从音高到音符：用于和弦音频中旋律检测的音高音轨的创建和分段
3. Unsupervised speaker segmentation and tracking in real-time audio content analysis [J] . Lie Lu, Hong-Jiang Zhang Multimedia systems . 2005,第4期

机译：实时音频内容分析中的无监督说话者分割和跟踪
4. Video-assisted segmentation of speech and audio track [C] . Medha Pandit, Yusseri Yusoff, Josef Kittler, Conference on multimedia storage and archiving systems . 1999

机译：言语辅助分段的语音和音轨
5. Audio segmentation for meetings speech processing. [D] . Boakye, Kofi Agyeman. 2008

机译：会议语音处理的音频分段。
6. No There Is No 150 ms Lead of Visual Speech on Auditory Speech but a Range of Audiovisual Asynchronies Varying from Small Audio Lead to Large Audio Lag [O] . Jean-Luc Schwartz, Christophe Savariaux 2014

机译：不听觉语音没有150 ms的视觉语音导联但是视听异步范围从小音频导联到大音频滞后
7. Real-time audiovisual speech capture and motion tracking for speech-driven facial animation [O] . Jablonski Karl Adam 2013

机译：语音驱动的面部动画的实时视听语音捕获和运动跟踪
8. Lip Tracking for Audio-Visual Speech Recognition [R] . Kaucic, R. A. 1997

机译：用于视听语音识别的唇部跟踪

Video-assisted segmentation of speech and audio track

摘要

著录项

相似文献

相关主题

期刊订阅