NTT Technical Review

Media Scene Learning: A Novel Framework for Automatically Extracting Meaningful Parts from Audio and Video Signals



Abstract

We describe a novel framework called Media Scene Learning (MSL) for automatically extracting key components such as the sound of a single instrument from a given audio signal or a target object from a given video signal. In particular, we introduce two key methods: 1) the Composite Auto-Regressive System (CARS) for decomposing audio signals into several sound components on the basis of a generative model of sounds and 2) Saliency-Based Image Learning (SBIL) for extracting object-like regions from a given video signal on the basis of the characteristics of the human visual system.
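The abstract does not spell out how CARS is formulated, only that it decomposes an audio signal on the basis of a generative, autoregressive model of sounds. As a rough, hypothetical illustration of the basic building block of such models (not the CARS decomposition itself), the sketch below estimates an all-pole (autoregressive) spectral envelope for a single audio frame using the standard autocorrelation (Yule-Walker) method:

import numpy as np

def ar_envelope(frame, order=12, n_fft=512):
    # Estimate an all-pole (autoregressive) spectral envelope for one
    # audio frame using the autocorrelation (Yule-Walker) method.
    x = frame * np.hanning(len(frame))
    r = np.correlate(x, x, mode="full")[len(x) - 1:len(x) + order]  # lags 0..order
    # Solve the Yule-Walker normal equations R a = -r[1:] for the AR coefficients.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * np.eye(order)                      # small ridge term for stability
    a = np.concatenate(([1.0], np.linalg.solve(R, -r[1:order + 1])))
    gain = r[0] + np.dot(a[1:], r[1:order + 1])    # prediction-error power
    # Power response of the all-pole filter, gain / |A(e^{jw})|^2, on an FFT grid.
    A = np.fft.rfft(a, n_fft)
    return gain / (np.abs(A) ** 2 + 1e-12)

Calling ar_envelope on successive frames of a recording (e.g. ar_envelope(signal[i:i + 1024])) yields the smooth spectral envelopes that AR-based source models, including composite ones, build upon.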
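Likewise, SBIL is characterized here only as saliency-driven extraction of object-like regions. As a minimal stand-in for the general idea, the following sketch uses the well-known spectral-residual saliency method (Hou & Zhang, 2007) rather than the authors' algorithm:

import numpy as np
from scipy.ndimage import uniform_filter, gaussian_filter

def spectral_residual_saliency(gray):
    # Saliency map of a grayscale image via the spectral-residual method:
    # keep what deviates from the locally averaged log amplitude spectrum.
    f = np.fft.fft2(gray)
    log_amp = np.log(np.abs(f) + 1e-8)
    residual = log_amp - uniform_filter(log_amp, size=3)
    sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * np.angle(f)))) ** 2
    return gaussian_filter(sal, sigma=3)

def object_like_mask(gray, k=3.0):
    # Threshold the saliency map to obtain candidate object-like regions.
    sal = spectral_residual_saliency(gray)
    return sal > k * sal.mean()

A mask like this merely marks conspicuous regions in a single frame; the SBIL method described in the paper additionally learns which of these regions correspond to objects across the video signal.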

