Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining

机译：非排他音频分割和索引作为音频信息挖掘的预处理器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.

机译：可以从录制的音轨（例如多媒体文件的音轨）中提取很多与内容相关的信息。可以将试听音乐启发式地分为三类，即语音，音乐和环境或事件声音。过去的研究集中于以专有方式对音频片段进行分类的算法。但是，来自媒体内容的音轨通常被呈现为所有这三种类型的声音的重叠混合。因此，非排他的分段和索引编制对于有效的音频信息挖掘和元数据生成来说是必不可少的预处理器。本文强调了非排他性索引和分段方法的重要性，指出了挑战，并提出了一种用于非排他性分段和索引的通用体系结构作为音频信息挖掘，元数据提取和场景分析的预处理器。提出了相关的特征选择，模式识别和信号处理算法，并讨论了测试结果。

著录项

来源
《International Congress on Image and Signal Processing》|2013年|1593-1597|共5页
会议地点
作者
Li Francis F.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Audio segmentation; Metadata; audio information mining; classification; content descriptor; indexing; scene analysis;

机译：音频分割元数据音频信息挖掘分类内容描述符索引场景分析;

相似文献

外文文献
中文文献
专利

1. HMM-Based Text Segmentation Using Variational Bayes Learning and Its Application to Audio-Visual Indexing [J] . Takafumi Koshinaka, Akitoshi Okumura, Ryosuke Isotani Electronics and Communications in Japan. Part 2, Electronics . 2007,第12期

机译：基于变分贝叶斯学习的基于HMM的文本分割及其在视听索引中的应用
2. A generic audio classification and segmentation approach for multimedia indexing and retrieval [J] . Kiranyaz S., Ahmad Farooq Qureshi, Gabbouj M. IEEE transactions on audio, speech and language processing . 2006,第3期

机译：用于多媒体索引和检索的通用音频分类和分段方法
3. DISTBIC: A speaker-based segmentation for audio data indexing [J] . P. Delacourt, C. J. Wellekens 20f Speech Communication . 2000,第1a2期

机译：DISTBIC：基于说话者的音频数据索引分割
4. Nonexclusive Audio Segmentation and Indexing as a Pre-processor for Audio Information Mining A universal architecture and feature space selection [C] . Francis F. Li International Congress on Image and Signal Processing . 2013

机译：非删除音频分割和索引作为音频信息挖掘通用架构和特征空间选择的预处理器
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. Recognizing hotspots in Brief Eclectic Psychotherapy for PTSD by text and audio mining [O] . Sytske Wiegersma, Mirjam J. Nijdam, Arjan J. van Hessen, 2020

机译：通过文本和音频挖掘识别PTSD的短暂折衷心理治疗中的热点
7. A generic audio classification and segmentation approach for multimedia indexing and retrieval [O] . Kiranyaz, S, Qureshi, AF, Gabbouj, M 2006

机译：用于多媒体索引和检索的通用音频分类和分段方法
8. Visually based Audio Texture Segmentation For Audio Scene Analysis. [R] . GHOZI, R., FRAJ, O. 2009

机译：用于音频场景分析的基于视觉的音频纹理分割。

Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining

摘要

著录项

相似文献

相关主题

期刊订阅