首页> 外文会议>International Congress on Image and Signal Processing >Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining
【24h】

Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining

机译:非排他音频分割和索引作为音频信息挖掘的预处理器

获取原文

摘要

Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.
机译:可以从录制的音轨(例如多媒体文件的音轨)中提取很多与内容相关的信息。可以将试听音乐启发式地分为三类,即语音,音乐和环境或事件声音。过去的研究集中于以专有方式对音频片段进行分类的算法。但是,来自媒体内容的音轨通常被呈现为所有这三种类型的声音的重叠混合。因此,非排他的分段和索引编制对于有效的音频信息挖掘和元数据生成来说是必不可少的预处理器。本文强调了非排他性索引和分段方法的重要性,指出了挑战,并提出了一种用于非排他性分段和索引的通用体系结构作为音频信息挖掘,元数据提取和场景分析的预处理器。提出了相关的特征选择,模式识别和信号处理算法,并讨论了测试结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号