首页> 外文会议>International Congress on Image and Signal Processing >Nonexclusive Audio Segmentation and Indexing as a Pre-processor for Audio Information Mining A universal architecture and feature space selection
【24h】

Nonexclusive Audio Segmentation and Indexing as a Pre-processor for Audio Information Mining A universal architecture and feature space selection

机译:非删除音频分割和索引作为音频信息挖掘通用架构和特征空间选择的预处理器

获取原文

摘要

Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.
机译:可以从录制的原声带中提取许多内容相关信息,例如多媒体文件。原声带可能会被启发式分为三个类别,即语音,音乐和环境或事件声音。过去的研究专注于算法以独家方式对音频剪辑进行分类。然而,媒体内容的原声往往呈现为所有这三种类型的声音的重叠混合物。因此,非删除的分割和索引是有效音频信息挖掘和元数据生成的必要预处理。本文强调了非纯粹索引和分段方法的重要性,识别挑战,并提出了一种非激活分割和索引作为音频信息挖掘,元数据提取和场景分析的预处理器的通用架构。提出了相关的特征选择,提出了模式识别和信号处理算法,并讨论了测试结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号