首页> 外文会议>Audio Engineering Society Convention >Audio content annotation, description and management using joint audio detection, segmentation and classification techniques
【24h】

Audio content annotation, description and management using joint audio detection, segmentation and classification techniques

机译:音频内容注释,描述和管理使用联合音频检测,分段和分类技术

获取原文

摘要

The current paper focuses on audio content management by means of joint audio segmentation and classification. We concentrate on the separation of typical audio classes, such as silence / background noise, speech, music and their combinations. A compact feature-vector subset is selected by a Correlation feature selection subset evaluation algorithm after the use of EM clustering algorithm on an initial audio data set. Time and spectral parameters are extracted using filter-banks and wavelets in combination with sliding windows and exponential moving averaging techniques. Features are extracted on a point-to-point basis, using the finest possible time resolution, so that each sample can be individually classified to one of the available groups. Clustering algorithms like EM or Simple K-means are tested to evaluate the final point-to-point classification result, therefore the joint audio detection-classification indexes. The extracted audio detection, segmentation and classification results can be incorporated into appropriate description schemes that would annotate audio events / segments for content description and management purposes.
机译:本文通过联合音频分割和分类侧重于音频内容管理。我们专注于分离典型的音频课程,例如沉默/背景噪音,语音,音乐及其组合。在初始音频数据集上使用EM聚类算法之后,通过相关特征选择子集评估算法选择紧凑的特征 - 向量子集。使用滤波器组和小波与滑动窗口和指数移动平均技术结合提取时间和光谱参数。使用最佳可能的时间分辨率,以点对点提取功能,使每个样本可以单独分类为一个可用组。测试群集算法或简单的K-means被测试以评估最终的点对点分类结果,因此是关节音频检测分类索引。提取的音频检测,分段和分类结果可以合并到适当的描述方案中,这些方案将向内容描述和管理目的进行注释音频事件/段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号