首页> 外文会议>International Conference on Systems, Signals and Image Processing >Feature contours fusion for determining segment boundaries in audio data
【24h】

Feature contours fusion for determining segment boundaries in audio data

机译:特征轮廓融合确定音频数据中的片段边界

获取原文

摘要

In this paper, an approach to audio segmentation based on fusion of feature trajectories is presented. The proposed scheme uses the variability of envelopes calculated from feature contours to determine change-points in audio stream. The contours are calculated by comparing adjacent frames utilizing distance or divergence functions. Such technique with the selected feature type can emphasize the change-points structure in the data. From calculated trajectory a Hilbert envelope is computed and the peaks and valleys are detected. As obtained results show, the positions of peaks and valleys are close to the actual position of segment boundaries. However, such situation leads to many miss and false alarm errors. Therefore, in our approach we have used a simple fusion based on averaged position with defined tolerance of several feature contours. We have performed an analysis of many features and functions to determine pairs with the high discriminatory power. In the result, for prepared audio stream we determined features and functions to obtain high accuracy of segmentation. The results show that utilizing our technique on audio with several audio classes can improve final detection accuracy.
机译:本文提出了一种基于特征轨迹融合的音频分割方法。所提出的方案使用从特征轮廓计算出的包络的可变性来确定音频流中的变化点。通过使用距离或散度函数比较相邻帧来计算轮廓。具有选定特征类型的此类技术可以强调数据中的变更点结构。根据计算的轨迹,计算希尔伯特包络,并检测峰和谷。如结果所示,峰和谷的位置接近段边界的实际位置。但是,这种情况会导致许多未命中和误报错误。因此,在我们的方法中,我们使用了基于平均位置的简单融合,并定义了多个特征轮廓的公差。我们已经对许多特征和功能进行了分析,以确定具有高区分能力的对。结果,对于准备好的音频流,我们确定了特征和功能以获得高精度的分割。结果表明,将我们的技术应用于具有多个音频类别的音频可以提高最终检测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号