首页> 外文会议>International Conference on Signals and Electronic Systems >An auditory-based scene change detection in audio data
【24h】

An auditory-based scene change detection in audio data

机译:音频数据中基于听觉的场景变化检测

获取原文

摘要

The problem of auditory scene segmentation plays important role in many audio analysis and processing tasks. The accuracy and robustness of segmentation step have influence on the remaining stages in audio processing chain. In the work a dedicated system for segmentation of the audio stream is presented. The segmentation scheme uses Delta-BIC and metric-based techniques to determine the change-point in audio data. For this purpose a dedicated auditory feature has been proposed, which is based on the gammatone filter bank. The proposed feature (GTEAD) has been designed using inter-channel analysis of the auditory filter bank outputs. For each channel, the temporal envelope and its periodic self-similarities have been calculated. Then, the distances between obtained signals from the neighbouring channels have been computed resulting in the final feature vector. The performance of the GTEAD feature has been compared to the popular MFCC feature using database of audio streams with defined single change-point in each example. The obtained results show that GTEAD feature outperforms MFCC feature in terms of accuracy and the number of detected points.
机译:听觉场景分割问题在许多音频分析和处理任务中起着重要作用。分割步骤的准确性和稳健性对音频处理链中的剩余阶段有影响。在工作中,呈现了用于分段的专用系统。分割方案使用三角形和基于度量的技术来确定音频数据中的变化点。为此目的,已经提出了专用的听觉特征,其基于伽马托滤波器。所提出的特征(GTEAD)已经使用了听觉滤波器组输出的间间分析设计。对于每个频道,已经计算了时间包络及其周期性自相同度。然后,已经计算了来自相邻信道的所获得的信号之间的距离导致最终特征向量。使用每个示例中使用具有定义单个变化点的音频流数据库的流行MFCC功能进行了与流行的MFCC功能的性能。所得结果表明,GTEAD特征在准确性和检测点的数量方面优于MFCC特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号