首页> 外文会议>IEEE Workshop on Spoken Language Technology >Background-tracking acoustic features for genre identification of broadcast shows
【24h】

Background-tracking acoustic features for genre identification of broadcast shows

机译:背景跟踪声学特征,用于广播节目​​的体裁识别

获取原文

摘要

This paper presents a novel method for extracting acoustic features that characterise the background environment in audio recordings. These features are based on the output of an alignment that fits multiple parallel background-based Constrained Maximum Likelihood Linear Regression transformations asynchronously to the input audio signal. With this setup, the resulting features can track changes in the audio background like appearance and disappearance of music, applause or laughter, independently of the speakers in the foreground of the audio. The ability to provide this type of acoustic description in audiovisual data has many potential applications, including automatic classification of broadcast archives or improving automatic transcription and subtitling. In this paper, the performance of these features in a genre identification task in a set of 332 BBC shows is explored. The proposed background-tracking features outperform short-term Perceptual Linear Prediction features in this task using Gaussian Mixture Model classifiers (62% vs 72% accuracy). The use of more complex classifiers, Hidden Markov Models and Support Vector Machines, increases the performance of the system with the novel background-tracking features to 79% and 81% in accuracy respectively.
机译:本文提出了一种提取声音特征的新方法,该特征表征了录音中的背景环境。这些功能是基于对齐的输出,该对齐的输出与输入音频信号异步地拟合多个基于并行背景的“受约束的最大似然线性回归”转换。使用此设置,结果功能可以独立于音频前景中的扬声器,跟踪音频背景中的变化,例如音乐的出现和消失,掌声或笑声。在视听数据中提供这种声音描述的能力具有许多潜在的应用,包括广播档案的自动分类或改进自动转录和字幕显示。在本文中,探讨了在332个BBC节目集中的类型识别任务中这些功能的性能。在使用高斯混合模型分类器的任务中,拟议的背景跟踪功能优于短期感知线性预测功能(准确度为62%vs 72%)。使用更复杂的分类器,隐马尔可夫模型和支持向量机,可将具有新颖背景跟踪功能的系统的准确度分别提高到79%和81%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号