Background-tracking acoustic features for genre identification of broadcast shows

Abstract

This paper presents a novel method for extracting acoustic features that characterise the background environment in audio recordings. These features are based on the output of an alignment that fits multiple parallel background-based Constrained Maximum Likelihood Linear Regression (CMLLR) transformations asynchronously to the input audio signal. With this setup, the resulting features can track changes in the audio background, such as the appearance and disappearance of music, applause or laughter, independently of the speakers in the foreground of the audio. The ability to provide this type of acoustic description in audiovisual data has many potential applications, including automatic classification of broadcast archives and improved automatic transcription and subtitling. The paper explores the performance of these features in a genre identification task on a set of 332 BBC shows. With Gaussian Mixture Model classifiers, the proposed background-tracking features outperform short-term Perceptual Linear Prediction features in this task (72% vs 62% accuracy). More complex classifiers, Hidden Markov Models and Support Vector Machines, raise the accuracy of the system with the background-tracking features to 79% and 81% respectively.

Bibliographic details

Source
IEEE Workshop on Spoken Language Technology | 2014 | pp. 118-123 | 6 pages
Authors
Saz Oscar; Doulaty Mortaza; Hain Thomas
Original format
PDF
Keywords
Acoustic background; broadcast data; genre identification

Similar documents

1. Extraction of acoustic features based on auditory spike code and its application to music genre classification [J]. Shin Seong-Hyeon, Yun Ho-Won, Jang Woo-Jin. Signal Processing, IET. 2019, No. 2
2. Acoustic Features Influence Musical Choices Across Multiple Genres [J]. Michael D. Barone, Jotthi Bansal, Matthew H. Woolhouse. Frontiers in Psychology. 2017, No. 4
3. Combining visual and acoustic features for music genre classification [J]. Nanni Loris, Costa Yandre M. G., Lumini Alessandra. Expert Systems with Applications. 2016, No. MARa
4. Background-tracking acoustic features for genre identification of broadcast shows [C]. Saz Oscar, Doulaty Mortaza, Hain Thomas. IEEE Workshop on Spoken Language Technology. 2014
5. MODELLING AND SIGNAL PROCESSING FOR IDENTIFICATION OF OCEAN SUBSURFACE FEATURES FROM ACOUSTIC REFLECTIONS [D]. EL-HAWARY, FERIAL. 1981
6. Acoustic Features Influence Musical Choices Across Multiple Genres [O]. Michael D. Barone, Jotthi Bansal, Matthew H. Woolhouse
7. Background-tracking Acoustic Features for Genre Identification of Broadcast Shows [O]. Saz, Oscar; Doulaty, Mortaza; Hain, Thomas. 2015
