Exploiting Temporal Feature Integration for Generalized Sound Recognition

Stavros Ntalampiras; Ilyas Potamitis; Nikos Fakotakis

Abstract

This paper presents a methodology that incorporates temporal feature integration for automated generalized sound recognition. Such a system can be of great use for scene analysis and understanding based on the acoustic modality. The performance of three feature sets, based on the Mel filterbank, the MPEG-7 audio protocol, and wavelet decomposition, is assessed. Furthermore, we explore the application of temporal integration using three different strategies: (a) short-term statistics, (b) spectral moments, and (c) autoregressive models. The experimental setup is thoroughly explained and is based on the concurrent usage of professional sound effects collections. In this way we try to form a representative picture of the characteristics of ten sound classes. During the first phase of our implementation, audio classification is achieved through statistical models (HMMs), while a fusion scheme that exploits the models constructed from the various feature sets provides the highest average recognition rate. The proposed system not only uses diverse groups of sound parameters but also exploits the advantages of temporal feature integration.
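
As an illustration of strategy (a), short-term statistics, the sketch below summarises consecutive windows of frame-level feature vectors by their per-dimension mean and variance. It is a minimal sketch, not the authors' implementation: the function name `integrate_short_term_stats`, the `window` and `hop` parameters, the use of population variance, and the toy input are all assumptions made for illustration, since the abstract does not specify these details.

```python
from typing import List, Sequence


def integrate_short_term_stats(
    frames: Sequence[Sequence[float]],
    window: int,
    hop: int,
) -> List[List[float]]:
    """Summarise each window of frame-level vectors by per-dimension
    mean and (population) variance.

    Hypothetical helper: the window length, hop size, and choice of
    statistics are illustrative assumptions, not values from the paper.
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")

    integrated: List[List[float]] = []
    # Slide over the frame sequence; windows that would run past the end are dropped.
    for start in range(0, len(frames) - window + 1, hop):
        block = frames[start:start + window]
        dims = len(block[0])
        means = [sum(f[d] for f in block) / window for d in range(dims)]
        variances = [
            sum((f[d] - means[d]) ** 2 for f in block) / window
            for d in range(dims)
        ]
        # One integrated vector per window: all means, then all variances.
        integrated.append(means + variances)
    return integrated


# Toy input: four 2-dimensional frame-level feature vectors (made-up values).
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
stats = integrate_short_term_stats(frames, window=2, hop=2)
```

Each output vector has twice the dimensionality of the input frames, so a classifier trained on the integrated features sees one observation per window rather than one per frame.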

Bibliographic details

Source
EURASIP Journal on Advances in Signal Processing | 2009, Issue 17 | 12 pages
Authors
Stavros Ntalampiras; Ilyas Potamitis; Nikos Fakotakis
Original format: PDF
Language: English
Chinese Library Classification: Communications
Keywords
Feature; Integration; Recognition
