Few-Shot Continual Learning for Audio Classification

Abstract

Supervised learning for audio classification typically imposes a fixed class vocabulary, which can be limiting for real-world applications where the target class vocabulary is not known a priori or changes dynamically. In this work, we introduce a few-shot continual learning framework for audio classification, where we can continuously expand a trained base classifier to recognize novel classes based on only few labeled data at inference time. This enables fast and interactive model updates by end-users with minimal human effort. To do so, we leverage the dynamic few-shot learning technique and adapt it to a challenging multi-label audio classification scenario. We incorporate a recent state-of-the-art audio feature extraction model as a backbone and perform a comparative analysis of our approach on two popular audio datasets (ESC-50 and AudioSet). We conduct an in-depth evaluation to illustrate the complexities of the problem and show that, while there is still room for improvement, our method outperforms three baselines on novel class detection while maintaining its performance on base classes.
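
The idea of expanding a trained classifier with a novel class from a handful of labeled examples can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a cosine-similarity classifier over precomputed embeddings, replaces the dynamic few-shot weight generator with simple averaging of the support embeddings, and uses independent sigmoid outputs to mimic the multi-label setting. The class name `ExpandableCosineClassifier`, its methods, the `scale` value, and the toy 3-dimensional embeddings are all hypothetical.

```python
import math


def normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


class ExpandableCosineClassifier:
    """Hypothetical multi-label classifier whose class list can grow at inference time."""

    def __init__(self, base_weights, scale=10.0):
        # One unit-norm weight vector per base class (assumed to be already trained).
        self.weights = [normalize(w) for w in base_weights]
        # Assumed temperature applied to the cosine similarity before the sigmoid.
        self.scale = scale

    def add_class(self, support_embeddings):
        """Add a novel class from a few labeled embeddings; base weights are untouched."""
        unit = [normalize(e) for e in support_embeddings]
        dim = len(unit[0])
        # Simplification: the new class weight is the mean of the support embeddings.
        mean = [sum(e[i] for e in unit) / len(unit) for i in range(dim)]
        self.weights.append(normalize(mean))
        return len(self.weights) - 1

    def predict_proba(self, embedding):
        """Return one independent probability per class (multi-label output)."""
        z = normalize(embedding)
        probs = []
        for w in self.weights:
            logit = self.scale * sum(a * b for a, b in zip(z, w))
            probs.append(1.0 / (1.0 + math.exp(-logit)))
        return probs


# Toy usage: two base classes, then one novel class added from two examples.
clf = ExpandableCosineClassifier([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
new_index = clf.add_class([[0.1, 0.0, 0.9], [0.0, 0.2, 1.1]])
probs = clf.predict_proba([0.0, 0.1, 1.0])
```

Because each class is scored by its own sigmoid, adding a class only appends a weight vector and leaves the base-class scores for a given input unchanged, which is the property the abstract describes as maintaining performance on base classes.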


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2021 | pp. 321-325 | 5 pages
Authors
Yu Wang; Nicholas J. Bryan; Mark Cartwright; Juan Pablo Bello; Justin Salamon;
Original format: PDF
Keywords
Training; Vocabulary; Adaptation models; Limiting; Error analysis; Supervised learning; Feature extraction;

