International Conference on Intelligent User Interfaces

Leveraging User Input and Feedback for Interactive Sound Event Detection and Annotation



Abstract

Tagging environmental audio events is essential in many areas. However, finding sound events and labeling them within a long audio file is tedious and time-consuming. Building an automatic recognition system with modern machine learning is often infeasible, because it requires a large number of human-labeled training examples and is not reliable enough for all uses. I propose interactive sound event detection to address this problem by combining machine search with human tagging, focusing specifically on how effective various types of user input are for interactive sound search. The types of user input I will explore include binary relevance feedback, segmentation, and vocal imitation. I expect that leveraging one or a combination of these inputs will help users find audio content of interest quickly and accurately, even when there are too few training examples for a typical automated system.
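To make the binary-relevance-feedback idea concrete, the loop below is a minimal sketch of interactive sound search: candidate audio segments are represented as embedding vectors, ranked by similarity to a query, and the query is refined from the user's relevant/irrelevant judgments with a Rocchio-style update. All function names, the toy 2-D "embeddings", and the Rocchio weights are illustrative assumptions, not the system described in this abstract.

```python
# Hypothetical sketch of interactive sound-event search driven by binary
# relevance feedback. Segments would normally be embeddings of audio
# windows (e.g. from a pretrained audio model); here they are toy vectors.

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def rank(query, segments):
    """Indices of candidate segments, most similar to the query first."""
    return sorted(range(len(segments)),
                  key=lambda i: cosine(query, segments[i]),
                  reverse=True)

def refine_query(query, relevant, irrelevant,
                 alpha=1.0, beta=0.75, gamma=0.25):
    """Rocchio update: move the query toward segments the user marked
    relevant and away from those marked irrelevant."""
    new = [alpha * q for q in query]
    for seg in relevant:
        for i in range(len(new)):
            new[i] += beta * seg[i] / len(relevant)
    for seg in irrelevant:
        for i in range(len(new)):
            new[i] -= gamma * seg[i] / len(irrelevant)
    return new

# Toy example: segments 0 and 2 resemble the initial query, segment 1 does not.
segments = [[1.0, 0.1], [0.0, 1.0], [0.9, 0.2]]
query = [1.0, 0.0]
order = rank(query, segments)          # [0, 2, 1]
# Suppose the user confirms the top hit and rejects segment 1:
query = refine_query(query, [segments[order[0]]], [segments[1]])
```

Each round of feedback reshapes the query, so even with no labeled training set the search converges toward the sound the user has in mind; segmentation marks and vocal imitations could seed or replace the initial query vector in the same loop.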
