首页> 外文会议>ELMAR 2012 Proceedings. >Performance comparison of several techniques to detect keywords in audio streams and audio scene

【24h】

Performance comparison of several techniques to detect keywords in audio streams and audio scene

机译：几种检测音频流和音频场景中关键字的技术的性能比较

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper is focused on the task of detecting words of interest in an audio scene (a room, a lab or a workshop) or in a continually recorded stream of speech, music and other sounds. The solution of this task is important in many applications, e.g. for command control in houses for handicapped persons, for automating some manufacturing and logistical operations, or for information retrieval from large audio archives. We investigate the use of three keyword spotting techniques and compare them with a classic large vocabulary sp eech reco gnit ion sy st em. To evaluat e t heir performance, we specified and studied two model applications: 1) search in large audio broadcast archive; 2) voice control of an interactive system. The investigated techniques were evaluated from several points of view, namely their speed (real-time factor), accuracy (equal error rate, figure of merit, receiver op erating characteristics), the demands for training data and the impact of different types of noise.

机译：本文的重点是在音频场景（房间，实验室或车间）或连续记录的语音，音乐和其他声音流中检测感兴趣的单词的任务。此任务的解决方案在许多应用中都很重要，例如用于残疾人的房屋中的命令控制，一些制造和物流操作的自动化，或从大型音频档案中检索信息。我们研究了三种关键字发现技术的使用，并将它们与经典的大词汇表语音识别系统进行比较。为了评估继承人的表现，我们指定并研究了两个模型应用程序：1）在大型音频广播档案中搜索； 2）交互式系统的语音控制。从多个角度对研究的技术进行了评估，即它们的速度（实时因子），准确性（相等错误率，品质因数，接收机工作特性），对训练数据的需求以及不同类型噪声的影响。

著录项

来源
《ELMAR 2012 Proceedings. 》|2012年|p.215- 218|共4页
会议地点 Zadar(HR);Zadar(HR)
作者
Bohac Marek;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类水路运输技术管理 ; 水路运输技术管理 ;
关键词

相似文献

外文文献
中文文献
专利

1. Using roadside surveys to detect short-eared owls: A comparison of visual and audio techniques [J] . Matt D. Larson, Denver W. Holt Wildlife Society Bulletin . 2016 ,第2期

机译：使用路边调查来检测短耳猫头鹰：视觉和音频技术的比较
2. Discussion on ????????Improvements in the precision measurement of capacitance????????, ????????The design of an audio-frequency amplifier for high-precision voltage measurement????????, ????????The design and performance of high-precision audio-frequency current transformers???????? and ????????Techniques for the calibration of standard current transformers up to 20 kc/s???????? before the Measurement and Control Section, 10th January, 1961 [J] . Proceedings of the IEE - Part B: Electronic and Communication Engineering . 1961 ,第39期

机译：关于电容精度测量的改进的讨论高精度测量电压的音频放大器的设计高精度音频电流互感器的设计与性能以及用于标定高达20 kc / s的标准电流互感器的技术1961年1月10日，在测量与控制科之前
3. The authors' replies to the discussion on ????????Improvements in the precision measurement of capacitance????????, ????????The design of an audio-frequency amplifier for high-precision voltage measurement????????, ????????The design and performance of high-precision audio-frequency current transformers???????? and ????????Techniques for the calibration of standard current transformers up to 20 kc/s???????? [J] . Rayner G.H., Ford L.H., Harkness S., Proceedings of the IEE - Part B: Electronic and Communication Engineering . 1961 ,第39期

机译：作者对?????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????--------高精度电压测量高精度音频电流互感器的设计与性能以及用于标定高达20 kc / s的标准电流互感器的技术
4. Performance comparison of several techniques to detect keywords in audio streams and audio scene [C] . Bohac Marek Croatian Society Electronics in Marine international symposium . 2012

机译：若干技术在音频流和音频场景中检测关键字的几种技术的性能比较
5. Comparison of classification techniques for speech/audio applications. [D] . Shao, Ying. 2003

机译：语音/音频应用分类技术的比较。
6. Comparison of Two Brushing Methods- Fone’s vs Modified Bass Method in Visually Impaired Children Using the Audio Tactile Performance (ATP) Technique [O] . Chrishantha Joybell, Ramesh Krishnan, Suresh Kumar V 2015

机译：使用音频触觉表现（ATP）技术对视力障碍儿童的Fone和改良Bass两种刷牙方法进行比较
7. Prediction of hearing thresholds: Comparison of cortical evoked response audiometry and auditory steady state response audiometry techniques [O] . Yeung KNK, Wong LLN 2007

机译：听力阈值的预测：皮质诱发反应测听和听觉稳态反应测听技术的比较
8. Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogeneous Audio Streams. [R] . Akbacak, M., Burget, L., Wang, W., 2013

机译：用于噪声和声学异构音频流中关键字定位的丰富系统组合。

Performance comparison of several techniques to detect keywords in audio streams and audio scene

摘要

著录项

相似文献

相关主题

期刊订阅