...
首页> 外文期刊>Applied Acoustics >Detection and classification of human-produced nonverbal audio events
【24h】

Detection and classification of human-produced nonverbal audio events

机译:人生成的非语言音频事件的检测和分类

获取原文
获取原文并翻译 | 示例
           

摘要

Audio wearable devices, or hearables, are becoming an increasingly popular consumer product. Some of these hearables contain an in-ear microphone to capture audio signals inside the user's occluded earcanal. Mainly, the microphone is used to pick up speech in noisy environments, but it can also capture other signals, such as nonverbal events that could be used to interact with the device or a computer. Teeth or tongue clicking could be used to interact with a device in a discreet manner, and coughing or throat-clearing sounds could be used to monitor the health of a user. In this paper, 10 human produced nonverbal audio events are detected and classified in real-time with a classifier using the Bag-of-Audio-Words algorithm. To build this algorithm, different clustering and classification methods are compared. Mel-Frequency Cepstral Coefficient features are used alongside Auditory-inspired Amplitude Modulation features and Per-Channel Energy Normalization features. To combine the different features, concatenation performance at the input level and at the histogram level is compared. The real-time detector is built using the detection by classification technique, classifying on a 400 ms window with 75% overlap. The detector is tested in a controlled noisy environment on 10 subjects. The classifier had a sensitivity of 81.5% while the detector using the same classifier had a sensitivity of 69.9% in a quiet environment. (C) 2020 Elsevier Ltd. All rights reserved.
机译:音频可穿戴设备或听力,正在成为消费产品越来越受欢迎的消费产品。其中一些可听见的是包含一个耳内麦克风,用于捕获用户封闭的耳机内的音频信号。主要是,麦克风用于在嘈杂的环境中接收语音,但它也可以捕获其他信号,例如可用于与设备或计算机交互的非言语事件。牙齿或舌头可以用来以谨慎的方式与设备交互,并且可以使用咳嗽或咽喉清除声音来监测用户的健康。在本文中,使用禁止音频字算法实时检测和分类10个人产生的非语言音频事件。为了构建该算法,比较了不同的聚类和分类方法。熔融频率谱系距特征与听觉启发的幅度调制特征和每个通道的能量归一化功能一起使用。要将不同的特征组合,比较输入级别和直方图级别的串联性能。使用分类技术检测建立实时检测器,在400毫秒的窗口上进行分类,75%重叠。检测器在10个受试者的受控嘈杂环境中进行测试。分类器的敏感性为81.5%,而使用相同分类器的探测器在安静的环境中的敏感度为69.9%。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号