Automatic Analysis of Speech and Acoustic Events for Ambient Assisted Living

Abstract

We present a prototype of an ambient assisted living (AAL) environment with multimodal user interaction. In our research, the AAL environment is a single studio room of more than 60 square meters that contains several tables, chairs, and a sink, and is equipped with four stationary microphones and two omnidirectional video cameras. In this paper, we focus mainly on audio signal processing techniques for monitoring the assistive smart space and on the recognition of speech and non-speech acoustic events, with the aim of automatically analyzing human activities and detecting possible emergency situations in which the user needs urgent help. Acoustic modeling in our audio recognition system is based on first-order Hidden Markov Models with Gaussian Mixture Models. The recognition vocabulary includes 12 non-speech acoustic events for different types of human activities plus 5 useful spoken commands (keywords), including a subset of alarm audio events. We collected an audio-visual corpus containing about 1.3 h of audio data from 5 testers, who performed the proposed test scenarios, and carried out practical experiments with the system; the results are reported in this paper.
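
To make the acoustic modeling statement concrete, the following is a minimal sketch of how a Hidden Markov Model with Gaussian Mixture Model emissions can score a sequence of feature frames, and how one model per class can be used to pick the most likely event. It is not the system described in the abstract: the first-order transition structure, the two-state left-to-right topology, the one-dimensional features, the parameter values, and the class names `event_a` and `event_b` are all assumptions chosen for illustration.

```python
import math

NEG_INF = float("-inf")

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) that tolerates -inf entries."""
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))

def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - mu) ** 2 / v)
        for xi, mu, v in zip(x, mean, var)
    )

def log_gmm(x, gmm):
    """Log density of a mixture given as a list of (weight, mean, var)."""
    return log_sum_exp(
        [math.log(w) + log_gaussian_diag(x, mu, var) for w, mu, var in gmm]
    )

def hmm_log_likelihood(frames, log_init, log_trans, state_gmms):
    """Forward algorithm in the log domain for a first-order HMM."""
    n = len(state_gmms)
    alpha = [log_init[s] + log_gmm(frames[0], state_gmms[s]) for s in range(n)]
    for x in frames[1:]:
        alpha = [
            log_sum_exp([alpha[p] + log_trans[p][s] for p in range(n)])
            + log_gmm(x, state_gmms[s])
            for s in range(n)
        ]
    return log_sum_exp(alpha)

def make_model(state_means):
    """Hypothetical two-state left-to-right model with 2-component GMMs."""
    log_init = [0.0, NEG_INF]                      # always start in state 0
    log_trans = [[math.log(0.6), math.log(0.4)],   # stay or move forward
                 [NEG_INF, 0.0]]                   # final state is absorbing
    gmms = [[(0.5, [m - 0.5], [1.0]), (0.5, [m + 0.5], [1.0])]
            for m in state_means]
    return log_init, log_trans, gmms

def classify(frames, models):
    """Return the label whose model gives the highest log-likelihood."""
    return max(models, key=lambda k: hmm_log_likelihood(frames, *models[k]))

# Illustrative class models; the labels and parameters are made up.
MODELS = {
    "event_a": make_model([0.0, 1.0]),
    "event_b": make_model([5.0, 6.0]),
}
```

Calling `classify([[0.1], [0.4], [1.2]], MODELS)` selects the model whose state means lie closest to the observed frames. In a real recognizer the frames would be multi-dimensional acoustic features and the parameters would be estimated from training data rather than set by hand.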