首页> 外文期刊>Sensors Journal, IEEE >Versatile Recognition Using Haar-Like Feature and Cascaded Classifier
【24h】

Versatile Recognition Using Haar-Like Feature and Cascaded Classifier

机译:使用Haar-Like特征和级联分类器的多功能识别

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a world first versatile recognition algorithm suitable for processing images, sound and acceleration signals simultaneously with extremely low calculation cost while maintaining high recognition rates. There are three main contributions. The first is the introduction of a versatile recognition using Haar-like feature for images, sound and acceleration signals. The novel 1-D Haar-like features are proposed as very rough band pass filters for signals in temporal dimension. The second is a content-aware classifier which is based on the cascaded classifier and positive estimation. The cascaded classifier with positive estimation is introduced to allow a sensor node to computes finely only when the inputs are target-like and difficult to recognize, and stop computing when inputs obtain enough confidence. The third is a method of intermediate signal representation called Integral Signals and $Delta$-Integral Signals for calculation cost reduction in Haar-like feature based recognition. In this paper, the proposed recognition is experimented for a variety of sound recognition applications such as speechon-speech, gender, speaker, emotion, and environmental sounds recognition. The preliminary results on human activity recognition and face detection are also given to show the versatility. The proposed algorithm yields sound recognition performance comparable to the conventional state-of-art method called MFCC while 96%–99% efficient in terms of the total amount of add and multiply operations. The proposed algorithm is evaluated with a versatile recognition processor implemented in 90-nm CMOS technology . For speechonspeech classification on 8-kHz 8-bit sound, the power consumption per frame rate is 0.28 $mu{rm W/fps}$. When the sensor is operated with a d-nuty ratio of 1%, the power consumption is reduced to 28.5 $mu{rm W}$.
机译:本文介绍了世界上首个通用识别算法,该算法适用于同时处理图像,声音和加速度信号,并且计算成本极低,同时又保持较高的识别率。主要有三点贡献。首先是采用类似于Haar的功能对图像,声音和加速度信号进行通用识别的介绍。提出了新颖的一维类似Haar的特征,作为在时间维度上非常粗糙的带通滤波器。第二个是基于级联分类器和肯定估计的内容感知分类器。引入具有正估计的级联分类器,以允许传感器节点仅在输入类似于目标且难以识别时才进行精细计算,并在输入获得足够的置信度时停止计算。第三种是中间信号表示方法,称为积分信号和$ Delta $-积分信号,用于在基于Haar的特征识别中计算成本降低。在本文中,对提议的识别进行了各种语音识别应用的实验,例如语音/非语音,性别,说话者,情感和环境声音识别。人体活动识别和面部检测的初步结果也表明了其多功能性。提出的算法产生的声音识别性能可与传统的最新技术MFCC相提并论,而加法和乘法运算的总效率为96%–99%。利用在90纳米CMOS技术中实现的通用识别处理器对提出的算法进行了评估。对于8 kHz 8位声音的语音/非语音分类,每帧速率的功耗为0.28 $ mu {rm W / fps} $。当传感器以1%的坚果比率运行时,功耗降低到28.5 $ mu {rm W} $。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号