首页> 外文会议>2016 International Conference on Next Generation Intelligent Systems >Fusion of audio visual cues for vehicle classification
【24h】

Fusion of audio visual cues for vehicle classification

机译:融合视听提示进行车辆分类

获取原文
获取原文并翻译 | 示例

摘要

Road planning and traffic monitoring is conducted based on survey of traffic volume. In recent years many of researchers have developed vision and audio based techniques for detection and classification of moving vehicles. Audio based technique suffers from low accuracy but has low computational cost. Then there is visual based approach which has significantly higher accuracy but demands high computational resources. This paper proposes a new approach which utilizes both audio and video of traffic data to perform traffic volume survey. Vehicle detection can be done from audio signal. Video frames around audio peaks are selectively extracted. Then visual feature vectors are extracted from the binary image of the vehicle. Audio features represented using Mel Frequency Cepstral Coefficients (MFCC) are extracted from the regions around the vehicle peak. Classification is done using multilayer feed forward neural network which gave an overall classification accuracy of 92.67% for seven vehicle classes with the chosen set of audio visual features.
机译:道路规划和交通监控基于对交通量的调查进行。近年来,许多研究人员已经开发了基于视觉和音频的技术来检测和分类行驶中的车辆。基于音频的技术具有较低的准确性,但是具有较低的计算成本。然后是基于视觉的方法,该方法具有更高的准确性,但需要大量的计算资源。本文提出了一种利用交通数据的音频和视频进行交通量调查的新方法。可以从音频信号进行车辆检测。音频峰值周围的视频帧被有选择地提取。然后从车辆的二值图像中提取视觉特征向量。从车辆峰值周围的区域中提取使用梅尔频率倒谱系数(MFCC)表示的音频特征。使用多层前馈神经网络进行分类,该网络对具有所选视听功能的七种车辆类别的总体分类精度为92.67%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号