首页> 外文会议>Asian Conference on Computer Vision >A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition

【24h】

A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition

机译：编码、池和规范化方法在动作识别中的比较研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bag of visual words (BoVW) models have been widely and successfully used in video based action recognition. One key step in constructing BoVW representation is to encode feature with a codebook. Recently, a number of new encoding methods have been developed to improve the performance of BoVW based object recognition and scene classification, such as soft assignment encoding [1], sparse encoding [2], locality-constrained linear encoding [3] and Fisher kernel encoding [4]. However, their effects for action recognition are still unknown. The main objective of this paper is to evaluate and compare these new encoding methods in the context of video based action recognition. We also analyze and evaluate the combination of encoding methods with different pooling and normalization strategies. We carry out experiments on KTH dataset [5] and HMDB51 dataset [6]. The results show the new encoding methods can significantly improve the recognition accuracy compared with classical VQ. Among them, Fisher kernel encoding and sparse encoding have the best performance. By properly choosing pooling and normalization methods, we achieve the state-of-the-art performance on HMDB51.

机译：视觉单词包（BoVW）模型在基于视频的动作识别中得到了广泛而成功的应用。构建BoVW表示的一个关键步骤是使用码本对特征进行编码。最近，为了提高基于BoVW的目标识别和场景分类的性能，人们开发了一些新的编码方法，如软分配编码[1]、稀疏编码[2]、局部约束线性编码[3]和Fisher核编码[4]。然而，它们对动作识别的影响尚不清楚。本文的主要目的是在基于视频的动作识别中评估和比较这些新的编码方法。我们还分析和评估了编码方法与不同的池和规范化策略的组合。我们在KTH数据集[5]和HMDB51数据集[6]上进行了实验。结果表明，与经典矢量量化方法相比，新的编码方法可以显著提高识别精度。其中，Fisher核编码和稀疏编码的性能最好。通过正确选择池和规范化方法，我们在HMDB51上实现了最先进的性能。

著录项

来源
《Asian Conference on Computer Vision》|2013年||共14页
会议地点
作者
Xingxing Wang; LiMin Wang; Yu Qiao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词

相似文献

外文文献
中文文献
专利

1. Comparative Study on 4 EIA Kits for Screening Antibody to Hepatitis C Virus in Pooled Sera [J] . 生物医学与环境科学（英文版） . 1998,第001期
2. Illumination compensation and normalization in eigenspace-based face recognition: A comparative study of different pre-processing approaches [J] . Javier Ruiz-del-Solar, Julio Quinteros Pattern recognition letters . 2008,第14期

机译：基于特征空间的人脸识别中的光照补偿和归一化：不同预处理方法的比较研究
3. Human action recognition based on multi-layer Fisher vector encoding method [J] . Sekma Manel, Mejdoub Mahmoud, Ben Amer Chokri Pattern recognition letters . 2015,第NOVa1期

机译：基于多层Fisher矢量编码方法的人体动作识别
4. A Comparative Study of 2D PCA Face Recognition Method with Other Statistically Based Face Recognition Methods [J] . R Senthilkumar, R K Gnanamurthy Journal of The Institution of Engineers (India): Series B . 2016,第3期

机译：二维PCA人脸识别方法与其他基于统计的人脸识别方法的比较研究
5. A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition [C] . Xingxing Wang, LiMin Wang, Yu Qiao Asian conference on computer vision . 2013

机译：动作识别的编码，合并和规范化方法的比较研究
6. Feature Extraction and Encoding for Video Action Recognition [D] . Zuo, Zheming. 2020

机译：视频动作识别特征提取和编码
7. Investigation of the mechanism of action of atmospheric pollutants on the central nervous system and comparative evaluation of methods of study. [O] . A I Bokina, N D Eksler, A D Semenenko, 1976

机译：大气污染物对中枢神经系统作用机理的研究及研究方法的比较评价。
8. Feature Encodings and Poolings for Action and Event Recognition: A Comprehensive Survey [O] . Changyu Liu, Qian Zhang, Bin Lu, 2017

机译：行动和事件识别的特征编码和汇集：综合调查
9. Multiperspective Recognition Applied to the Computer-Aided Medical Diagnosis - A Comparative Study of Methods. [R] . Kurzynski, M. W., Puchala, E. 2001

机译：多目标识别在计算机辅助医学诊断中的应用 - 方法比较研究。

A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition

摘要

著录项

相似文献

相关主题

期刊订阅