...
首页> 外文期刊>IEEE transactions on multimedia >Dynamic Texture Classification Using Unsupervised 3D Filter Learning and Local Binary Encoding
【24h】

Dynamic Texture Classification Using Unsupervised 3D Filter Learning and Local Binary Encoding

机译:使用无监督3D滤波器学习和局部二进制编码的动态纹理分类

获取原文
获取原文并翻译 | 示例

摘要

Local binary descriptors, such as local binary pattern (LBP) and its various variants, have been studied extensively in texture and dynamic texture analysis due to their outstanding characteristics, such as grayscale invariance, low computational complexity and good discriminability. Most existing local binary feature extraction methods extract spatio-temporal features from three orthogonal planes of a spatio-temporal volume by viewing a dynamic texture in 3D space. For a given pixel in a video, only a proportion of its surrounding pixels is incorporated in the local binary feature extraction process. We argue that the ignored pixels contain discriminative information that should be explored. To fully utilize the information conveyed by all the pixels in a local neighborhood, we propose extracting local binary features from the spatio-temporal domain with 3D filters that are learned in an unsupervised manner so that the discriminative features along both the spatial and temporal dimensions are captured simultaneously. The proposed approach consists of three components: 1) 3D filtering; 2) binary hashing; and 3) joint histogramming. Densely sampled 3D blocks of a dynamic texture are first normalized to have zero mean and are then filtered by 3D filters that are learned in advance. To preserve more of the structure information, the filter response vectors are decomposed into two complementary components, namely, the signs and the magnitudes, which are further encoded separately into binary codes. The local mean pixels of the 3D blocks are also converted into binary codes. Finally, three types of binary codes are combined via joint or hybrid histograms for the final feature representation. Extensive experiments are conducted on three commonly used dynamic texture databases: 1) UCLA; 2) DynTex; and 3) YUVL. The proposed method provides comparable results to, and even outperforms, many state-of-the-art methods.
机译:局部二进制描述符,例如局部二进制模式(LBP)及其各种变体,由于其出色的特性(例如灰度不变性,低计算复杂度和良好的可分辨性)而在纹理和动态纹理分析中得到了广泛的研究。大多数现有的局部二值特征提取方法都是通过在3D空间中查看动态纹理,从时空体积的三个正交平面中提取时空特征。对于视频中的给定像素,在本地二进制特征提取过程中仅合并了一部分周围像素。我们认为被忽略的像素包含应探索的区分性信息。为了充分利用局部邻域中所有像素传达的信息,我们建议使用以无监督方式学习的3D滤波器从时空域中提取局部二元特征,以便沿空间和时间维度识别特征同时捕获。所提出的方法包括三个部分:1)3D过滤; 2)二进制哈希; 3)联合直方图。首先将动态纹理的密集采样3D块标准化为零均值,然后通过预先学习的3D滤波器进行滤波。为了保留更多的结构信息,将滤波器响应向量分解为两个互补分量,即符号和幅度,然后分别将其编码为二进制代码。 3D块的局部平均像素也被转换成二进制代码。最后,通过联合或混合直方图将三种类型的二进制代码组合起来,以表示最终的特征。在三个常用的动态纹理数据库上进行了广泛的实验:1)UCLA; 2)DynTex;和3)YUVL。所提出的方法可提供与许多最新方法相当的结果,甚至优于其他许多最新方法。

著录项

  • 来源
    《IEEE transactions on multimedia 》 |2019年第7期| 1694-1708| 共15页
  • 作者单位

    Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China|Univ Oulu, Ctr Machine Vis & Signal Anal, FI-90014 Oulu, Finland;

    Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China;

    Univ Oulu, Ctr Machine Vis & Signal Anal, FI-90014 Oulu, Finland|Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Hunan, Peoples R China;

    Univ Oulu, Ctr Machine Vis & Signal Anal, FI-90014 Oulu, Finland;

    Southeast Univ, Sch Biol Sci & Med Engn, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Jiangsu, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Dynamic texture; motion; feature extraction; local binary pattern;

    机译:动态纹理;运动;特征提取;局部二值模式;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号