首页> 外文期刊>Multimedia, IEEE Transactions on >Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features
【24h】

Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features

机译:通过保留高维特征上的投影的可扩展移动视觉分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Scalable mobile visual classification—classifying images/videos in a large semantic space on mobile devices in real-time—is an emerging problem as observing the paradigm shift towards mobile platforms and the explosive growth of visual data. Though seeing the advances in detecting thousands of concepts in the servers, the scalability is handicapped in mobile devices due to the severe resource constraints within. However, certain emerging applications require such scalable visual classification with prompt response for detecting local contexts (e.g., Google Glass) or ensuring user satisfaction. In this work, we point out the ignored challenges for scalable mobile visual classification and provide a feasible solution. To overcome the limitations of mobile visual classification, we propose an unsupervised linear dimension reduction algorithm, kernel preserving projection (KPP), which approximates the kernel matrix of high dimensional features with low dimensional linear embedding. We further introduce sparsity to the projection matrix to ensure its compliance with mobile computing (with merely 12% non-zero entries). By inspecting the similarity of linear dimension reduction with low-rank linear distance metric and Taylor expansion of RBF kernel, we justified the feasibility for the proposed KPP method over high-dimensional features. Experimental results on three public datasets confirm that the proposed method outperforms existing dimension reduction methods. What is even more, we can greatly reduce the storage consumption and efficiently compute the classification results on the mobile devices.
机译:随着观察范式向移动平台的转移以及视觉数据的爆炸性增长,可扩展的移动视觉分类(在移动设备上的大型语义空间中实时对图像/视频进行实时分类)是一个新兴问题。尽管看到了检测服务器中成千上万个概念的进步,但是由于内部的严格资源限制,可伸缩性在移动设备中受到了限制。但是,某些新兴应用程序需要具有可伸缩性的可视化分类,并具有快速响应,以检测本地上下文(例如Google Glass)或确保用户满意度。在这项工作中,我们指出了可伸缩的移动视觉分类所面临的挑战,并提供了可行的解决方案。为了克服移动视觉分类的局限性,我们提出了一种无监督的线性降维算法,即核保留投影(KPP),该算法通过低维线性嵌入来近似高维特征的核矩阵。我们进一步将稀疏性引入投影矩阵,以确保其与移动计算兼容(只有12%的非零条目)。通过检查低阶线性距离度量和RBF核的泰勒展开的线性降维的相似性,我们证明了提出的KPP方法在高维特征上的可行性。在三个公共数据集上的实验结果证实,该方法优于现有的降维方法。更重要的是,我们可以大大减少存储消耗并有效地在移动设备上计算分类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号