首页> 美国卫生研究院文献>PLoS Clinical Trials >A proposal of prior probability-oriented clustering in feature encoding strategies
【2h】

A proposal of prior probability-oriented clustering in feature encoding strategies

机译:关于特征编码策略中先验概率聚类的建议

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Codebook-based feature encodings are a standard framework for image recognition issues. A codebook is usually constructed by clusterings, such as the k-means and the Gaussian Mixture Model (GMM). A codebook size is an important factor to decide the trade-off between recognition performance and computational complexity and a traditional framework has the disadvantage to image recognition issues when a large codebook; the number of unique clusters becomes smaller than a designated codebook size because some clusters converge to close positions. This paper focusses on the disadvantage from a perspective of the distribution of prior probabilities and presents a clustering framework including two objectives that are alternated to the k-means and the GMM. Our approach is first evaluated with synthetic clustering datasets to analyze a difference to traditional clustering. In the experiment section, although our approach alternated to the k-means generates similar results to the k-means results, our approach is able to finely tune clusters for our objective. Our approach alternated to the GMM significantly improves our objective and constructs intuitively appropriate clusters, especially for huge and complicatedly distributed samples. In the experiment on image recognition issues, two state-of-the-art encodings, the Fisher Vector (FV) using the GMM and the Vector of Locally Aggregated Descriptors (VLAD) using the k-means, are evaluated with two publicly available image datasets, the Birds and the Butterflies. For the results of the VLAD with our approach, the recognition performances tend to be worse compared to the original VLAD results. On the other hand, the FV using our approach is able to improve the performance, especially in a larger codebook size.
机译:基于密码本的功能编码是解决图像识别问题的标准框架。码本通常由聚类构成,例如k均值和高斯混合模型(GMM)。码本的大小是决定识别性能和计算复杂度之间权衡的重要因素,而传统的框架在大型码本下具有图像识别问题的缺点。唯一簇的数量变得小于指定的码本大小,因为某些簇会聚到接近的位置。本文从先验概率的分布角度着眼于劣势,并提出了一个包含两个目标的聚类框架,这两个目标分别替代了k均值和GMM。首先使用合成聚类数据集评估我们的方法,以分析与传统聚类的差异。在实验部分,尽管我们的方法替代了k均值,其结果与k均值的结果相似,但我们的方法仍能够针对我们的目标微调聚类。我们替代GMM的方法极大地改善了我们的目标,并构建了直观上合适的聚类,尤其是对于庞大且复杂分布的样本。在图像识别问题的实验中,使用两个公开可用的图像评估了两种最先进的编码:使用GMM的Fisher向量(FV)和使用k均值的局部聚集描述符向量(VLAD)数据集,鸟类和蝴蝶。对于采用我们的方法的VLAD结果,与原始VLAD结果相比,识别性能往往更差。另一方面,使用我们的方法的FV可以提高性能,尤其是在较大的码本中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号