...
首页> 外文期刊>Journal of visual communication & image representation >q-Gaussian mixture models for image and video semantic indexing
【24h】

q-Gaussian mixture models for image and video semantic indexing

机译:用于图像和视频语义索引的q高斯混合模型

获取原文
获取原文并翻译 | 示例

摘要

Gaussian mixture models which extend Bag-of-Visual-Words (BoW) to a probabilistic framework have been proved to be effective for image and video semantic indexing. Recently, the q-Gaussian distribution, derived from Tsallis statistics [11], has been shown to be useful for representing patterns in many complex systems in physics. We propose q-Gaussian mixture models (q-GMMs), mixture models of q-Gaussian distributions with a parameter q to control its tail-heaviness, for image and video semantic indexing [1]. The long-tailed distributions obtained for q > 1 are expected to effectively represent complexly correlated data, and hence, to improve robustness against outliers. The main improvements over our previous study [1] are q-GMM super-vector representation to efficiently compute the q-GMM kernel, and detailed experimental analysis showing accuracy and testing-cost comparison with recent kernel methods. Our proposed method outperformed BoW and achieved 49.42% and 10.90% in Mean Average Precision on the PASCAL VOC 2010 and the TRECVID 2010 Semantic Indexing, respectively.
机译:高斯混合模型将视觉词袋(BoW)扩展到概率框架,已被证明对图像和视频语义索引有效。最近,从Tsallis统计[11]导出的q-Gaussian分布已显示出可用于表示物理学中许多复杂系统中的模式。我们提出了q-高斯混合模型(q-GMMs),q-Gaussian分布的混合模型,带有参数q来控制其尾部沉重度,用于图像和视频语义索引[1]。对于q> 1所获得的长尾分布,有望有效地表示复杂相关的数据,从而提高针对异常值的鲁棒性。对我们之前的研究[1]的主要改进是有效地计算q-GMM内核的q-GMM超向量表示,以及详细的实验分析,显示了与最新内核方法的准确性和测试成本的比较。我们提出的方法优于BoW,在PASCAL VOC 2010和TRECVID 2010语义索引上的平均平均精度分别达到49.42%和10.90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号