q-Gaussian mixture models for image and video semantic indexing

Nakamasa Inoue; Koichi Shinoda


Abstract

Gaussian mixture models, which extend Bag-of-Visual-Words (BoW) to a probabilistic framework, have proven effective for image and video semantic indexing. Recently, the q-Gaussian distribution, derived from Tsallis statistics [11], has been shown to be useful for representing patterns in many complex systems in physics. We propose q-Gaussian mixture models (q-GMMs), mixture models of q-Gaussian distributions in which a parameter q controls tail heaviness, for image and video semantic indexing [1]. The long-tailed distributions obtained for q > 1 are expected to represent complexly correlated data effectively and hence to improve robustness against outliers. The main improvements over our previous study [1] are a q-GMM super-vector representation for efficiently computing the q-GMM kernel, and a detailed experimental analysis comparing accuracy and testing cost with recent kernel methods. Our proposed method outperformed BoW and achieved a Mean Average Precision of 49.42% on PASCAL VOC 2010 and 10.90% on TRECVID 2010 Semantic Indexing.
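To illustrate how the parameter q controls tail heaviness, here is a minimal sketch of a one-dimensional, unnormalized q-Gaussian in its standard Tsallis form, [1 - (1 - q) * beta * x^2]^(1 / (1 - q)). The function name `q_gaussian_unnormalized` and the scale parameter `beta` are assumptions made for this illustration; the abstract does not give the paper's exact (multivariate, normalized) parameterization, so this should not be read as the authors' implementation.

```python
import math


def q_gaussian_unnormalized(x: float, q: float, beta: float = 0.5) -> float:
    """Unnormalized 1-D q-Gaussian (illustrative; `beta` is an assumed scale).

    q -> 1 recovers the ordinary Gaussian exp(-beta * x^2).
    q > 1 gives power-law (heavy) tails.
    q < 1 gives compact support (the density is zero beyond a cutoff).
    """
    if abs(q - 1.0) < 1e-12:
        return math.exp(-beta * x * x)
    base = 1.0 - (1.0 - q) * beta * x * x
    if base <= 0.0:
        # Only reachable for q < 1: x lies outside the compact support.
        return 0.0
    return base ** (1.0 / (1.0 - q))


if __name__ == "__main__":
    # Compare tail mass far from the mode for several values of q.
    for q in (1.0, 1.2, 1.5):
        print(q, q_gaussian_unnormalized(5.0, q))
```

Far from the mode, the value for q = 1.5 is several orders of magnitude larger than for q = 1, which is the sense in which a q-GMM component with q > 1 assigns more probability to outlying feature vectors than a Gaussian component does.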

Bibliographic record

Source
Journal of Visual Communication & Image Representation, 2013, Issue 8, pp. 1450-1457 (8 pages)
Authors
Nakamasa Inoue; Koichi Shinoda
Author affiliations

Department of Computer Science, Tokyo Institute of Technology, Tokyo 152-8552, Japan;

Original format: PDF
Language: English
Keywords
Semantic indexing; Gaussian mixture models; q-Gaussian mixture models; GMM supervector; Image classification; Bag-of-words; Codebook; Tsallis statistics

Similar literature

1. Corrections to "Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss" [J]. Yang Y., Zha Z.-J., Gao Y. IEEE Transactions on Multimedia, 2015, Issue 2.
2. Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss [J]. Yang Y., Zha Z.-J., Gao Y. IEEE Transactions on Multimedia, 2014, Issue 6.
3. Semantic Image and Video Indexing in Broad Domains [J]. Worring M., Schreiber G. IEEE Transactions on Multimedia, 2007, Issue 5.
4. q-Gaussian Mixture Models Based on Non-extensive Statistics for Image and Video Semantic Indexing [C]. Nakamasa Inoue, Koichi Shinoda. Asian Conference on Computer Vision, 2013.
5. Optimization-based summarization and indexing of extended videos, with application to instructional video semantics [D]. Liu, Tiecheng. 2003.
6. Towards knowledge-based retrieval of medical images. The role of semantic indexing image content representation and knowledge-based retrieval [O]. H. J. Lowe, I. Antipov, W. Hersh. 1998.
7. Semantics-based Video Indexing Using a Stochastic Modeling Approach [O]. Yong Wei, Suchendra M. Bh, Kang Li. 2007.
8. Method and Device for Online Dynamic Semantic Video Compression and Video Indexing [R]. Liu, T., Kender, J. R. 2004.
