首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 >Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features
【24h】

Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features

机译:通过融合的图像/语音/文本特征对具有稀疏用户注释的个人照片进行潜在的语义检索

获取原文

摘要

While users prefer high-level semantic photo descriptions (e.g., who, what, when, where), we wish to minimize the need to annotate photos using such descriptions by the user. We propose a latent semantic personal photo retrieval approach using fused image/speech/text features. We use low-level image features to derive relationships among sparsely annotated photos, and probabilistic latent semantic analysis (PLSA) models based on fused image/speech/text features to analyze photo ldquotopicsrdquo. We then retrieve the photos using text or speech queries of simple high-level semantic words only. In preliminary experiments, while only 10% of the photos were manually annotated, the photos could be well retrieved with very encouraging results.
机译:尽管用户更喜欢高级语义照片描述(例如,谁,什么,什么时候,何时何地),但我们希望最大限度地减少用户使用此类描述对照片进行注释的需求。我们提出了一种潜在的语义个人照片检索方法,使用融合的图像/语音/文本特征。我们使用低级图像特征来导出稀疏带注释的照片之间的关系,并基于融合图像/语音/文本特征的概率潜在语义分析(PLSA)模型来分析照片“主题”。然后,我们仅使用简单的高级语义词的文本或语音查询来检索照片。在初步实验中,虽然仅手动注释了10%的照片,但可以很好地检索照片,并获得令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号