首页> 外文会议>International Conference on Multimedia Big Data >Multi2Rank: Multimedia Multiview Ranking
【24h】

Multi2Rank: Multimedia Multiview Ranking

机译:Multi2Rank:多媒体多视图排名

获取原文

摘要

Multimedia retrieval is a search and ranking task defined over multiple modalities. These modalities include speech, image, and text, which provide different views of the multimedia object. Queries to a multimedia retrieval system often take the form of a text only query and return a ranked result set which combines these multiple views. The text only query includes multiple phrases which identify features of a specific view. This multiview problem presents a challenge in mapping these phrases into the correct view feature space. A second challenge for the multimedia retrieval system is in building a ranking model which considers the unique feature space of each view. In this paper, we propose a hierarchical multimedia multiview rank learning model, called MultiRank, to overcome the challenges of this unique ranking problem. The first layer of our model uses natural language processing techniques to identify view specific phrases and output a ranked mapping of the phrases into their respective views. Next, we model the individual feature space for each multimedia view and create a view specific model using gradient boosted regression trees. The ranked set from each unique view is then passed to the final layer of the hierarchy, where the model generates a final ranked result set. We show that our multiview rank learning approach is effective by evaluating the methods using a large Internet video repository, queries, and ground truth, from the TRECVid evaluations.
机译:多媒体检索是在多种模式下定义的搜索和排名任务。这些形式包括语音,图像和文本,它们提供了多媒体对象的不同视图。对多媒体检索系统的查询通常采用仅文本查询的形式,并返回结合了这些多个视图的排序结果集。仅文本查询包括多个短语,这些短语标识特定视图的功能。这个多视图问题在将这些短语映射到正确的视图特征空间中提出了挑战。多媒体检索系统的第二个挑战是建立一个考虑每个视图的唯一特征空间的排名模型。在本文中,我们提出了一种称为MultiRank的分层多媒体多视图等级学习模型,以克服这种独特的排名问题的挑战。我们模型的第一层使用自然语言处理技术来识别特定于视图的短语,并将短语的排名映射输出到它们各自的视图中。接下来,我们为每个多媒体视图建模各自的特征空间,并使用梯度增强的回归树创建特定于视图的模型。然后,将每个唯一视图中的排名集传递到层次结构的最后一层,在该模型中,模型将生成最终排名结果集。我们表明,通过从TRECVid评估中使用大型Internet视频存储库,查询和基本事实来评估方法,我们的多视图等级学习方法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号