Multi2Rank: Multimedia Multiview Ranking

机译：Multi2Rank：多媒体多视图排名

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multimedia retrieval is a search and ranking task defined over multiple modalities. These modalities include speech, image, and text, which provide different views of the multimedia object. Queries to a multimedia retrieval system often take the form of a text only query and return a ranked result set which combines these multiple views. The text only query includes multiple phrases which identify features of a specific view. This multiview problem presents a challenge in mapping these phrases into the correct view feature space. A second challenge for the multimedia retrieval system is in building a ranking model which considers the unique feature space of each view. In this paper, we propose a hierarchical multimedia multiview rank learning model, called MultiRank, to overcome the challenges of this unique ranking problem. The first layer of our model uses natural language processing techniques to identify view specific phrases and output a ranked mapping of the phrases into their respective views. Next, we model the individual feature space for each multimedia view and create a view specific model using gradient boosted regression trees. The ranked set from each unique view is then passed to the final layer of the hierarchy, where the model generates a final ranked result set. We show that our multiview rank learning approach is effective by evaluating the methods using a large Internet video repository, queries, and ground truth, from the TRECVid evaluations.

机译：多媒体检索是在多种模式下定义的搜索和排名任务。这些形式包括语音，图像和文本，它们提供了多媒体对象的不同视图。对多媒体检索系统的查询通常采用仅文本查询的形式，并返回结合了这些多个视图的排序结果集。仅文本查询包括多个短语，这些短语标识特定视图的功能。这个多视图问题在将这些短语映射到正确的视图特征空间中提出了挑战。多媒体检索系统的第二个挑战是建立一个考虑每个视图的唯一特征空间的排名模型。在本文中，我们提出了一种称为MultiRank的分层多媒体多视图等级学习模型，以克服这种独特的排名问题的挑战。我们模型的第一层使用自然语言处理技术来识别特定于视图的短语，并将短语的排名映射输出到它们各自的视图中。接下来，我们为每个多媒体视图建模各自的特征空间，并使用梯度增强的回归树创建特定于视图的模型。然后，将每个唯一视图中的排名集传递到层次结构的最后一层，在该模型中，模型将生成最终排名结果集。我们表明，通过从TRECVid评估中使用大型Internet视频存储库，查询和基本事实来评估方法，我们的多视图等级学习方法是有效的。

著录项

来源
《International Conference on Multimedia Big Data》|2015年|80-87|共8页
会议地点
作者
Etter David; Domeniconi Carlotta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Information Search and Retrieval; Retrieval models; Search process;

机译：信息检索与检索;检索模型;检索过程;

相似文献

外文文献
中文文献
专利

1. Identifying and ranking potential driver genes of Alzheimer's disease using multiview evidence aggregation [J] . Mukherjee Sumit, Perumal Thanneer M., Daily Kenneth, Bioinformatics . 2019,第14期

机译：使用多视图证据汇总鉴定和排名阿尔茨海默病的潜在驾驶基因
2. Multiview saliency detection based on improved multimanifold ranking [J] . Yanjiao Shi, Yugen Yi, Ke Zhang, Journal of electronic imaging . 2014,第6期

机译：基于改进多流形排序的多视图显着性检测
3. Virtual drug screen schema based on multiview similarity integration and ranking aggregation [J] . Kang H., Sheng Z., Zhu R., Journal of chemical information and modeling . 2012,第3期

机译：基于多视图相似度集成和排名聚合的虚拟药品筛选方案
4. Multi2Rank: Multimedia Multiview Ranking [C] . Etter David, Domeniconi Carlotta International Conference on Multimedia Big Data . 2015

机译：Multi2Rank：多媒体多视图排名
5. Multiview Rank Learning for Multimedia Known Item Search [D] . Etter, David L. 2015

机译：多媒体已知项搜索的多视图等级学习
6. Identifying and ranking potential driver genes of Alzheimer’s disease using multiview evidence aggregation [O] . Sumit Mukherjee, Thanneer M Perumal, Kenneth Daily, -1

机译：使用多视图证据聚合来识别和排序阿尔茨海默氏病的潜在驱动基因
7. Multiview Semi-Supervised Ranking for Automatic Image Annotation [O] . Ali Fakeri-tabrizi, Massih-reza Amini, Patrick Gallinari 2015

机译：自动图像标注的多视图半监督排序

Multi2Rank: Multimedia Multiview Ranking

摘要

著录项

相似文献

相关主题

期刊订阅