Spotting celebrities among peers in a TV show: how to exploit web querying for weakly supervised visual diarization

Abstract

In this paper, we propose a novel solution for popularity recognition. The methodology consists of categorizing and exploiting web image resources of people in terms of relevant identities. To demonstrate its usefulness, we also study the effects of incorporating this procedure into a visual diarization system.

In our setting, training data is obtained by querying Google Images for the known identities of the participants in a TV show (we know only who participates, not when). As Google queries may return imprecise results, the images retrieved for each query (or identity) are processed to distinguish true from false positives, keeping the former and filtering out the latter.

Next, facial clustering is performed to drive a filtering process that discards noisy samples (false positives) from the returned images; that is, only images linked to the principal cluster are adopted as training data.

For popularity recognition, Random Forest (RF) and support vector machine (SVM) classifiers were tested. Three types of features are proposed to build the models upon: image-related, query-related, and clustering-related features.

Feature ranking and feature selection show that clustering-related features are the most important for popularity recognition. In fact, the RF model based on the three top-ranked features achieves an accuracy close to 100% on the test set. Results also demonstrate that integrating the popularity solution into our visual diarization pipeline helps to reduce the Diarization Error Rate (DER) by 2% and removes around 15% of noisy identities, which confirms the quality of this procedure and its high performance in other scenarios.
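
The principal-cluster filtering described in the abstract can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm, the face embedding, or any threshold, so everything below is an assumption made for illustration: the function names (`cluster_faces`, `keep_principal_cluster`), the cosine-distance threshold of 0.3, and the single-linkage grouping are hypothetical stand-ins, and the 2-D vectors are toy data rather than real face descriptors.

```python
from collections import Counter
from math import sqrt


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def cluster_faces(embeddings, threshold=0.3):
    """Single-linkage grouping: faces closer than `threshold` share a cluster.

    Hypothetical stand-in for the facial clustering step; the algorithm
    and threshold are assumptions, not taken from the paper.
    Returns one integer cluster label per embedding.
    """
    n = len(embeddings)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if cosine_distance(embeddings[i], embeddings[j]) < threshold:
                parent[find(i)] = find(j)

    return [find(i) for i in range(n)]


def keep_principal_cluster(embeddings, threshold=0.3):
    """Return indices of the images that fall in the largest cluster."""
    labels = cluster_faces(embeddings, threshold)
    if not labels:
        return []
    principal, _ = Counter(labels).most_common(1)[0]
    return [i for i, label in enumerate(labels) if label == principal]


# Toy 2-D "embeddings" for one query: four similar faces and two outliers.
retrieved = [
    (1.0, 0.1), (0.9, 0.2), (1.0, 0.0), (0.95, 0.15),  # mutually similar
    (0.0, 1.0), (-1.0, 0.2),                            # unrelated faces
]
kept = keep_principal_cluster(retrieved)
print(kept)
```

Only the images whose indices are returned would be kept as training data for that identity; the rest are treated as false positives of the web query. In a real system the embeddings would come from a face descriptor network, and the clustering method and threshold would need tuning.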

Bibliographic record

Source
International Joint Conference on Web Intelligence and Intelligent Agent Technology | 2020 | pp. 877-884 | 8 pages
Authors
Cristina Luna Jiménez; Ricardo Kleinlein; Fernando Fernández-Martínez; José M. Moya; Zoraida Callejas; Jose Manuel Pardo Muñoz
Original format: PDF
Keywords
Support vector machines; Radio frequency; Training; Visualization; TV; Image recognition; Training data;

Similar literature

1. Exploiting Web Images for Weakly Supervised Object Detection [J]. Tao Qingyi, Yang Hao, Cai Jianfei. IEEE Transactions on Multimedia. 2019, no. 5.
2. NLWSNet: a weakly supervised network for visual sentiment analysis in mislabeled web images [J]. Luo-yang XUE, Qi-rong MAO, Xiao-hua HUANG. Journal of Zhejiang University (English Edition), Series C: Computers and Electronics. 2020, no. 009.
3. Audiovisual speaker diarization of TV series [C]. Bost Xavier, Linares Georges, Gueye Serigne. IEEE International Conference on Acoustics, Speech and Signal Processing. 2015.
4. Weakly supervised learning from multiple modalities: Exploiting video, audio and text for video understanding [D]. Cour, Timothee. 2009.
5. Atomistic simulations and network-based modeling of the Hsp90-Cdc37 chaperone binding with Cdk4 client protein: A mechanism of chaperoning kinase clients by exploiting weak spots of intrinsically dynamic kinase domains [O]. Josh Czemeres, Kurt Buse, Gennady M. Verkhivker. 2011.
6. Brenda R. Weber: Makeover TV. Selfhood, Citizenship and Celebrity. Durham & London: Duke University Press. 2009 [O]. Anne Jerslev. 2010.
