PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description

机译：每次VIS：使用语义描述视频监视中的人员检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A person is usually characterized by descriptors like age, gender, height, cloth type, pattern, color, etc. Such descriptors are known as attributes and/or soft-biometrics. They link the semantic gap between a person's description and retrieval in video surveillance. Retrieving a specific person with the query of semantic description has an important application in video surveillance. Using computer vision to fully automate the person retrieval task has been gathering interest within the research community. However, the Current, trend mainly focuses on retrieving persons with image-based queries, which have major limitations for practical usage. Instead of using an image query, in this paper, we study the problem of person retrieval in video surveillance with a semantic description. To solve this problem, we develop a deep learning-based cascade filtering approach (PeR-ViS), which uses Mask R-CNN [14] (person detection and instance segmentation) and DenseNet-161 [16] (soft-biometric classification). On the standard person retrieval dataset of SoftBioSearch [6], we achieve 0.566 Average IoU and 0.792 %w IoU > 0.4, surpassing the current state-of-the-art by a large margin. We hope our simple, reproducible, and effective approach will help ease future research in the domain of person retrieval in video surveillance. The source code will be released after the paper is accepted for publication with base-line and pretrained weights. The source code and pre-trained weights available at https://parshwa1999.github.io/PeR-ViS/.

机译：一个人通常是由年龄，性别，高度，布型，图案，颜色等的描述符的特征。这样的描述符被称为属性和/或软生物测量。他们将一个人的描述与视频监控中的检索之间的语义差距联系起来。用语义描述检索特定人员在视频监控中具有重要应用。使用计算机愿景全自动化该人员检索任务一直在收集研究界的兴趣。但是，目前的趋势主要关注检索基于图像的查询的人，这对实际使用具有重大限制。在本文中，我们在使用语义描述中研究了视频监视中检索人员检索问题的问题。要解决这个问题，我们开发了一种深入的学习级联滤波方法（每次VIS），它使用掩模R-CNN [14]（人检测和实例分割）和DenSenet-161 [16]（软生物分类分类）。在SoftBioSearch的标准人检索数据集[6]中，我们实现了0.566平均IOU和0.792％W IOO> 0.4，超越了当前的最先进的余量。我们希望我们的简单，可重复和有效的方法将有助于缓解在视频监控中的人员领域的未来研究。源代码将在接受纸张与底线和预制权重的出版后释放。 https://parshwa1999.github.io/per-vis/可用源代码和预先训练的权重。

著录项

来源
《IEEE Winter Applications and Computer Vision Workshops》|2021年|41-50|共10页
会议地点
作者
Parshwa Shah; Arpit Garg; Vandit Gajjar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Computer vision; Image segmentation; Image color analysis; Filtering; Conferences; Semantics; Video surveillance;

机译：计算机视觉;图像分割;图像颜色分析;过滤;会议;语义;视频监控;

相似文献

外文文献
中文文献
专利

1. Probabilistic Semantic Retrieval for Surveillance Videos With Activity Graphs [J] . Chen Yuting, Wang Joseph, Bai Yannan, IEEE transactions on multimedia . 2019,第3期

机译：具有活动图的监视视频的概率语义检索
2. Semantic retrieval of events from indoor surveillance video databases [J] . Chengcui Zhang, Xin Chen, Liping Zhou, Pattern recognition letters . 2009,第12期

机译：从室内监控视频数据库中语义检索事件
3. SURVEILLANCE VIDEO INDEXING AND RETRIEVAL USING OBJECT FEATURES AND SEMANTIC EVENTS [J] . THI-LAN LE, MONIQUE THONNAT, ALAIN BOUCHER, International Journal of Pattern Recognition and Artificial Intelligence . 2009,第7期

机译：使用对象功能和语义事件监视视频索引和检索
4. Semantic Description and Information Retrieval Research of Surveillance Video in Smart Transportation System [C] . Boxiong Yang, Jing Huang, Yuqi Yang International Conference on Electromechanical Control Technology and Transportation . 2015

机译：智能运输系统监控视频的语义描述和信息检索研究
5. Semantic Description of Activities in Videos. [D] . Moreira de Souza, Fillipe Dias. 2017

机译：视频中活动的语义描述。
6. Visual Semantic Based 3D Video Retrieval System Using HDFS [O] . C.Ranjith Kumar, S. Suguna -1

机译：使用HDFS的基于视觉语义的3D视频检索系统
7. Probabilistic Semantic Retrieval for Surveillance Videos with Activity Graphs [O] . Chen, Yuting, Wang, Joseph, Bai, Yannan, 2017

机译：具有活动性的监视视频的概率语义检索图表

PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description

摘要

著录项

相似文献

相关主题

期刊订阅