首页> 外文会议>IEEE Winter Applications and Computer Vision Workshops >PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description
【24h】

PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description

机译:每次VIS:使用语义描述视频监视中的人员检索

获取原文

摘要

A person is usually characterized by descriptors like age, gender, height, cloth type, pattern, color, etc. Such descriptors are known as attributes and/or soft-biometrics. They link the semantic gap between a person's description and retrieval in video surveillance. Retrieving a specific person with the query of semantic description has an important application in video surveillance. Using computer vision to fully automate the person retrieval task has been gathering interest within the research community. However, the Current, trend mainly focuses on retrieving persons with image-based queries, which have major limitations for practical usage. Instead of using an image query, in this paper, we study the problem of person retrieval in video surveillance with a semantic description. To solve this problem, we develop a deep learning-based cascade filtering approach (PeR-ViS), which uses Mask R-CNN [14] (person detection and instance segmentation) and DenseNet-161 [16] (soft-biometric classification). On the standard person retrieval dataset of SoftBioSearch [6], we achieve 0.566 Average IoU and 0.792 %w IoU > 0.4, surpassing the current state-of-the-art by a large margin. We hope our simple, reproducible, and effective approach will help ease future research in the domain of person retrieval in video surveillance. The source code will be released after the paper is accepted for publication with base-line and pretrained weights. The source code and pre-trained weights available at https://parshwa1999.github.io/PeR-ViS/.
机译:一个人通常是由年龄,性别,高度,布型,图案,颜色等的描述符的特征。这样的描述符被称为属性和/或软生物测量。他们将一个人的描述与视频监控中的检索之间的语义差距联系起来。用语义描述检索特定人员在视频监控中具有重要应用。使用计算机愿景全自动化该人员检索任务一直在收集研究界的兴趣。但是,目前的趋势主要关注检索基于图像的查询的人,这对实际使用具有重大限制。在本文中,我们在使用语义描述中研究了视频监视中检索人员检索问题的问题。要解决这个问题,我们开发了一种深入的学习级联滤波方法(每次VIS),它使用掩模R-CNN [14](人检测和实例分割)和DenSenet-161 [16](软生物分类分类) 。在SoftBioSearch的标准人检索数据集[6]中,我们实现了0.566平均IOU和0.792%W IOO> 0.4,超越了当前的最先进的余量。我们希望我们的简单,可重复和有效的方法将有助于缓解在视频监控中的人员领域的未来研究。源代码将在接受纸张与底线和预制权重的出版后释放。 https://parshwa1999.github.io/per-vis/可用源代码和预先训练的权重。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号