首页> 外文会议>Conference on Multimedia Information Processing and Retrieval >Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions
【24h】

Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions

机译:可变环境照明条件下来自网络摄像机的图像的大规模目标检测

获取原文

摘要

Computer vision relies on labeled datasets for training and evaluation in detecting and recognizing objects. The popular computer vision program, YOLO ("You Only Look Once"), has been shown to accurately detect objects in many major image datasets. However, the images found in those datasets, are independent of one another and cannot be used to test YOLO's consistency at detecting the same object as its environment (e.g. ambient lighting) changes. This paper describes a novel effort to evaluate YOLO's consistency for large-scale applications. It does so by working (a) at large scale and (b) by using consecutive images from a curated network of public video cameras deployed in a variety of real-world situations, including traf?c intersections, national parks, shopping malls, university campuses, etc. We speci?cally examine YOLO's ability to detect objects in different scenarios (e.g., daytime vs. night), leveraging the cameras' ability to rapidly retrieve many successive images for evaluating detection consistency. Using our camera network and advanced computing resources (supercomputers), we analyzedmorethan5millionimagescapturedby140network cameras in 24 hours. Compared with labels marked by humans (considered as "ground truth"), YOLO struggles to consistently detect the same humans and cars as their positions change from one frame to the next; it also struggles to detect objects at night time. Our ?ndings suggest that state-of-the art vision solutions should be trained by data from network camera with contextual information before they can be deployed in applications that demand high consistency on object detection.
机译:计算机视觉依靠标记的数据集进行检测和识别对象的训练和评估。流行的计算机视觉程序YOLO(“您只看一次”)已被证明可以准确地检测许多主要图像数据集中的对象。但是,在这些数据集中找到的图像是彼此独立的,并且不能用于检测YOLO在检测到同一物体(例如其环境(例如环境照明))变化时的一致性。本文介绍了一种新颖的方法,用于评估YOLO在大型应用程序中的一致性。它是通过(a)大规模工作和(b)使用来自在各种现实情况(包括交通路口,国家公园,购物中心,大学)中部署的公共摄像机的精选网络的连续图像来实现的我们专门检查了YOLO在不同场景(例如白天与夜晚)中检测物体的能力,并利用相机的能力快速检索了许多连续的图像以评估检测的一致性。利用我们的摄像机网络和先进的计算资源(超级计算机),我们分析了24小时内140个网络摄像机捕获的500万张图像。与带有人类标记的标签(被称为“地面真相”)相比,YOLO努力在位置从一帧变化到下一帧的过程中,始终如一地检测到相同的人类和汽车。它还难以在夜间检测物体。我们的研究结果表明,在将网络摄像机的数据与上下文信息一起使用之前,应先对其进行培训,然后再将其部署到对物体检测要求高度一致性的应用程序中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号