首页> 外国专利> Technologies for improved object detection accuracy with multi-scale representation and training

Technologies for improved object detection accuracy with multi-scale representation and training

机译:通过多尺度表示和训练提高物体检测精度的技术

摘要

Technologies for multi-scale object detection include a computing device including a multi-layer convolution network and a multi-scale region proposal network (RPN). The multi-layer convolution network generates a convolution map based on an input image. The multi-scale RPN includes multiple RPN layers, each with a different receptive field size. Each RPN layer generates region proposals based on the convolution map. The computing device may include a multi-scale object classifier that includes multiple region of interest (ROI) pooling layers and multiple associated fully connected (FC) layers. Each ROI pooling layer has a different output size, and each FC layer may be trained for an object scale based on the output size of the associated ROI pooling layer. Each ROI pooling layer may generate pooled ROIs based on the region proposals and each FC layer may generate object classification vectors based on the pooled ROIs. Other embodiments are described and claimed.
机译:用于多尺度对象检测的技术包括计算设备,该计算设备包括多层卷积网络和多尺度区域提议网络(RPN)。多层卷积网络基于输入图像生成卷积图。多尺度RPN包括多个RPN层,每个层具有不同的接收场大小。每个RPN层都基于卷积图生成区域建议。该计算设备可以包括多尺度对象分类器,该多尺度对象分类器包括多个关注区域(ROI)池层和多个相关联的完全连接(FC)层。每个ROI池化层具有不同的输出大小,并且可以基于关联的ROI池化层的输出大小为对象规模训练每个FC层。每个ROI池化层可以基于区域提议来生成池化的ROI,并且每个FC层可以基于池化的ROI来生成对象分类向量。描述和要求保护其他实施例。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号