首页> 外国专利> Joint Depth estimation and semantic segmentation from a single image

Joint Depth estimation and semantic segmentation from a single image

机译:单个图像的联合深度估计和语义分割

摘要

Global semantic and depth layouts of a scene in an image are estimated (126) through machine learning (e.g. as a template classification problem) by computing device(s). Local semantic and depth layouts are also estimated (128) for respective ones of a plurality of segments of the scene of the image through machine learning (e.g. using a convolutional neural network, CNN, or support vector machine, SVM) by the computing device(s) (e.g. also as a template classification problem). The estimated global semantic and depth layouts are merged (130) (e.g. using conditional random field) with the local semantic and depth layouts by the computing device(s) to semantically label (120) and assign a depth value (122) to image pixels (e.g. thereby ensuring consistency). The estimated global depth layout may assign a respective absolute distance to a plurality of pixels in the image. The merging may include smoothing the depth labels assigned to individual pixels. Also disclosed is the prediction of local semantic and depth layouts of individual segments using the estimated global semantic and depth layouts of the scene, and jointly forming a semantically-labelled version of the image in which individual pixels are assigned a semantic label and depth value.
机译:计算设备通过机器学习(例如,作为模板分类问题)估计(126)图像中场景的全局语义和深度布局。还通过计算设备通过机器学习(例如,使用卷积神经网络,CNN或支持向量机,SVM)对图像场景的多个片段中的各个片段的局部语义和深度布局进行估计(128)( s)(例如,也作为模板分类问题)。计算设备将估计的全局语义和深度布局与本地语义和深度布局合并(130)(例如,使用条件随机场),以在语义上标记(120),并将深度值(122)分配给图像像素。 (例如,从而确保一致性)。估计的全局深度布局可以将相应的绝对距离分配给图像中的多个像素。合并可以包括平滑分配给各个像素的深度标签。还公开了使用所估计的场景的全局语义和深度布局来预测各个片段的局部语义和深度布局,并共同形成图像的语义标记版本,其中为各个像素分配了语义标记和深度值。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号