首页> 外文会议>IEEE International Conference on Computer Vision >Image parsing: unifying segmentation, detection, and recognition
【24h】

Image parsing: unifying segmentation, detection, and recognition

机译:图像解析:统一分割,检测和识别

获取原文
获取外文期刊封面目录资料

摘要

We propose a general framework for parsing images into regions and objects. In this framework, the detection and recognition of objects proceed simultaneously with image segmentation in a competitive and cooperative manner. We illustrate our approach on natural images of complex city scenes where the objects of primary interest are faces and text. This method makes use of bottom-up proposals combined with top-down generative models using the data driven Markov chain Monte Carlo (DDMCMC) algorithm, which is guaranteed to converge to the optimal estimate asymptotically. More precisely, we define generative models for faces, text, and generic regions- e.g. shading, texture, and clutter. These models are activated by bottom-up proposals. The proposals for faces and text are learnt using a probabilistic version of AdaBoost. The DDMCMC combines reversible jump and diffusion dynamics to enable the generative models to explain the input images in a competitive and cooperative manner. Our experiments illustrate the advantages and importance of combining bottom-up and top-down models and of performing segmentation and object detection/recognition simultaneously.
机译:我们提出了一个将图像解析为区域和对象的一般框架。在该框架中,对象的检测和识别以竞争和协作方式与图像分割同时进行。我们说明了我们对复杂城市场景的自然图像的方法,其中主要兴趣的对象是面部和文本。该方法利用自下而上的建议使用数据驱动的马尔可夫链蒙特卡罗(DDMCMC)算法与自上而下的生成模型结合,保证将收敛到渐近估计值。更确切地说,我们为面部,文本和通用区域定义了生成模型 - 例如,阴影,纹理和杂乱。这些模型由自下而上的建议激活。使用Adaboost的概率版本学习面孔和文本的提案。 DDMCMC结合了可逆跳转和扩散动态,使得生成模型以竞争和合作方式解释输入图像。我们的实验说明了结合自下而下模型和同时执行分割和物体检测/识别的优点和重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号