首页> 中文期刊> 《电子学报》 >一种联合文本和图像信息的行人检测方法

一种联合文本和图像信息的行人检测方法

         

摘要

针对纯视觉行人检测方法存在的误检、漏检率高,遮挡目标以及小尺度目标检测精度低等问题,提出一种联合文本和图像信息的行人检测方法.该方法首先利用图像分析的方法初步获取图像目标的候选框,其次通过文本分析的方法获取文本中有关图像目标的实体表达,并提出一种基于马尔科夫随机场的模型用于推断图像候选框与文本实体表达之间的共指关系(Coreference Relation),以此达到联合图像和文本信息以辅助机器视觉提高交通场景下行人检测精度的目的.在增加了图像文本描述的加州理工大学行人检测数据集上进行的测评结果表明,该方法不仅可以在图像信息的基础上联合文本信息提高交通场景中的行人检测精度,也能在文本信息的基础上联合图像信息提高文本中的指代消解(Anaphora Resolution)精度.%Existing vision-based pedestrian detection methods encounter many flaws,such as high false and miss detection rates,low detection accuracy on partial occluded and small scale objects,etc.In this paper,we propose a pedestrian detection method combining textual and visual information together.First,we use a vision-based method to initially localize the candidate visual objects.Second,we analyze the text information to get the text mentions corresponding to the visual objects.Finally,we propose a Markov random field-based model to infer the coreference relations between the candidate visual objects and textual mentions,so that the visual and textual information can be fused efficiently.The experimental results on the Caltech pedestrian detection benchmark enriched with textual description information have shown that the proposed method can not only improve the pedestrian detection accuracy by combining textual information with visual information,but also outperform the baseline anaphora resolution model by combining visual information with textual information.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号