首页> 外文会议>IEEE International Conference on Computer Vision >TorontoCity: Seeing the World with a Million Eyes
【24h】

TorontoCity: Seeing the World with a Million Eyes

机译:torontocity:看到百万只眼睛的世界

获取原文

摘要

In this paper we introduce the TorontoCity benchmark, which covers the full greater Toronto area (GTA) with 712.5km~2 of land, 8439km of road and around 400,000 buildings. Our benchmark provides different perspectives of the world captured from airplanes, drones and cars driving around the city. Manually labeling such a large scale dataset is infeasible. Instead, we propose to utilize different sources of high-precision maps to create our ground truth. Towards this goal, we develop algorithms that allow us to align all data sources with the maps while requiring minimal human supervision. We have designed a wide variety of tasks including building height estimation (reconstruction), road centerline and curb extraction, building instance segmentation, building contour extraction (reorganization), semantic labeling and scene type classification (recognition). Our pilot study shows that most of these tasks are still difficult for modern convolutional neural networks.
机译:在本文中,我们介绍了多域基准,涵盖了大型多伦多地区(GTA),土地为712.5km〜2,道路8439公里和约40万个建筑物。我们的基准市场提供了从飞机,无人机和在城市驾驶的汽车捕获的世界的不同观点。手动标记这样的大型数据集是不可行的。相反,我们建议利用不同的高精度地图来源来创造我们的实践。对此目标,我们开发算法,使我们能够将所有数据源与地图对齐,同时需要最小的人类监督。我们设计了各种各样的任务,包括建筑高度估计(重建),道路中心线和遏制提取,建筑物实例分割,建筑轮廓提取(重组),语义标签和场景类型分类(识别)。我们的试验研究表明,这些任务中的大多数仍然难以实现现代卷积神经网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号