首页> 外文学位 >Data Synthesis for Object Recognition
【24h】

Data Synthesis for Object Recognition

机译:用于目标识别的数据综合

获取原文
获取原文并翻译 | 示例

摘要

Large and balanced datasets are normally crucial for many machine learning models, especially when the problem is defined in a high dimensional space due to high complexity. In real-world applications, it is usually very hard and/or expensive to obtain adequate amounts of labeled data, even with the help of crowd-sourcing. To address these problems, a possible approach is to create synthetic data and use it for training. This approach has been applied in many application areas of computer vision including document recognition, object retrieval, and object classification. While a boosted performance has been demonstrated using synthetic data, the boosted performance is limited by two main factors in existing approaches. First, most existing approaches for creating and using synthetic data are application-specific and thus lack the ability to benefit other application areas. Further, such application-specific approaches are often heuristic in nature. Second, existing approaches do not recognize an inherent difference between synthetic data and actual data which is termed as a synthetic gap in my proposal. The synthetic gap in existing approaches is due to the fact that not all possible patterns and structures of actual data are present in the synthetic data. To address the problems of using synthetic data and using it to better improve the performance of learning algorithm, this proposal considers general ways of creating and using synthetic data. The problem caused by the synthetic gap is studied and approaches to overcome the gap are proposed. Experimental results demonstrate that the proposed approach is efficient and can boost the performance of many computer vision applications including building roof classification, character classification, and point cloud object classification.
机译:大型且平衡的数据集通常对于许多机器学习模型至关重要,尤其是由于复杂性高而在高维空间中定义问题时。在实际应用中,即使在众包的帮助下,获取足够数量的标记数据通常也非常困难和/或昂贵。为了解决这些问题,一种可能的方法是创建综合数据并将其用于训练。这种方法已应用于计算机视觉的许多应用领域,包括文档识别,对象检索和对象分类。尽管已使用综合数据证明了性能的提高,但是现有方法中的两个主要因素限制了性能的提高。首先,大多数现有的用于创建和使用合成数据的方法都是特定于应用程序的,因此缺乏使其他应用程序领域受益的能力。此外,这种特定于应用程序的方法通常本质上是启发式的。其次,现有方法无法识别综合数据与实际数据之间的固有差异,这在我的建议中被称为综合差距。现有方法中的合成差距是由于以下事实造成的:并非所有可能的实际数据模式和结构都存在于合成数据中。为了解决使用合成数据并更好地提高学习算法性能的问题,该建议考虑了创建和使用合成数据的一般方法。研究了由合成间隙引起的问题,并提出了克服间隙的方法。实验结果表明,该方法是有效的,可以提高许多计算机视觉应用程序的性能,包括建筑物屋顶分类,角色分类和点云对象分类。

著录项

  • 作者

    Zhang, Xi.;

  • 作者单位

    Illinois Institute of Technology.;

  • 授予单位 Illinois Institute of Technology.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 140 p.
  • 总页数 140
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号