Data Synthesis for Object Recognition

机译：用于目标识别的数据综合

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large and balanced datasets are normally crucial for many machine learning models, especially when the problem is defined in a high dimensional space due to high complexity. In real-world applications, it is usually very hard and/or expensive to obtain adequate amounts of labeled data, even with the help of crowd-sourcing. To address these problems, a possible approach is to create synthetic data and use it for training. This approach has been applied in many application areas of computer vision including document recognition, object retrieval, and object classification. While a boosted performance has been demonstrated using synthetic data, the boosted performance is limited by two main factors in existing approaches. First, most existing approaches for creating and using synthetic data are application-specific and thus lack the ability to benefit other application areas. Further, such application-specific approaches are often heuristic in nature. Second, existing approaches do not recognize an inherent difference between synthetic data and actual data which is termed as a synthetic gap in my proposal. The synthetic gap in existing approaches is due to the fact that not all possible patterns and structures of actual data are present in the synthetic data. To address the problems of using synthetic data and using it to better improve the performance of learning algorithm, this proposal considers general ways of creating and using synthetic data. The problem caused by the synthetic gap is studied and approaches to overcome the gap are proposed. Experimental results demonstrate that the proposed approach is efficient and can boost the performance of many computer vision applications including building roof classification, character classification, and point cloud object classification.

机译：大型且平衡的数据集通常对于许多机器学习模型至关重要，尤其是由于复杂性高而在高维空间中定义问题时。在实际应用中，即使在众包的帮助下，获取足够数量的标记数据通常也非常困难和/或昂贵。为了解决这些问题，一种可能的方法是创建综合数据并将其用于训练。这种方法已应用于计算机视觉的许多应用领域，包括文档识别，对象检索和对象分类。尽管已使用综合数据证明了性能的提高，但是现有方法中的两个主要因素限制了性能的提高。首先，大多数现有的用于创建和使用合成数据的方法都是特定于应用程序的，因此缺乏使其他应用程序领域受益的能力。此外，这种特定于应用程序的方法通常本质上是启发式的。其次，现有方法无法识别综合数据与实际数据之间的固有差异，这在我的建议中被称为综合差距。现有方法中的合成差距是由于以下事实造成的：并非所有可能的实际数据模式和结构都存在于合成数据中。为了解决使用合成数据并更好地提高学习算法性能的问题，该建议考虑了创建和使用合成数据的一般方法。研究了由合成间隙引起的问题，并提出了克服间隙的方法。实验结果表明，该方法是有效的，可以提高许多计算机视觉应用程序的性能，包括建筑物屋顶分类，角色分类和点云对象分类。

著录项

作者
Zhang, Xi.;
展开▼
作者单位

Illinois Institute of Technology.;

展开▼
授予单位 Illinois Institute of Technology.;
学科 Computer science.
学位 Ph.D.
年度 2017
页码 140 p.
总页数 140
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. NTT Develops Non-rigid Object Recognition Technology that Achieves Recognition of Database Images—Not Only Rigid 3D Objects but also Non-rigid Objects—at Low Cost [J] . NTT Technical Review . 2019,第5期

机译：NTT开发了非刚性对象识别技术，该技术可以低成本实现数据库图像的识别，不仅可以识别刚性3D对象，还可以识别非刚性对象
2. The KIT object models database: An object model database for object recognition; localization and manipulation in service robotics [J] . Alexander Kasper, Zhixing Xue, Riidiger Dillmann The International journal of robotics research . 2012,第8期

机译：KIT对象模型数据库：用于对象识别的对象模型数据库；服务机器人中的本地化和操作
3. DVS Benchmark Datasets for Object Tracking, Action Recognition, and Object Recognition [J] . Yuhuang Hu, Hongjie Liu, Michael Pfeiffer, Frontiers in Neuroscience . 2016,第2016期

机译：用于对象跟踪，动作识别和对象识别的DVS基准数据集
4. Synthesis of a man machine interface environment for object recognition using tactile data [C] . Contreiras, F., Avril, Software Engineering for Real Time Systems, 1989., Second International Conference on . 1989

机译：人机界面环境的综合，用于使用触觉数据进行对象识别
5. Visual Recognition and Synthesis of Human-Object Interactions [D] . ?Chao, Yu-Wei 2019

机译：人体对象交互的视觉识别和合成
6. DVS Benchmark Datasets for Object Tracking Action Recognition and Object Recognition [O] . Yuhuang Hu, Hongjie Liu, Michael Pfeiffer, 2016

机译：用于对象跟踪动作识别和对象识别的DVS基准数据集
7. DVS benchmark datasets for object tracking, action recognition and object recognition [O] . Hu, Yuhuang, Liu, Hongjie, Pfeiffer, Michael, 2016

机译：用于对象跟踪，动作识别和对象识别的DVS基准数据集
8. Integrated Color CCD and LADAR Based Object Modeling and Recognition: ACoregistration Approach to Multisensor Target Recognition with Extensions to Exploit Digital Elevation Map Data [R] . Beveridge, J. R., Draper, B. A., Stevens, M. R., 1997

机译：基于集成彩色CCD和LaDaR的对象建模与识别：具有扩展的多传感器目标识别的交互注册方法，用于利用数字高程地图数据

Data Synthesis for Object Recognition

摘要

著录项

相似文献

相关主题

期刊订阅