首页> 外文会议>International Conference on Image and Vision Computing New Zealand >Machine Learning with Synthetic Data – a New Way to Learn and Classify the Pictorial Augmented Reality Markers in Real-Time
【24h】

Machine Learning with Synthetic Data – a New Way to Learn and Classify the Pictorial Augmented Reality Markers in Real-Time

机译:使用合成数据的机器学习 - 一种实时学习和分类图形增强现实标记的新方法

获取原文

摘要

The idea of Augmented Reality (AR) appeared in the early 60s, which recently received a large amount of public attention. AR allows us to work, learn, play, and connect with the world around us both virtually and physically in real-time. However, picking the AR marker to match the users’ needs is one of the most challenging tasks due to different marker encryption/decryption methods and essential requirements. Barcode AR cards are fast and efficient, but they do not contain much visual information; pictorial coloured AR card, on the other hand, is slow and not reliable. This paper proposes a solution to obtain detectable arbitrary pictorial/colour AR cards in real-time by applying the benefit of machine learning and the power of synthetic data generation techniques. This technique solves the issue of labour-intensive tasks of manual annotations when building a massive training dataset of deep-learning. Thus, with a small number of input of the AR-enhanced target figures (as few as one for each coloured card), the synthetic data generated process will produce a deep-learning trainable dataset using computer-graphic rendering techniques (ten of thousands from just one image). Second, the generated dataset is then trained with a chosen object recognition convolutional neural network, acting as the AR marker tracking functionality. Our proposed idea works effectively well without modifying the original contents (of the chosen AR card). The benefits of using synthetic data generated techniques help us to improve the AR marker recognition accuracy and reduce the marker registration time. The trained model is capable of processing video sequences at approximately 25 frames per second without GPU Acceleration, which is suitable for AR experience on the mobile/web platform. We believed that it could be a promising low-cost AR approach in many areas, such as education and gaming.
机译:增强现实(AR)的想法出现在60年代初,最近收到了大量的公众关注。 AR允许我们在实时和物理上与我们一起工作,学习,播放和连接世界。但是,采摘AR标记以匹配用户需求是由于不同的标记加密/解密方法和基本要求的最具挑战性任务之一。条形码AR卡速度快,高效,但它们不包含太多的视觉信息;图片彩色AR卡,另一方面,缓慢而不可靠。本文提出了一种通过应用机器学习的好处和合成数据生成技术的功利来实时获得可检测任意图形/彩色AR卡的解决方案。该技术解决了在建立深度学习的大规模培训数据集时,解决了手动注释的劳动密集型任务问题。因此,通过少量输入的AR增强型目标数字(每种彩色卡的少量),合成数据生成的过程将产生使用计算机图形渲染技术的深学习培训数据集(来自数千个来自只是一个图像)。其次,然后使用所选择的对象识别卷积神经网络训练生成的数据集,其作为AR标记跟踪功能。我们所提出的想法有效地运作良好而不修改原始内容(所选的AR卡)。使用合成数据产生的技术的好处有助于我们提高AR标记识别准确性并降低标记注册时间。训练的模型能够在每秒大约25帧的情况下处理视频序列,而没有GPU加速,这适用于移动/网络平台上的AR经验。我们认为,在许多领域,如教育和游戏,可能是一个有前途的低成本AR方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号