首页> 外文会议>International Conference for Convergence in Technology >Surface Automation - Interacting with Applications using Black Box Approach
【24h】

Surface Automation - Interacting with Applications using Black Box Approach

机译:表面自动化 - 使用黑色盒子方法与应用程序进行交互

获取原文
获取外文期刊封面目录资料

摘要

One of the biggest challenges of any Robotic Process Automation (RPA) tools is automating scenarios in tricky environments such as virtual machines (VMs) and remote desktops or when dealing with legacy applications where the technical know-how of the interfaces are limited. Typical approach to interacting with applications under regular scenarios is to use application technology specific APIs to build knowledge metadata. However, these approaches fail when it comes to automating in the black box environments because we cannot have technical understanding or APIs of the applications. Now, the only way to automate here is by taking the screenshot of the applications and understanding the controls available inside the screenshots using image processing algorithms. In this paper we present a novel approach for an end-to-end automation of such scenarios. We employ various AI routines to understand structural and semantic aspects of enterprise applications which in turn are used for building automations and we group all these techniques under a common umbrella of Surface Automation. Our approach involves using robust Optical Character Recognition (OCR) which internally uses LSTMs for reading texts like labels, Natural Language Processing (NLP) for semantic understanding of the page, Image Region Similarity for matching icons like home button and Computer Vision based Object Detection for recognizing predefined controls [text-fields, buttons, dropdown etc.].
机译:任何机器人过程自动化(RPA)工具的最大挑战之一就是在棘手的环境中自动化方案,例如虚拟机(VM)和远程桌面,或者在处理接口技术知识的遗留应用程序中。在定期场景下与应用程序交互的典型方法是使用应用技术特定的API构建知识元数据。但是,在黑盒环境中自动化时,这些方法由于我们不能具有技术理解或应用程序的API而失败。现在,这里唯一的自动化方法是使用图像处理算法拍摄应用程序的屏幕截图并了解屏幕截图中可用的控件。在本文中,我们提出了一种新的方法,用于这种情况的端到端自动化。我们采用各种AI例程来了解企业应用程序的结构和语义方面,这些应用程序又用于建立自动化,并在普通的表面自动化伞下对所有这些技术进行组。我们的方法涉及使用强大的光学字符识别(OCR),内部使用LSTMS来读取标签,自然语言处理(NLP)等文本,用于对页面的语义理解,用于匹配家庭按钮和基于计算机视觉的对象检测的图标的图像区域相似度识别预定义控件[文本字段,按钮,下拉等]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号