首页> 外文期刊>ACM Transactions on Interactive Intelligent Systems >Supporting the Design of Machine Learning Workflows with a Recommendation System
【24h】

Supporting the Design of Machine Learning Workflows with a Recommendation System

机译:使用推荐系统支持机器学习工作流程的设计

获取原文
获取原文并翻译 | 示例
       

摘要

Machine learning and data analytics tasks in practice require several consecutive processing steps. Rapid-Miner is a widely used software tool for the development and execution of such analytics workflows. Unlike many other algorithm toolkits, it comprises a visual editor that allows the user to design processes on a conceptual level. This conceptual and visual approach helps the user to abstract from the technical details during the development phase and to retain a focus on the core modeling task. The large set of preimplemented data analysis and machine learning operations available in the tool, as well as their logical dependencies, can, however, be overwhelming in particular for novice users. In this work, we present an add-on to the RapidMiner framework that supports the user during the modeling phase by recommending additional operations to insert into the currently developed machine learning workflow. First, we propose different recommendation techniques and evaluate them in an offline setting using a pool of several thousand existing workflows. Second, we present the results of a laboratory study, which show that our tool helps users to significantly increase the efficiency of the modeling process. Finally, we report on analyses using data that were collected during the real-world deployment of the plug-in component and compare the results of the live deployment of the tool with the results obtained through an offline analysis and a replay simulation.
机译:实际上,机器学习和数据分析任务需要几个连续的处理步骤。 Rapid-Miner是用于开发和执行此类分析工作流的广泛使用的软件工具。与许多其他算法工具包不同,它包含一个可视化编辑器,使用户可以在概念上设计流程。这种概念上和视觉上的方法可帮助用户在开发阶段从技术细节中抽象出来,并专注于核心建模任务。但是,该工具中大量的预实现的数据分析和机器学习操作以及它们的逻辑依存关系可能尤其使新手用户不知所措。在这项工作中,我们提出了RapidMiner框架的附加组件,通过建议在当前开发的机器学习工作流程中插入其他操作来为用户在建模阶段提供支持。首先,我们提出了不同的推荐技术,并使用包含数千个现有工作流的池在脱机设置中对其进行评估。其次,我们介绍了一项实验室研究的结果,表明我们的工具可帮助用户显着提高建模过程的效率。最后,我们使用在插件组件的实际部署期间收集的数据报告分析,并将工具的实时部署结果与通过脱机分析和重播模拟获得的结果进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号