首页> 外文期刊>PLoS One >Designing machine learning workflows with an application to topological data analysis
【24h】

Designing machine learning workflows with an application to topological data analysis

机译:设计机器学习工作流程与拓扑数据分析的应用

获取原文
           

摘要

In this paper we define the concept of the Machine Learning Morphism (MLM) as a fundamental building block to express operations performed in machine learning such as data preprocessing, feature extraction, and model training. Inspired by statistical learning, MLMs are morphisms whose parameters are minimized via a risk function. We explore operations such as composition of MLMs and when sets of MLMs form a vector space. These operations are used to build a machine learning workflow from data preprocessing to final task completion. We examine the Mapper Algorithm from Topological Data Analysis as an MLM, and build several workflows for binary classification incorporating Mapper on Hospital Readmissions and Credit Evaluation datasets. The advantage of this framework lies in the ability to easily build, organize, and compare multiple workflows, and allows joint optimization of parameters across multiple steps in an application.
机译:在本文中,我们将机器学习态势(MLM)的概念定义为基本构建块,以表达在机器学习中执行的操作,例如数据预处理,特征提取和模型训练。 灵感来自统计学习,MLMS是通过风险功能最小化的态度的态度。 我们探索MLMS的组成等操作以及MLMS形成矢量空间的组成。 这些操作用于构建从数据预处理到最终任务完成的计算机学习工作流程。 从拓扑数据分析中检查Mapper算法作为MLM,并在医院入院和信用评估数据集中构建结合Mapper的二进制分类工作流程。 该框架的优势在于轻松构建,组织和比较多个工作流程的能力,并允许在应用程序中进行多个步骤联合优化参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号