Designing machine learning workflows with an application to topological data analysis

Eric Cawi; Patricio S. La Rosa; Arye Nehorai

首页> 外文期刊>PLoS One >Designing machine learning workflows with an application to topological data analysis

【24h】

Designing machine learning workflows with an application to topological data analysis

机译：设计机器学习工作流程与拓扑数据分析的应用

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we define the concept of the Machine Learning Morphism (MLM) as a fundamental building block to express operations performed in machine learning such as data preprocessing, feature extraction, and model training. Inspired by statistical learning, MLMs are morphisms whose parameters are minimized via a risk function. We explore operations such as composition of MLMs and when sets of MLMs form a vector space. These operations are used to build a machine learning workflow from data preprocessing to final task completion. We examine the Mapper Algorithm from Topological Data Analysis as an MLM, and build several workflows for binary classification incorporating Mapper on Hospital Readmissions and Credit Evaluation datasets. The advantage of this framework lies in the ability to easily build, organize, and compare multiple workflows, and allows joint optimization of parameters across multiple steps in an application.

机译：在本文中，我们将机器学习态势（MLM）的概念定义为基本构建块，以表达在机器学习中执行的操作，例如数据预处理，特征提取和模型训练。灵感来自统计学习，MLMS是通过风险功能最小化的态度的态度。我们探索MLMS的组成等操作以及MLMS形成矢量空间的组成。这些操作用于构建从数据预处理到最终任务完成的计算机学习工作流程。从拓扑数据分析中检查Mapper算法作为MLM，并在医院入院和信用评估数据集中构建结合Mapper的二进制分类工作流程。该框架的优势在于轻松构建，组织和比较多个工作流程的能力，并允许在应用程序中进行多个步骤联合优化参数。

著录项

来源
《PLoS One》 |2019年第12期|共26页
作者
Eric Cawi; Patricio S. La Rosa; Arye Nehorai;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医药、卫生;
关键词

相似文献

外文文献
中文文献
专利

1. giotto-tda: : A Topological Data Analysis Toolkit for Machine Learning and Data Exploration [J] . Guillaume Tauzin, Umberto Lupo, Lewis Tunstall, Journal of machine learning research . 2021,第a期

机译：Giotto-TDA：：用于机器学习和数据探索的拓扑数据分析工具包
2. Topological data analysis and machine learning for recognizing atmospheric river patterns in large climate datasets [J] . Muszynski Grzegorz, Kashinath Karthik, Kurlin Vitaliy, Geoscientific Model Development . 2019,第2期

机译：用于识别大型气候数据集中大气河型的拓扑数据分析和机器学习
3. Topological data analysis and machine learning for recognizing atmospheric river patterns in large climate datasets [J] . Muszynski Grzegorz, Kashinath Karthik, Kurlin Vitaliy, Geoscientific Model Development Discussions . 2019,第2期

机译：大型气候数据集中识别大气河流模式的拓扑数据分析与机器学习
4. Proteomic and informatic approaches in the U-BIOPRED severe asthma project: large-scale MS~E, data mining, topological data analysis, and machine learning [C] . Dominic Burg, Doroteya Staykova, Xian Yang, ASMS Conference on Mass Spectrometry and Allied Topics . 2014

机译：U-Biopred严重哮喘项目中的蛋白质组学和信息方法：大规模MS〜E，数据挖掘，拓扑数据分析和机器学习
5. Feature Extraction Using Topological Data Analysis for Machine Learning and Network Science Applications [D] . Guo, Wei . 2020

机译：采用机器学习和网络科学应用的拓扑数据分析特征提取
6. Correction: Designing machine learning workflows with an application to topological data analysis [O] . Eric Cawi, Patricio S La Rosa, Arye Nehorai 2020

机译：校正：设计机器学习工作流程及其在拓扑数据分析中的应用
7. A machine learning workflow for molecular analysis: application to melting points [O] . Ganesh Sivaraman, Nicholas E Jackson, Benjamin Sanchez-Lengeling, 2020

机译：用于分子分析的机器学习工作流程：熔化点的应用

Designing machine learning workflows with an application to topological data analysis

摘要

著录项

相似文献

相关主题

期刊订阅