Conference: AAAI Symposia

Machine Representation of Data Analyses: Towards a Platform for Collaborative Data Science

Abstract

Artificial intelligence and data science play an increasingly important role in solving today's scientific and social challenges. To be successful, the data-driven approach to social good requires effective collaboration between data scientists, subject-matter experts, policymakers, and other stakeholders. We envision a cloud platform for data science that would facilitate collaboration between stakeholders and possess AI capabilities for discovering, benchmarking, and organizing data analyses. Here we present a foundational technology motivated by this vision. Our system automatically extracts a high-level dataflow graph from a data analysis. The graph describes how data flows through an analysis pipeline, including which statistical methods are used and how they fit together. The system requires no special annotations from the data analyst and consumes analyses written in Python using standard tools, such as Scikit-learn and StatsModels. In this paper, we explain how our system works and how it fits into our larger vision for a collaborative data science platform.
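To make the idea of a dataflow graph concrete, the following is a minimal sketch and not the system described in the abstract, whose extraction method is not detailed here. It assumes a much simpler technique: static inspection of top-level assignments with Python's standard `ast` module. The function name `extract_dataflow`, the sample script, and the file name `iris.csv` are all hypothetical; the sample script is only parsed, never executed.

```python
import ast

# Hypothetical analysis script; it is parsed as text and never run.
ANALYSIS = """
import pandas as pd
from sklearn.cluster import KMeans

data = pd.read_csv("iris.csv")
features = data.drop("species", axis=1)
model = KMeans(n_clusters=3)
labels = model.fit_predict(features)
"""


def extract_dataflow(source: str) -> list[dict]:
    """Describe each top-level `name = call(...)` statement as one graph step.

    Each step records the variable it produces, the call that produces it,
    and the previously assigned variables that the call reads.
    """
    defined = set()  # variables assigned by earlier steps
    steps = []
    for node in ast.parse(source).body:
        # Skip imports and anything that is not a simple `name = call(...)`.
        if not (isinstance(node, ast.Assign) and isinstance(node.value, ast.Call)):
            continue
        if len(node.targets) != 1 or not isinstance(node.targets[0], ast.Name):
            continue
        call = node.value
        # A name counts as an input only if an earlier step assigned it,
        # which filters out modules and classes such as `pd` or `KMeans`.
        inputs = sorted(
            {n.id for n in ast.walk(call) if isinstance(n, ast.Name)} & defined
        )
        output = node.targets[0].id
        steps.append(
            {"output": output, "call": ast.unparse(call.func), "inputs": inputs}
        )
        defined.add(output)
    return steps


steps = extract_dataflow(ANALYSIS)
for step in steps:
    for src in step["inputs"]:
        print(f"{src} -> {step['output']} via {step['call']}")
```

Each printed line is one edge of the graph, from an input variable to the variable computed from it. A static sketch like this sees only names, not the objects behind them, so it cannot tell which statistical method a call actually invokes; that limitation is one reason a real system would need more than syntax alone.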
