首页> 外文会议>IEEE International Congress on Big Data >Evaluation-driven research in data science: Leveraging cross-field methodologies
【24h】

Evaluation-driven research in data science: Leveraging cross-field methodologies

机译:数据科学评价驱动研究:利用跨场方法

获取原文

摘要

While prior evaluation methodologies for data-science research have focused on efficient and effective teamwork on independent data science problems within given fields [1], this paper argues that an enriched notion of evaluation-driven research (EDR) supports methodologies and effective solutions to data-science problems across multiple fields. We adopt the view that progress in data-science research is enriched through the examination of a range of problems in many different areas (traffic, healthcare, finance, sports, etc.) and through the development of methodologies and evaluation paradigms that span diverse disciplines, domains, problems, and tasks. A number of questions arise when one considers the multiplicity of data science fields and the potential for cross-disciplinary “sharing” of methodologies, for example: the feasibility of generalizing problems, tasks, and metrics across domains; ground-truth considerations for different types of problems; issues related to data uncertainty in different fields; and the feasibility of enabling cross-field cooperation to encourage diversity of solutions. We posit that addressing the problems inherent in such questions provides a foundation for EDR across diverse fields. We ground our conclusions and insights in a brief preliminary study developed within the Information Access Division of the National Institute of Standards and Technology as a part of a new Data Science Research Program (DSRP). The DSRP focuses on this cross-disciplinary notion of EDR and includes a new Data Science Evaluation series to facilitate research collaboration, to leverage shared technology and infrastructure, and to further build and strengthen the data-science community.
机译:虽然数据 - 科学研究的现有评估方法专注于给定领域的独立数据科学问题的有效和有效的团队合作[1],但本文认为评估驱动研究(EDR)的丰富概念支持数据的方法和有效的数据解决方案多个字段中的疑问问题。我们采用了数据 - 科学研究的进步,通过考察许多不同地区(交通,医疗保健,金融,体育等)以及开发跨越各种学科的方法和评估范式的一系列问题来验收。 ,域名,问题和任务。当人们考虑多个数据科学领域以及方法的跨学科“共享”的可能性时出现了许多问题,例如:概括域跨域的问题,任务和指标的可行性;不同类型问题的地面真实考虑因素;与不同领域的数据不确定性相关的问题;以及实现跨场合作,鼓励解决方案多样性的可行性。我们对这些问题中固有的问题的问题提供了针对各种领域的EDR的基础。我们在全国标准和技术学院信息访问部门开发的简短初步研究中,我们完成了我们的结论和见解,作为新的数据科学研究计划(DSRP)的一部分。 DSRP侧重于EDR的这种跨学科概念,包括一个新的数据科学评估系列,以促进研究合作,利用共享技术和基础设施,进一步构建和加强数据科学界。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号