首页> 外文会议>IEEE International Congress on Big Data >Evaluation-driven research in data science: Leveraging cross-field methodologies
【24h】

Evaluation-driven research in data science: Leveraging cross-field methodologies

机译:以评估为导向的数据科学研究:利用跨领域方法

获取原文

摘要

While prior evaluation methodologies for data-science research have focused on efficient and effective teamwork on independent data science problems within given fields [1], this paper argues that an enriched notion of evaluation-driven research (EDR) supports methodologies and effective solutions to data-science problems across multiple fields. We adopt the view that progress in data-science research is enriched through the examination of a range of problems in many different areas (traffic, healthcare, finance, sports, etc.) and through the development of methodologies and evaluation paradigms that span diverse disciplines, domains, problems, and tasks. A number of questions arise when one considers the multiplicity of data science fields and the potential for cross-disciplinary “sharing” of methodologies, for example: the feasibility of generalizing problems, tasks, and metrics across domains; ground-truth considerations for different types of problems; issues related to data uncertainty in different fields; and the feasibility of enabling cross-field cooperation to encourage diversity of solutions. We posit that addressing the problems inherent in such questions provides a foundation for EDR across diverse fields. We ground our conclusions and insights in a brief preliminary study developed within the Information Access Division of the National Institute of Standards and Technology as a part of a new Data Science Research Program (DSRP). The DSRP focuses on this cross-disciplinary notion of EDR and includes a new Data Science Evaluation series to facilitate research collaboration, to leverage shared technology and infrastructure, and to further build and strengthen the data-science community.
机译:尽管先前的数据科学研究评估方法着眼于在给定领域内针对独立数据科学问题的有效和有效的团队合作[1],但本文认为,评估驱动研究(EDR)的丰富概念支持数据的方法论和有效解决方案跨多个领域的科学问题。我们认为,通过研究许多不同领域(交通,医疗保健,金融,体育等)中的一系列问题以及通过开发涵盖不同学科的方法和评估范式,可以丰富数据科学研究的进展,领域,问题和任务。当人们考虑数据科学领域的多样性以及跨学科“共享”方法的潜力时,就会产生许多问题,例如:跨领域归纳问题,任务和度量的可行性;针对不同类型问题的实际考虑;与不同领域的数据不确定性有关的问题;以及进行跨领域合作以鼓励解决方案多样化的可行性。我们认为,解决此类问题中固有的问题为跨不同领域的EDR提供了基础。在新的数据科学研究计划(DSRP)的一部分下,我们在美国国家标准技术研究院信息访问部门进行的简短初步研究中,基于我们的结论和见识。 DSRP专注于EDR的跨学科概念,并包括一个新的数据科学评估系列,以促进研究合作,利用共享技术和基础架构,并进一步建立和加强数据科学界。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号