首页> 外文会议>International conference on enterprise information systems >Responsible Data Science: Using Event Data in a 'People Friendly' Manner
【24h】

Responsible Data Science: Using Event Data in a 'People Friendly' Manner

机译:负责任的数据科学:以“亲民”的方式使用事件数据

获取原文

摘要

The omnipresence of event data and powerful process mining techniques make it possible to quickly learn process models describing what people and organizations really do. Recent breakthroughs in process mining resulted in powerful techniques to discover the real processes, to detect deviations from normative process models, and to analyze bottlenecks and waste. Process mining and other data science techniques can be used to improve processes within any organization. However, there are also great concerns about the use of data for such purposes. Increasingly, customers, patients, and other stakeholders worry about "irresponsible" forms of data science. Automated data decisions may be unfair or non-transparent. Confidential data may be shared unintentionally or abused by third parties. Each step in the "data science pipeline" (from raw data to decisions) may create inaccuracies, e.g., if the data used to learn a model reflects existing social biases, the algorithm is likely to incorporate these biases. These concerns could lead to resistance against the large-scale use of data and make it impossible to reap the benefits of process mining and other data science approaches. This paper discusses Responsible Process Mining (RPM) as a new challenge in the broader field of Responsible Data Science (RDS). Rather than avoiding the use of (event) data altogether, we strongly believe that techniques, infrastructures and approaches can be made responsible by design. Not addressing the challenges related to RPM/RDS may lead to a society where (event) data are misused or analysis results are deeply mistrusted.
机译:事件数据的无处不在和强大的流程挖掘技术使快速学习描述人员和组织实际行为的流程模型成为可能。在过程挖掘方面的最新突破导致了强大的技术来发现实际过程,检测与规范过程模型的偏差以及分析瓶颈和浪费。流程挖掘和其他数据科学技术可用于改善任何组织内的流程。但是,对于将数据用于此类目的也存在极大的担忧。客户,患者和其他利益相关者越来越担心数据科学的“不负责任”形式。自动化的数据决策可能是不公平的或不透明的。机密数据可能无意间被共享或被第三方滥用。 “数据科学流水线”中的每个步骤(从原始数据到决策)可能会产生不准确性,例如,如果用于学习模型的数据反映了现有的社会偏见,则该算法可能会合并这些偏见。这些担忧可能导致对大规模使用数据的抵制,并使其无法获得流程挖掘和其他数据科学方法的好处。本文讨论了负责任的流程挖掘(RPM),这是更广泛的负责任的数据科学(RDS)领域中的新挑战。我们坚决相信,并非完全避免使用(事件)数据,而是可以通过设计使技术,基础架构和方法负责。不解决与RPM / RDS相关的挑战可能会导致一个社会,其中(事件)数据被滥用或分析结果受到高度信任。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号