首页> 外文会议>International conference on business information systems >Towards a Record Linkage Layer to Support Big Data Integration
【24h】

Towards a Record Linkage Layer to Support Big Data Integration

机译:走向记录链接层以支持大数据集成

获取原文

摘要

Record linkage is a crucial step in big data integration (DDI). It is also one of its major challenges with the increasing number of structured data sources that need to be linked and do not share common attributes. Our research-in-progress aims to develop a record linkage layer that assists data scientist in integrating a variety of data sources. A structured literature review of 68 papers reveals (1) key data sets, (2) available classification algorithms (match or no match), and (3) similarity measures to consider in BDI projects. The results highlight the foundational requirements for the development of the record linkage layer such as processing unstructured attributes. As BDI emerges as a priority for industry, our work proposes a record linkage layer that provide similarity measures and integration algorithms while assisting its selection. A record linkage layer can contribute to big data adoption in industry settings and improve quality of big data integration processes to effectively support business decision-making.
机译:记录链接是大数据集成(DDI)中的关键步骤。这也是其主要挑战之一,因为越来越多的结构化数据源需要链接并且不共享公共属性。我们正在进行的研究旨在开发一个记录链接层,以帮助数据科学家集成各种数据源。对68篇论文的结构化文献综述揭示了(1)关键数据集,(2)可用的分类算法(匹配或不匹配)以及(3)BDI项目中要考虑的相似性度量。结果突出了开发记录链接层的基本要求,例如处理非结构化属性。随着BDI成为行业的优先事项,我们的工作提出了一个记录链接层,该层提供相似性度量和集成算法,同时协助其选择。记录链接层可以促进大数据在行业环境中的采用,并提高大数据集成流程的质量,以有效地支持业务决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号