首页> 外文会议>IEEE International Conference on Data Mining Workshops >Historical Data Integration a Study of WWI Canadian Soldiers
【24h】

Historical Data Integration a Study of WWI Canadian Soldiers

机译:历史数据集成-一战加拿大士兵的研究

获取原文

摘要

Record linkage is the process of identifying and linking records that refer to the same entities across several databases. In this paper we integrate three historical data sources (Canadian soldiers in the Canadian Expeditionary Force (CEF) who served in World War I, CEF casualties of World War I, and the Canadian census of 1901) to study the Canadian soldiers and casualties of World War I. We link the soldiers dataset to the casualties one to be able to identify the soldiers that died in WWI. In addition, we link the soldiers dataset to the Canadian census of 1901 to enrich the available attributes. The goal is to generate longitudinal data about the Canadian soldiers that would allow researchers to perform a systematic analysis of who lived and who died. The imprecision of historical data, along with the unavailability of expert links and a limited number of attributes make the linkage process a challenging task. We present in this paper methodology to integrate the three data sources and a preliminary analysis of the longitudinal data.
机译:记录链接是在多个数据库中标识和链接引用相同实体的记录的过程。在本文中,我们整合了三个历史数据源(在第一次世界大战中服役的加拿大远征军(CEF)中的加拿大士兵,第一次世界大战中的CEF伤亡人数和1901年的加拿大人口普查)来研究世界上的加拿大士兵和伤亡人数第一次世界大战。我们将士兵数据集与伤亡人数相关联,以便能够识别在第一次世界大战中丧生的士兵。此外,我们将士兵数据集链接到1901年的加拿大人口普查,以丰富可用的属性。目的是生成有关加拿大士兵的纵向数据,这将使研究人员能够对谁活着和谁死进行系统的分析。历史数据的不精确性,以及专家链接的不可用和有限数量的属性,使得链接过程成为一项艰巨的任务。我们在本文中介绍了整合三个数据源和纵向数据的初步分析的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号