首页> 外文期刊>International Journal of Data Science and Analytics >An improved machine learning application for the integration of record systems for missing US service members
【24h】

An improved machine learning application for the integration of record systems for missing US service members

机译:一种改进的机器学习应用,用于集成缺少美国服务成员的记录系统

获取原文
获取原文并翻译 | 示例
           

摘要

The Defense POW/MIA Accounting Agency (DPAA) continues to diligently locate, recover, and identify over 81,000 missing US service members from past conflicts. To fulfill this important mission, massive amounts of information must be integrated from historical records, genealogy records, anthropological data, archaeological data, odontology data, and DNA. Previously a machine learning record-linkage application was developed to integrate DNA Family Reference Samples (FRS) data systems with the DPAA's master data. This application was shown to link large record systems with a high level of accuracy and precision. Here this work is extended to further optimize the blocking strategy used during record linkage as well as the record match alpha-level threshold for the Bayesian Classifier. Optimization of the blocking strategy was able to improve application run-time per record by 20%. After record-match alpha-level optimization, the application was found to link 89.6% of the record out-group to DPAA master data at an accuracy of 99.6%. The improved run-time efficiency and match rate of the record-linkage pipeline will greatly benefit not only the DPAA's FRS import process but also the linking of other big data sources supporting the DPAA mission.
机译:国防战俘/ MIA会计机构(DPAA)继续努力从过去的冲突中努力地定位,恢复,并确定81,000多名美国服务会员。为了满足这一重要的任务,必须从历史记录,家谱记录,人类学数据,考古数据,牙道数据和DNA中融入大量信息。以前,开发了一种机器学习记录链接应用程序以将DNA系列参考样本(FRS)数据系统与DPAA的主数据集成。此应用程序显示为将具有高精度和精度的大型记录系统链接。此处扩展了这项工作,以进一步优化记录链接期间使用的阻塞策略以及贝叶斯分类器的记录匹配alpha级阈值。封锁策略的优化能够将每录记录的申请运行时间提高20%。在录制匹配的alpha级优化后,发现应用程序将89.6%的记录输出组链接到DPAA主数据,精度为99.6%。 Record-Linkage管道的提高运行时间效率和匹配率将极大地利用DPAA的FRS导入过程,也将极大地利用,但也是支持DPAA任务的其他大数据源的链接。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号