Journal: Information Systems

Multi-source uncertain entity resolution: Transforming Holocaust victim reports into people

Abstract

In this work we present a multi-source uncertain entity resolution model and show its implementation in a use case of Yad Vashem, the central repository of Holocaust-era information. The Yad Vashem dataset is unique with respect to classic entity resolution by virtue of being massively multi-source and requiring multi-level entity resolution. With today's abundance of information sources, this project motivates the use of multi-source resolution on a big-data scale. We instantiate the proposed model using the MFIBlocks entity resolution algorithm and a machine learning approach based upon decision trees to transform soft clusters into a ranked clustering of records representing possible entities. An extensive empirical evaluation demonstrates the unique properties of this dataset that make it a good candidate for multi-source entity resolution. We conclude by proposing avenues for future research in this realm.
