...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Addressing Sample Size Challenges in Linked Data Through Data Fusion
【24h】

Addressing Sample Size Challenges in Linked Data Through Data Fusion

机译:通过数据融合解决链接数据中的示例大小挑战

获取原文
           

摘要

Linking secondary clinical data with patient-reported data at the patient-level brings together a comprehensive view of the patient but sample sizes can be a challenge. This study demonstrates the fusion of Patient Reported Outcomes in surveys with clinical data in claims enabling the study of associations between quality of life and disease-treatment interactions at scale especially for rare diseases. In this work, we show the ability to implement data fusion in a disease agnostic way thereby enabling the use of more advanced machine learning algorithms on larger data sets, while still being able to use the resulting fused data to perform disease specific analysis. This is in contrast to usual approaches where the data fusion might be attempted on disease specific data sets which can be too small to be amenable to analysis by advanced methods. The proposed data fusion methodology circumvents some of the assumptions typically imposed on the data fusion process that are untestable and usually invalid by taking advantage of the subset of the data that can be linked in the two data sources.
机译:将患者报告的患者级数据链接辅助临床数据在患者级别带来综合患者的综合视图,但样本尺寸可能是一个挑战。本研究表明,患者的融合报告的调查结果,索赔中的临床资料能够研究生命质量与疾病治疗相互作用之间的关联,特别是对于罕见疾病。在这项工作中,我们展示了以疾病侵害的方式实现数据融合的能力,从而能够在较大的数据集上使用更先进的机器学习算法,同时仍然能够使用所产生的融合数据来执行疾病特定的分析。这与通常的方法形成对比,其中数据融合可能会对疾病特定数据集进行,这可能太小,无法通过高级方法进行分析。所提出的数据融合方法避免了通常施加在数据融合过程上的一些假设,这些假设是不可发挥的,并且通常通过利用可以在两个数据源中链接的数据的子集而无效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号