首页> 外文期刊>BMC Medical Informatics and Decision Making >Big data hurdles in precision medicine and precision public health
【24h】

Big data hurdles in precision medicine and precision public health

机译:大数据阻碍了精准医学和精准公共卫生

获取原文
       

摘要

Nowadays, trendy research in biomedical sciences juxtaposes the term ‘precision’ to medicine and public health with companion words like big data, data science, and deep learning. Technological advancements permit the collection and merging of large heterogeneous datasets from different sources, from genome sequences to social media posts or from electronic health records to wearables. Additionally, complex algorithms supported by high-performance computing allow one to transform these large datasets into knowledge. Despite such progress, many barriers still exist against achieving precision medicine and precision public health interventions for the benefit of the individual and the population. The present work focuses on analyzing both the technical and societal hurdles related to the development of prediction models of health risks, diagnoses and outcomes from integrated biomedical databases. Methodological challenges that need to be addressed include improving semantics of study designs: medical record data are inherently biased, and even the most advanced deep learning’s denoising autoencoders cannot overcome the bias if not handled a priori by design. Societal challenges to face include evaluation of ethically actionable risk factors at the individual and population level; for instance, usage of gender, race, or ethnicity as risk modifiers, not as biological variables, could be replaced by modifiable environmental proxies such as lifestyle and dietary habits, household income, or access to educational resources. Data science for precision medicine and public health warrants an informatics-oriented formalization of the study design and interoperability throughout all levels of the knowledge inference process, from the research semantics, to model development, and ultimately to implementation.
机译:如今,生物医学科学的流行研究将“精密”一词与大数据,数据科学和深度学习等伴随词并列在一起。技术进步允许从不同来源收集和合并大型异构数据集,从基因组序列到社交媒体帖子,或者从电子健康记录到可穿戴设备。此外,高性能计算支持的复杂算法使人们能够将这些大型数据集转化为知识。尽管取得了这些进展,但在为个人和人口的利益而实现精密医学和精密公共卫生干预措施方面仍然存在许多障碍。本工作着重于分析与健康风险预测模型,综合生物医学数据库的诊断和结果的预测模型的发展有关的技术和社会障碍。需要解决的方法论挑战包括改善研究设计的语义:病历数据天生就有偏差,即使最先进的深度学习的降噪自动编码器也无法克服偏差,如果不通过设计先验地解决。面临的社会挑战包括在个人和人口层面评估在道德上可行的风险因素;例如,可以将性别,种族或种族作为风险调节剂,而不是作为生物学变量,可以用可修改的环境替代代替,例如生活方式和饮食习惯,家庭收入或获得教育资源。精准医学和公共卫生的数据科学保证了研究设计和互操作性的面向信息学的形式化,贯穿于知识推理过程的各个层次,从研究语义到模型开发,最终到实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号