...
首页> 外文期刊>International Journal of Biometeorology: Journal of the International Society of Biometeorology >Integrating and analyzing medical and environmental data using ETL and Business Intelligence tools
【24h】

Integrating and analyzing medical and environmental data using ETL and Business Intelligence tools

机译:使用ETL和商业智能工具集成和分析医疗和环境数据

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Processing data that originates from different sources (such as environmental and medical data) can prove to be a difficult task, due to the heterogeneity of variables, storage systems, and file formats that can be used. Moreover, once the amount of data reaches a certain threshold, conventional mining methods (based on spreadsheets or statistical software) become cumbersome or even impossible to apply. Data Extract, Transform, and Load (ETL) solutions provide a framework to normalize and integrate heterogeneous data into a local data store. Additionally, the application of Online Analytical Processing (OLAP), a set of Business Intelligence (BI) methodologies and practices for multidimensional data analysis, can be an invaluable tool for its examination and mining. In this article, we describe a solution based on an ETL + OLAP tandem used for the on-the-fly analysis of tens of millions of individual medical, meteorological, and air quality observations from 16 provinces in Spain provided by 20 different national and regional entities in a diverse array for file types and formats, with the intention of evaluating the effect of several environmental variables on human health in future studies. Our work shows how a sizable amount of data, spread across a wide range of file formats and structures, and originating from a number of different sources belonging to various business domains, can be integrated in a single system that researchers can use for global data analysis and mining.
机译:由于可以使用的变量,存储系统和文件格式的异质性,从不同源(例如环境和医疗数据)发起的处理数据可以证明是一项艰巨的任务。此外,一旦数据量达到某个阈值,传统的采矿方法(基于电子表格或统计软件)变得麻烦甚至不可能申请。数据提取,变换和加载(ETL)解决方案提供了一个框架,用于将异构数据集成到本地数据存储中。此外,在线分析处理(OLAP),一组商业智能(BI)方法和多维数据分析的实践,可以是其检查和采矿的宝贵工具。在本文中,我们描述了一种基于ETL + OLAP串联的解决方案,用于在20万个单独的医疗,气象和空气质量观测到来自西班牙的16个省份提供的20多个国家和区域不同阵列中的实体用于文件类型和格式,目的是在未来的研究中评估几种环境变量对人类健康的影响。我们的工作显示了如何跨越各种文件格式和结构的大量数据,以及源自属于各种商业域的许多不同来源,可以集成在研究人员可以用于全局数据分析的单个系统中和采矿。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号