首页> 外文学位 >Evaluating the portability of health care data to SQL like Big Data environment.

【24h】

Evaluating the portability of health care data to SQL like Big Data environment.

机译：评估医疗保健数据到像大数据环境这样的SQL的可移植性。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Big Data deals with huge-volumes of complex, exponentially growing data sets from multiple, sources. With rapid growth in networking we are now able to generate immense amount of data in almost any field imaginable, including physical, biological and biomedical sciences. While most industries have been far more successful at harnessing the value from large-scale integration and analysis of big data, the health care industry is just getting its feet wet. One impediment for the Health Care industry adoption of Big Data analytics has been the dependence of many of their models on the RDBMS technology. With the diversity and amounts of data in health care industry there is an increasing need to evaluate components in big data frameworks and gauge their adaptability to analytics techniques. However, recent developments in the Hadoop ecosystem environment have led to breakthroughs enabling RDBMS like tools in big data environments. In this paper we evaluate the portability of existing RDBMS solutions employing such SQL like big data tools. Our work focuses on benchmarking multiple SQL like big data technologies over HDFS for Study Data Tabulation Model (SDTM) used in clinical trial databases for improving the efficiency of research in clinical trials. We will examine their potential for improving the efficiency of research in big data clinical trials. Publicly available healthcare data (from National Institute of Drug Abuse (NIDA)) is utilized as a test bed to measure key parameters like usability, adaptability and modularity, robustness and efficiency. Our intention is to demonstrate the portability of the execution of ad-hoc SQL queries on the fly occurring in current clinical trial functionality and evaluate if it can be replicated in a big data SQL like back-end system with relative ease and transparency.

机译：大数据处理来自多个来源的大量复杂，呈指数增长的数据集。随着网络的快速发展，我们现在能够在几乎任何可以想象的领域（包括物理，生物和生物医学科学）中生成大量数据。尽管大多数行业在利用大规模集成和大数据分析带来的价值方面取得了更大的成功，但医疗保健行业才刚刚起步。卫生保健行业采用大数据分析的一个障碍是其许多模型都依赖RDBMS技术。随着医疗保健行业中数据的多样性和数量的增加，越来越需要评估大数据框架中的组件并评估其对分析技术的适应性。但是，Hadoop生态系统环境的最新发展带来了突破，使RDBMS像大数据环境中的工具一样。在本文中，我们评估了使用像大数据工具这样的SQL的现有RDBMS解决方案的可移植性。我们的工作重点是通过HDFS对用于临床试验数据库中的研究数据列表模型（SDTM）的多个SQL之类的大数据技术进行基准测试，以提高临床试验的研究效率。我们将研究它们在提高大数据临床试验研究效率方面的潜力。公开可用的医疗数据（来自美国药物滥用研究所（NIDA））被用作测试床，以测量关键参数，如可用性，适应性和模块化，稳健性和效率。我们的目的是演示在当前临床试验功能中即时执行即席SQL查询的可移植性，并评估它是否可以相对容易和透明地复制到后端系统等大数据SQL中。

著录项

作者
Grover, Akshay.;
展开▼
作者单位

University of Maryland, Baltimore County.;

展开▼
授予单位 University of Maryland, Baltimore County.;
学科 Computer science.;Information science.
学位 M.S.
年度 2015
页码 62 p.
总页数 62
原文格式 PDF
正文语种 eng
中图分类
关键词
入库时间 2022-08-17 11:52:26

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of Nosql-Cassandra over Relational Data Store-Mysql for Bigdata [J] . Sangeeta Gupta, Narsimha International Journal of Technology . 2015,第4期

机译：Nosql-Cassandra在关系数据存储-Mysql大数据上的性能评估
2. Performance Evaluation of Nosql-Cassandra over Relational Data Store-Mysql for Bigdata [J] . Sangeeta Gupta, Narsimha International Journal of Technology . 2015,第4期

机译：Nosql-Cassandra在关系数据存储-Mysql大数据上的性能评估
3. Evaluation of Electronic and Paper-Pen Data Capturing Tools for Data Quality in a Public Health Survey in a Health and Demographic Surveillance Site, Ethiopia: Randomized Controlled Crossover Health Care Information Technology Evaluation [J] . Atinkut Alamirrew Zeleke, Abebaw Gebeyehu Worku, Adina Demissie, JMIR mHealth and uHealth . 2019,第2期

机译：埃塞俄比亚卫生和人口普查站点公共卫生调查中用于数据质量的电子纸笔数据捕获工具的评估：随机控制交叉医疗信息技术评估
4. Implementation of Data Transform Method into NoSQL Database for Healthcare Data [C] . Yang Chao-Tung, Liu Jung-Chun, Hsu Wen-Hung, International Conference on Parallel and Distributed Computing, Applications and Technologies . 2013

机译：NoSQL数据库中用于医疗保健数据的数据转换方法的实现
5. A Quantitative Evaluation of the Performance Impact of Type-I Virtualization on a NewSQL Relational Database Management System [D] . Osborne, James Bryan. 2020

机译：I型虚拟化对NewsQL关系数据库管理系统的性能影响的定量评估
6. Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency [O] . Rodrigo Aniceto, Rene Xavier, Valeria Guimarães, 2015

机译：评估Cassandra NoSQL数据库方法的基因组数据持久性
7. Health, health care, and the environment. Econometric evidence from German micro data [O] . Manfred Erbsland, Walter Ried, Volker Ulrich 1995

机译：健康，医疗保健和环境。来自德国微数据的计量计量证据

Evaluating the portability of health care data to SQL like Big Data environment.

摘要

著录项

相似文献

相关主题

期刊订阅