Comparing the Performance of NoSQL Approaches for Managing Archetype-Based Electronic Health Record Data

Abstract

This study provides an experimental performance evaluation on population-based queries of NoSQL databases storing archetype-based Electronic Health Record (EHR) data. There are few published studies regarding the performance of persistence mechanisms for systems that use multilevel modelling approaches, especially when the focus is on population-based queries. A healthcare dataset with 4.2 million records stored in a relational database (MySQL) was used to generate XML and JSON documents based on the openEHR reference model. Six datasets with different sizes were created from these documents and imported into three single machine XML databases (BaseX, eXistdb and Berkeley DB XML) and into a distributed NoSQL database system based on the MapReduce approach, Couchbase, deployed in different cluster configurations of 1, 2, 4, 8 and 12 machines. Population-based queries were submitted to those databases and to the original relational database. Database size and query response times are presented. The XML databases were considerably slower and required much more space than Couchbase. Overall, Couchbase had better response times than MySQL, especially for larger datasets. However, Couchbase requires indexing for each differently formulated query and the indexing time increases with the size of the datasets. The performances of the clusters with 2, 4, 8 and 12 nodes were not better than the single node cluster in relation to the query response time, but the indexing time was reduced proportionally to the number of nodes. The tested XML databases had acceptable performance for openEHR-based data in some querying use cases and small datasets, but were generally much slower than Couchbase. Couchbase also outperformed the response times of the relational database, but required more disk space and had a much longer indexing time. 
Systems like Couchbase are thus interesting research targets for scalable storage and querying of archetype-based EHR data when population-based use cases are of interest.
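
The abstract notes that Couchbase needs a separate index for each differently formulated query. The sketch below illustrates that idea in plain Python; it is not the Couchbase API and not the authors' code. The document fields (`patient_id`, `diagnosis`, `year`) and the helper names are hypothetical stand-ins for the much richer openEHR-based JSON documents used in the study.

```python
from collections import defaultdict

# Hypothetical, heavily simplified stand-ins for openEHR-based JSON documents.
documents = [
    {"patient_id": "p1", "diagnosis": "I10", "year": 2010},
    {"patient_id": "p2", "diagnosis": "E11", "year": 2010},
    {"patient_id": "p3", "diagnosis": "I10", "year": 2011},
    {"patient_id": "p1", "diagnosis": "I10", "year": 2011},
]


def map_by_diagnosis(doc):
    """Map step: emit one (key, value) pair per document, keyed by diagnosis."""
    yield doc["diagnosis"], 1


def map_by_year(doc):
    """A differently formulated query needs its own map function."""
    yield doc["year"], 1


def reduce_count(values):
    """Reduce step: count the values emitted for one key."""
    return sum(values)


def build_view(docs, map_fn, reduce_fn):
    """Precompute a view: scan every document once, group by key, then reduce.

    The full scan is why indexing time grows with dataset size.
    """
    grouped = defaultdict(list)
    for doc in docs:
        for key, value in map_fn(doc):
            grouped[key].append(value)
    return {key: reduce_fn(values) for key, values in grouped.items()}


# Each population-based query gets its own precomputed view.
records_per_diagnosis = build_view(documents, map_by_diagnosis, reduce_count)
records_per_year = build_view(documents, map_by_year, reduce_count)

print(records_per_diagnosis)
print(records_per_year)
```

Once a view exists, answering the query is a lookup rather than a scan, which is consistent with the reported pattern of fast query responses but long indexing times. Changing the grouping key means building a new view from scratch.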