首页> 外文期刊>Cloud Computing, IEEE Transactions on >Splitting Large Medical Data Sets Based on Normal Distribution in Cloud Environment
【24h】

Splitting Large Medical Data Sets Based on Normal Distribution in Cloud Environment

机译:基于云环境中的正态分布分离大医学数据集

获取原文
获取原文并翻译 | 示例
       

摘要

The surge of medical and e-commerce applications has generated tremendous amount of data, which brings people to a so-called “Big Data” era. Different from traditional large data sets, the term “Big Data” not only means the large size of data volume but also indicates the high velocity of data generation. However, current data mining and analytical techniques are facing the challenge of dealing with large volume data in a short period of time. This paper explores the efficiency of utilizing the Normal Distribution (ND) method for splitting and processing large volume medical data in cloud environment, which can provide representative information in the split data sets. The ND-based new model consists of two stages. The first stage adopts the ND method for large data sets splitting and processing, which can reduce the volume of data sets. The second stage implements the ND-based model in a cloud computing infrastructure for allocating the split data sets. The experimental results show substantial efficiency gains of the proposed method over the conventional methods without splitting data into small partitions. The ND-based method can generate representative data sets, which can offer efficient solution for large data processing. The split data sets can be processed in parallel in Cloud computing environment.
机译:医疗和电子商务应用程序的激增产生了巨大的数据,使人们带来了一个所谓的“大数据”时代。与传统的大数据集不同,术语“大数据”不仅意味着大尺寸的数据量,而且表示数据生成的高速度。然而,目前的数据挖掘和分析技术正面临在短时间内处理大量数据的挑战。本文探讨了利用效率 normal distration (<斜体xmlns:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> nd )用于在云环境中拆分和处理大卷医疗数据的方法,其可以在分割数据集中提供代表性信息。基于ND的新模型由两个阶段组成。第一阶段采用<斜体xmlns:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”> nd 用于大数据集分裂和处理的方法,可以减少数据集的体积。第二阶段在云计算基础架构中实现基于ND的模型,用于分配分割数据集。实验结果表明,在传统方法中,在不将数据分成小隔板的情况下,所提出的方法的实质性效率。基于ND的方法可以生成代表性数据集,可以提供大量数据处理的有效解决方案。可以在云计算环境中并行处理分割数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号