首页> 外文学位 >Towards scalable and privacy-preserving integration of distributed heterogeneous data.
【24h】

Towards scalable and privacy-preserving integration of distributed heterogeneous data.

机译:致力于分布式异构数据的可伸缩性和隐私保护集成。

获取原文
获取原文并翻译 | 示例

摘要

With the trend of cloud computing, data and computing are moved away from desktop and are instead provided as a service from the cloud. Data-as-a-service enables access to a wealth of data across distributed and heterogeneous data sources in the cloud. It remains a challenge, however, to ensure the privacy, interoperability, and scalability for such services.;We designed and developed DObjects, a general-purpose P2P-based query and data operations infrastructure that can be deployed in the cloud and provides access to heterogeneous data sources. The system builds on top of a distributed mediator-wrapper architecture where individual system nodes serve as mediators and/or wrappers and interact with each other in a P2P fashion. As an analogy, the system nodes can be considered as droplets, small elements that provide similar functionality in the cloud. Just as thousands or millions of droplets form a single drop in nature, in cloud computing, groups of droplets that provide similar functionality can form a micro-cloud. Micro-clouds are an integral part of the whole cloud computing system.;The dissertation also discusses the novel dynamic query execution engine within the data query infrastructure that dynamically adapts to network and node conditions. The query processing is capable of fully benefiting from all the distributed resources to minimize the query response time and maximize system throughput. In addition to leveraging the traditional distributed query optimization techniques, the (sub)queries are deployed on droplets in a dynamic and iterative manner in order to guarantee the best reaction to network and resource dynamics.;Finally, the dissertation presents an extension to the basic DObjects model that enables access to private data that is distributed and needs anonymization. The extension enables droplets to form virtual groups to addresses two privacy issues for the sensitive data: privacy of data subjects and confidentiality of data providers. The dissertation discusses decentralized protocols that enable data sharing for horizontally partitioned databases given these constraints. Concretely, given a query spanning multiple databases, the query results do not contain individually identifiable information. In addition, institutions do not reveal their databases to each other apart from the query results.
机译:随着云计算的趋势,数据和计算已从台式机移开,而是从云中作为服务提供。数据即服务使您可以跨云中的分布式和异构数据源访问大量数据。但是,确保此类服务的隐私性,互操作性和可伸缩性仍然是一个挑战。我们设计和开发了DObjects,这是一种通用的基于P2P的查询和数据操作基础架构,可以部署在云中并提供对异构数据源。该系统建立在分布式介体包装器体系结构的顶部,在该体系结构中,各个系统节点充当介体和/或包装器,并以P2P方式相互交互。打个比方,系统节点可以看作是液滴,是在云中提供类似功能的小元素。就像成千上万的液滴形成自然界中的单个液滴一样,在云计算中,提供相似功能的液滴组可以形成微云。微型云是整个云计算系统不可或缺的部分。本文还讨论了数据查询基础架构中动态适应网络和节点条件的新型动态查询执行引擎。查询处理能够充分利用所有分布式资源,从而最大程度地减少查询响应时间并最大化系统吞吐量。除了利用传统的分布式查询优化技术之外,(子)查询还以动态和迭代的方式部署在小滴上,以确保对网络和资源动态的最佳响应。最后,本文提出了对基础查询的扩展。 DObjects模型,可以访问已分发的需要匿名的私有数据。该扩展使液滴能够形成虚拟组,以解决敏感数据的两个隐私问题:数据主体的隐私和数据提供者的保密。本文讨论了分散的协议,这些协议可以在给定这些约束的情况下实现对水平分区数据库的数据共享。具体而言,给定一个查询跨越多个数据库,查询结果不包含可单独识别的信息。此外,除了查询结果外,机构不会相互公开其数据库。

著录项

  • 作者

    Jurczyk, Pawel.;

  • 作者单位

    Emory University.;

  • 授予单位 Emory University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 154 p.
  • 总页数 154
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号