
A Node Selection Method for the Ceph Storage System Based on Software-Defined Networking and Multi-Attribute Decision-Making

Abstract

Traditional storage models cannot meet the demands that massive data storage places on capacity scalability, data reliability, and performance, and cloud storage systems emerged against this background. Cloud storage systems use distributed file systems and related technologies to assemble different storage devices into a resource pool over network connections and to provide storage services in a unified way, offering high scalability and high reliability. Among the many distributed file systems used for cloud storage, the Ceph distributed file system has attracted wide attention and adoption in industry and scientific research because it is open source and provides a unified storage capability.

The data distribution strategy is a key technology in a distributed file system: it determines where data is stored and affects the system's load balancing and fault tolerance. CRUSH is the pseudo-random data distribution algorithm in Ceph; it efficiently distributes data objects and their replicas across large-scale, heterogeneous, hierarchically structured storage clusters. However, when the classic open-source Ceph system selects storage nodes in replication mode, its CRUSH algorithm uses node storage capacity as the sole selection criterion. It ignores the load on both the network and the individual nodes, which degrades read and write performance when network performance is poor or nodes are heavily loaded. Incorporating measurements of network state and node load into the CRUSH algorithm is therefore important for improving load balance. In a traditional network architecture, however, obtaining the network state requires cumbersome configuration and a large measurement overhead.

To address these problems, this paper designs a Ceph cloud storage system model based on software-defined networking (SDN), together with a storage node selection strategy. First, network and node load status are obtained in real time via SDN, which simplifies network configuration and reduces measurement overhead. SDN separates the control plane from the data plane; its centralized control plane simplifies network measurement and management and provides a flexible and efficient maintenance strategy, so SDN is adopted to monitor network and node load. Second, a multi-attribute decision-making model that jointly considers multiple factors is established and solved to determine where data is stored. This targets the load imbalance among storage nodes that arises because the CRUSH algorithm uses storage capacity as its only constraint: the improved CRUSH algorithm adds network state and load to the node weight factor, so that the weight factor is determined at a finer granularity.

The proposed method was evaluated with read and write tests in a real environment. The results show that, compared with the existing CRUSH algorithm, it keeps write performance on par with the original Ceph system while clearly improving throughput when reading small files and response time when reading large files.
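The abstract does not give the decision model itself, so the following is only a minimal sketch of the general idea, assuming simple additive weighting (a common multi-attribute decision-making technique, not necessarily the one used in the paper). The attribute names, the weights, the node values, and the helper names `NodeMetrics`, `score_nodes`, and `select_replica_nodes` are all hypothetical; in the system described above the metrics would be collected through the SDN controller rather than hard-coded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeMetrics:
    """Hypothetical per-node measurements (illustrative values only)."""
    name: str
    free_capacity_gb: float     # benefit attribute: larger is better
    link_bandwidth_mbps: float  # benefit attribute: larger is better
    load: float                 # cost attribute: smaller is better (0..1)


def normalize(values, benefit):
    """Min-max normalize to [0, 1]; cost attributes are inverted."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All nodes are equal on this attribute, so it cannot discriminate.
        return [1.0] * len(values)
    if benefit:
        return [(v - lo) / (hi - lo) for v in values]
    return [(hi - v) / (hi - lo) for v in values]


def score_nodes(nodes, weights):
    """Return {node name: weighted sum of normalized attributes}."""
    columns = {
        "capacity": normalize([n.free_capacity_gb for n in nodes], True),
        "bandwidth": normalize([n.link_bandwidth_mbps for n in nodes], True),
        "load": normalize([n.load for n in nodes], False),
    }
    return {
        node.name: sum(weights[key] * columns[key][i] for key in columns)
        for i, node in enumerate(nodes)
    }


def select_replica_nodes(nodes, weights, replicas):
    """Pick the `replicas` best-scoring nodes (ties broken by name)."""
    scores = score_nodes(nodes, weights)
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:replicas]]


# Assumed example data: three storage nodes and illustrative weights.
nodes = [
    NodeMetrics("osd.0", 500.0, 1000.0, 0.9),
    NodeMetrics("osd.1", 400.0, 1000.0, 0.2),
    NodeMetrics("osd.2", 100.0, 100.0, 0.1),
]
weights = {"capacity": 0.4, "bandwidth": 0.3, "load": 0.3}

print(select_replica_nodes(nodes, weights, replicas=2))
```

With these assumed values, a capacity-only ranking would put `osd.0` first, whereas the weighted score ranks the lightly loaded `osd.1` ahead of it. That is the kind of shift away from capacity-only selection that the abstract argues for.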

Bibliographic information

  • Source
    Chinese Journal of Computers (《计算机学报》) | 2019, No. 2 | pp. 323-338 | 16 pages
  • Author affiliations

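    School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    Key Laboratory of Cognitive Radio and Information Processing (Ministry of Education), Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    Key Laboratory of Cognitive Radio and Information Processing (Ministry of Education), Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    College of Information Science and Engineering, Guilin University of Technology, Guilin, Guangxi 541004;

    School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    Key Laboratory of Cognitive Radio and Information Processing (Ministry of Education), Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    Key Laboratory of Cognitive Radio and Information Processing (Ministry of Education), Guilin University of Electronic Technology, Guilin, Guangxi 541004;

    School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, Guangxi 541004;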

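  • Original format: PDF
  • Language of text: Chinese
  • Chinese Library Classification: Artificial intelligence theory;
  • Keywords

    software-defined networking; Ceph storage system; multi-attribute decision-making; replication mode; weight factor;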
