【24h】

GraphSSD: Graph Semantics Aware SSD

机译:GraphSSD:图形语义感知SSD

获取原文

摘要

Graph analytics play a key role in a number of applications such as social networks, drug discovery, and recommendation systems. Given the large size of graphs that may exceed the capacity of the main memory, application performance is bounded by storage access time. Out-of-core graph processing frameworks try to tackle this storage access bottleneck through techniques such as graph sharding, and sub-graph partitioning. Even with these techniques, the need to access data across different graph shards or sub-graphs causes storage systems to become a significant performance hurdle. In this paper, we propose a graph semantic aware solid state drive (SSD) framework, called GraphSSD, which is a full system solution for storing, accessing, and performing graph analytics on SSDs. Rather than treating storage as a collection of blocks, GraphSSD considers graph structure while deciding on graph layout, access, and update mechanisms. GraphSSD replaces the conventional logical to physical page mapping mechanism in an SSD with a novel vertex- to-page mapping scheme and exploits the detailed knowledge of the flash properties to minimize page accesses. GraphSSD also supports efficient graph updates (vertex and edge modifications) by minimizing unnecessary page movement overheads. GraphSSD provides a simple programming interface that enables application developers to access graphs as native data in their applications, thereby simplifying the code development. It also augments the NVMe (non-volatile memory express) interface with a minimal set of changes to map the graph access APIs to appropriate storage access mechanisms. Our evaluation results show that the GraphSSD framework improves the performance by up to 1.85× for the basic graph data fetch functions and on average 1.40×, 1.42×, 1.60×, 1.56×, and 1.29× for the widely used breadth-first search, connected components, random-walk, maximal independent set, and page rank applications, respectively.
机译:图分析在社交网络,药物发现和推荐系统等许多应用中起着关键作用。由于图形的大尺寸可能超过主内存的容量,因此应用程序性能受存储访问时间的限制。核外图处理框架试图通过图分片和子图分区等技术来解决此存储访问瓶颈。即使使用这些技术,跨不同图分片或子图访问数据的需求也导致存储系统成为重要的性能障碍。在本文中,我们提出了一种名为GraphSSD的图语义感知固态驱动器(SSD)框架,该框架是用于在SSD上存储,访问和执行图分析的完整系统解决方案。 GraphSSD不是将存储视为块的集合,而是在决定图的布局,访问和更新机制时考虑图的结构。 GraphSSD用新颖的顶点到页面的映射方案代替了SSD中传统的逻辑到物理页面的映射机制,并利用了闪存属性的详细知识来最大程度地减少了页面访问。 GraphSSD还通过最小化不必要的页面移动开销来支持有效的图形更新(顶点和边缘修改)。 GraphSSD提供了一个简单的编程接口,使应用程序开发人员可以在他们的应用程序中将图形作为本地数据访问,从而简化了代码开发。它还通过最少的更改集来增强NVMe(非易失性存储器表示)接口,以将图形访问API映射到适当的存储访问机制。我们的评估结果表明,对于基本的图形数据获取功能,GraphSSD框架将性能提高了1.85倍,对于广泛使用的广度优先搜索,其性能分别提高了1.40倍,1.42倍,1.60倍,1.56倍和1.29倍,连接组件,随机游走,最大独立集和页面排名应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号