首页> 外文期刊>ACM transactions on computer systems >SPIN: Seamless Operating System Integration of Peer-to-Peer DMA Between SSDs and CPUs
【24h】

SPIN: Seamless Operating System Integration of Peer-to-Peer DMA Between SSDs and CPUs

机译:旋转:SSD和CPU之间的点对点DMA的无缝操作系统集成

获取原文
获取原文并翻译 | 示例

摘要

Recent GPUs enable Peer-to-Peer Direct Memory Access (p2p) from fast peripheral devices like NVMe SSDs to exclude the CPU from the data path between them for efficiency. Unfortunately, using p2p to access files is challenging because of the subtleties of low-level non-standard interfaces, which bypass the OS file I/O layers and may hurt system performance. Developers must possess intimate knowledge of low-level interfaces to manually handle the subtleties of data consistency and misaligned accesses.We present SPIN, which integrates p2p into the standard OS file I/O stack, dynamically activating p2p where appropriate, transparently to the user. It combines p2p with page cache accesses, re-enables read-ahead for sequential reads, all while maintaining standard POSIX FS consistency, portability across GPUs and SSDs, and compatibility with virtual block devices such as software RAID.We evaluate SPIN on NVIDIA and AMD GPUs using standard file I/O benchmarks, application traces, and end-to-end experiments. SPIN achieves significant performance speedups across a wide range of workloads, exceeding p2p throughput by up to an order of magnitude. It also boosts the performance of an aerial imagery rendering application by 2.6x by dynamically adapting to its input-dependent file access pattern, enables 3.3x higher throughput for a GPU-accelerated log server, and enables 29% faster execution for the highly optimized GPU-accelerated image collage with only 30 changed lines of code.
机译:最近的GPU使等待对等的直接内存访问(P2P)从像NVME SSD这样的快速外围设备,以将CPU从它们之间的数据路径排除以获得效率。不幸的是,使用P2P访问文件是具有挑战性的,因为低级非标准接口的微妙之处,它绕过OS文件I / O层,可能会损坏系统性能。开发人员必须拥有对低级接口的亲密知识,可以手动处理数据一致性和未对准的访问的微妙之处。我们存在旋转,该旋转将P2P集成到标准OS文件I / O堆栈中,在适当的情况下动态激活P2P,透明地向用户透明地激活P2P。它将P2P与页面缓存访问权限组合,重新启用了顺序读取的重新启动,同时维护标准POSIX FS一致性,GPU和SSD的可移植性,以及软件RAID诸如软件RAID的虚拟块设备的兼容性。在NVIDIA和AMD上旋转旋转GPU使用标准文件I / O基准测试,应用迹线和端到端实验。旋转在各种工作负载中实现了显着的性能加速度,超过P2P吞吐量达到大量数量级。它还通过动态调整到其输入依赖的文件访问模式,增强了2.6倍的空中图像渲染应用程序的性能,为GPU加速日志服务器启用了3.3x较高的吞吐量,并为高度优化的GPU执行29% - 只有30个改变的代码线条的图像拼贴。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号