首页> 外文学位 >Exploiting parallel memory hierarchies for ray casting volumes.
【24h】

Exploiting parallel memory hierarchies for ray casting volumes.

机译:利用并行内存层次结构进行射线投射。

获取原文
获取原文并翻译 | 示例

摘要

Previous work in single-processor ray casting methods for volume rendering has concentrated on algorithmic optimizations to reduce computational work. This approach leaves untapped the performance gains which are possible through efficient exploitation of the memory hierarchy.; Previous work in parallel volume rendering has concentrated on parallel partitioning, with the goals of maximizing load balance and minimizing communication between distributed nodes. This implies a simplified view of the memory hierarchy of a parallel machine, ignoring the relationship between parallel partitioning and memory hierarchy effects at all but the top level.; In this thesis, we progressively develop methods to optimize memory hierarchy performance for ray casting: (1) on a uniprocessor, using algorithmic modifications to isolate cache miss costs, specialized hardware to monitor cache misses on the bus, and a software cache simulator; (2) on the a shared-memory Power Challenge multiprocessor, examining the fundamental dependence of algorithmic design decisions regarding parallel partitioning upon memory hierarchy effects at several levels; and (3) on a distributed array of interconnected Power Challenge multiprocessors, on which we implement a logical global address space for volume blocks, and investigate the tradeoff between replication (caching) and communication of data.; The methods we develop permit us to exploit the coherence found in volume rendering to increase memory locality, and thereby increase memory system performance. This focus on the optimal exploitation of the entire memory hierarchy, from the processor cache, to the interconnection network between distributed nodes, yields faster frame rates for large (357 MB to 1 GB) datasets than have been previously cited in the literature, and allows us to efficiently render a 7.1 GB dataset, the largest ever rendered.; Our results have implications for the parallel solution of other problems which, like ray casting, require a global gather operation, use an associative operator to combine partial results, and contain coherence. We discuss implications for the design of a parallel architecture suited to solving this class of problems, specifically, that these algorithms are best served by a deep memory hierarchy.
机译:用于体积渲染的单处理器射线投射方法的先前工作主要集中在算法优化上,以减少计算工作。这种方法未开发出可通过有效利用内存层次结构来实现的性能提升。并行卷渲染中的先前工作主要集中在并行分区上,其目标是最大化负载平衡并最小化分布式节点之间的通信。这意味着简化了并行机内存层次结构的视图,而忽略了除顶层之外的所有分区的并行分区与内存层次结构效果之间的关系。在本文中,我们逐步开发出方法来优化光线投射的内存层次结构性能:(1)在单处理器上,使用算法修改来隔离高速缓存未中成本,使用专用硬件监视总线上的高速缓存未中,以及软件高速缓存模拟器; (2)在共享内存的Power Challenge多处理器上,检查关于并行分区的算法设计决策在几个层次上对内存层次效应的基本依赖性; (3)在互连的Power Challenge多处理器的分布式阵列上,在该阵列上我们为卷块实现了逻辑全局地址空间,并研究了复制(缓存)和数据通信之间的权衡。我们开发的方法使我们能够利用体绘制中发现的一致性来增加内存局部性,从而提高内存系统性能。重点是从处理器缓存到分布式节点之间的互连网络的整个内存层次结构的最佳利用,与先前文献中所引用的相比,大型(357 MB至1 GB)数据集的帧速率更快,并且允许我们可以有效地渲染7.1 GB的数据集,这是有史以来最大的数据集。我们的结果对其他问题的并行解决方案有影响,例如射线投射,需要全局聚集操作,使用关联算子来组合部分结果并包含相干性。我们讨论了适合解决此类问题的并行体系结构设计的意义,特别是,这些算法最好由深度内存层次结构来提供。

著录项

  • 作者

    Palmer, Michael Edward.;

  • 作者单位

    California Institute of Technology.;

  • 授予单位 California Institute of Technology.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1997
  • 页码 138 p.
  • 总页数 138
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号