In multivector processors, the lost cycles due to conflicts between concurrent vector streams make the effective throughput be lower than the peak throughput. When the request rate of all the concurrent vector streams to every memory module is less than or equal to the service rate, conflicts appear because concurrent vector streams reference memory modules in different orders. In addition, in a memory system where several memory modules are mapped in every bus (complex memory system) bus conflicts are added to memory module conflicts. This paper proposes an access order to the vector stream elements that reduces the average memory access time in vector processors with complex memory systems. When request rate is greater than the service rate, the proposed order reduces the numbe of lost cycles, and the effective throughput increases. In other cases, the effective throughput reach the peak throughput. The proposed order generates the memory references in such a way that the memory modules shared by the concurrent self-conflict-free vector streams, and the sections where memory modules are mapped, are referenced using the same order.
在多向量处理器中,由于并发向量流之间的冲突而导致的丢失周期使有效吞吐量低于峰值吞吐量。当所有并发矢量流对每个内存模块的请求速率小于或等于服务速率时,会出现冲突,因为并发矢量流以不同顺序引用内存模块。另外,在每个总线上都映射有多个内存模块的内存系统中(复杂内存系统),总线冲突会添加到内存模块冲突中。本文提出了对矢量流元素的访问顺序,以减少具有复杂存储系统的矢量处理器中的平均内存访问时间。当请求速率大于服务速率时,建议的订单将减少丢失周期的数量,并且有效吞吐量会增加。在其他情况下,有效吞吐量达到峰值吞吐量。拟议的顺序生成内存引用的方式是,以相同的顺序引用由并发的无冲突矢量流共享的内存模块以及映射内存模块的部分。 P>
机译:向量处理器系统中交错存储器的有效带宽
机译:基于多核处理器的系统的分析内存带宽模型
机译:MEG:基于RISC的系统仿真基础设施,用于使用FPGA和高带宽内存的近数据处理
机译:增加多向量处理器中复杂内存系统的有效带宽
机译:河流过程与河流植被之间越来越大的互殖相互作用和系统复杂性
机译:视觉记忆任务中复杂几何形式的大脑处理会增加P2幅度
机译:Eager Combining:用于提高共享内存多处理器中有效网络和内存带宽的一致性协议
机译:Eager Combining:用于提高共享内存多处理器中有效网络和内存带宽的一致性协议。