In this paper, the performance of a wormhole-routed 2-D torus network with virtual channels is evaluated for cache-coherent shared-memory multiprocessors using execution-driven simulation. The traffic in such systems is very different from the traffic in a message-passing environment. We show the impact of the number of virtual channels, the number of flit buffers per virtual channel, and the number of internal links. The study shows that networks. The number of flit buffers per virtual channel has a considerable impact, and 2 to 4 flit buffers are usually enough. The number of internal links makes a difference in performance for applications, such as MP3D, that generate heavy contention for shared variables.