Approaching the Theoretical Limits of a Mesh NoC with a 16-Node Chip Prototype in 45nm SOI
Sunghyun Park, Tushar Krishna, Chia-Hsin Owen Chen, Bhavya Daya, Anantha P. Chandrakasan, and Li-Shiuan Peh
In this paper, we present a case study of our chip prototype of a
16-node 4x4 mesh NoC fabricated in 45nm SOI CMOS that aims
to simultaneously optimize energy-latency-throughput for unicasts,
multicasts and broadcasts. We first define and analyze the theoretical
limits of a mesh NoC in latency, throughput and energy,
then describe how we approach these limits through a combination
of microarchitecture and circuit techniques. Our 1.1V 1GHz
NoC chip achieves 1-cycle router-and-link latency at each hop and
energy-efficient router-level multicast support, delivering 892Gb/s
(87.1% of the theoretical bandwidth limit) at 531.4mW for a mixed
traffic of unicasts and broadcasts. Through this fabrication, we derive
insights that help guide our research, and we believe, will also
be useful to the NoC and multicore research community.