论文标题

扩展您的内存加速器:利用基于AIMC的CNN推理的无线芯片通信

Scale up your In-Memory Accelerator: Leveraging Wireless-on-Chip Communication for AIMC-based CNN Inference

论文作者

Bruschi, Nazareno, Tagliavini, Giuseppe, Conti, Francesco, Abadal, Sergi, Cabellos-Aparicio, Alberto, Alarcón, Eduard, Karunaratne, Geethan, Boybat, Irem, Benini, Luca, Rossi, Davide

论文摘要

模拟内存计算(AIMC)作为异质计算的颠覆性范式出现,潜在地传递了比传统的数字信号处理体系结构在矩阵矢量乘数上更好的峰值性能和效率。但是,为了在现实世界中维持这种吞吐量,必须向AIMC图块提供非常高的带宽和低潜伏期的数据;这对片上通信基础设施构成了前所未有的压力,这成为系统的性能和效率瓶颈。在这种情况下,新兴的片上无线通信范式的性能和可塑性为大型AIMC设备中的片上进行了挑战所需的突破。这项工作提供了一个多层AIMC体系结构,其中包含瓷砖间的无线通信,该构建集成了多个异质计算簇,嵌入了平行的RISC-V内核和AIMC瓷砖的混合物。我们对拟议的体系结构进行了广泛的设计空间探索,并讨论了利用新出现的片上通信技术的好处,例如毫米波和Terahertz乐队中的无线收发器。

Analog In-Memory Computing (AIMC) is emerging as a disruptive paradigm for heterogeneous computing, potentially delivering orders of magnitude better peak performance and efficiency over traditional digital signal processing architectures on Matrix-Vector multiplication. However, to sustain this throughput in real-world applications, AIMC tiles must be supplied with data at very high bandwidth and low latency; this poses an unprecedented pressure on the on-chip communication infrastructure, which becomes the system's performance and efficiency bottleneck. In this context, the performance and plasticity of emerging on-chip wireless communication paradigms provide the required breakthrough to up-scale on-chip communication in large AIMC devices. This work presents a many-tile AIMC architecture with inter-tile wireless communication that integrates multiple heterogeneous computing clusters, embedding a mix of parallel RISC-V cores and AIMC tiles. We perform an extensive design space exploration of the proposed architecture and discuss the benefits of exploiting emerging on-chip communication technologies such as wireless transceivers in the millimeter-wave and terahertz bands.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源