Projects per year
Abstract
In order to follow the ever-growing computational complexity and data intensity of state-of-the-art AI models, new computing paradigms are being proposed. These paradigms aim at achieving high energy efficiency by mitigating the Von Neumann bottleneck that relates to the energy cost of moving data between the processing cores and the memory. Convolutional Neural Networks (CNNs) are susceptible to this bottleneck, given the massive data they have to manage. Systolic arrays (SAs) are promising architectures to mitigate data transmission cost, thanks to high data utilization of Processing Elements (PEs). These PEs continuously exchange and process data locally based on specific dataflows (such as weight stationary and row stationary), in turn reducing the number of memory accesses to the main memory. In SAs, convolutions are managed either as matrix multiplications or exploiting the raster-order scan of sliding windows. However, data redundancy is a primary concern affecting area, power, and energy. In this paper, we propose TrIM: a novel dataflow for SAs based on a Triangular Input Movement and compatible with CNN computing. TrIM maximizes the local input utilization, minimizes the weight data movement, and solves the data redundancy problem. Furthermore, TrIM does not incur the significant on-chip memory penalty introduced by the row stationary dataflow. When compared to state-of-the-art SA dataflows, the high data utilization offered by TrIM guarantees $\sim 10\times$ less memory access. Furthermore, considering that PEs continuously overlap multiplications and accumulations, TrIM achieves high throughput (up to 81.8% higher than row stationary), other than requiring a limited number of registers (up to $15.6\times$ fewer registers than row stationary).
| Original language | English |
|---|---|
| Article number | 11089493 |
| Pages (from-to) | 197-210 |
| Number of pages | 14 |
| Journal | IEEE Transactions on Circuits and Systems for Artificial Intelligence |
| Volume | 2 |
| Issue number | 3 |
| Early online date | 22 Jul 2025 |
| DOIs | |
| Publication status | E-pub ahead of print - 22 Jul 2025 |
Keywords / Materials (for Non-textual outputs)
- Artificial intelligence
- convolutional neural networks
- systolic arrays
- weight stationary
- data utilization
- memory accesses
Fingerprint
Dive into the research topics of 'TrIM, Triangular Input Movement Systolic Array for Convolutional Neural Networks: Dataflow and Analytical Modelling'. Together they form a unique fingerprint.-
AI for Productive Research & Innovation in eLectronics (APRIL)
Prodromakis, T. (Principal Investigator), King, R. (Co-investigator), O'Boyle, M. (Co-investigator) & Ramamoorthy, R. (Co-investigator)
Engineering and Physical Sciences Research Council
1/02/24 → 31/01/29
Project: Research
-
AI MeTLLE: Memristive Technologies for Lifelong Learning Embedded AI Hardware (AI MeTLLE)
Prodromakis, T. (Principal Investigator)
1/05/22 → 31/12/29
Project: Research
-
FORTE: Functional Oxide Reconfigurable Technologies (FORTE): A Programme Grant
Koch, D. (Principal Investigator), Prodromakis, T. (Principal Investigator), Constandinou, T. G. (Co-investigator), Dudek, P. (Co-investigator) & Papavassiliou, C. (Co-investigator)
Engineering and Physical Sciences Research Council
1/05/22 → 30/03/25
Project: Research