TY - GEN
T1 - UGEMM: Unary Computing Architecture for GEMM Applications
AU - Wu, Di
AU - Li, Jingjie
AU - Yin, Ruokai
AU - Hsiao, Hsuan
AU - Kim, Younghyun
AU - Miguel, Joshua San
PY - 2020/7/13
Y1 - 2020/7/13
N2 - General matrix multiplication (GEMM) is universal in various applications, such as signal processing, machine learning, and computer vision. Conventional GEMM hardware architectures based on binary computing exhibit low area and energy efficiency as they scale due to the spatial nature of number representation and computing. Unary computing, on the other hand, can be performed with extremely simple processing units, often just with a single logic gate. But currently there exist no efficient architectures for unary GEMM.In this paper, we present uGEMM, an area- and energy-efficient unary GEMM architecture enabled by novel arithmetic units. The proposed design relaxes previously-imposed constraints on input bit streams - low correlation and long stream length - and achieves superior area and energy efficiency over existing unary systems. Furthermore, uGEMM's output bit streams exhibit higher accuracy and faster convergence, enabling dynamic energy-accuracy scaling on resource-constrained systems.
AB - General matrix multiplication (GEMM) is universal in various applications, such as signal processing, machine learning, and computer vision. Conventional GEMM hardware architectures based on binary computing exhibit low area and energy efficiency as they scale due to the spatial nature of number representation and computing. Unary computing, on the other hand, can be performed with extremely simple processing units, often just with a single logic gate. But currently there exist no efficient architectures for unary GEMM.In this paper, we present uGEMM, an area- and energy-efficient unary GEMM architecture enabled by novel arithmetic units. The proposed design relaxes previously-imposed constraints on input bit streams - low correlation and long stream length - and achieves superior area and energy efficiency over existing unary systems. Furthermore, uGEMM's output bit streams exhibit higher accuracy and faster convergence, enabling dynamic energy-accuracy scaling on resource-constrained systems.
UR - https://www.scopus.com/pages/publications/85092006238
U2 - 10.1109/ISCA45697.2020.00040
DO - 10.1109/ISCA45697.2020.00040
M3 - Conference contribution
AN - SCOPUS:85092006238
T3 - Proceedings - International Symposium on Computer Architecture
SP - 377
EP - 390
BT - Proceedings - 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture, ISCA 2020
PB - Institute of Electrical and Electronics Engineers
T2 - 47th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2020
Y2 - 30 May 2020 through 3 June 2020
ER -