TY - JOUR
T1 - Modeling, validation, and co-design of IBM Blue Gene/Q
T2 - Tools and examples
AU - IBM Blue Gene Team
AU - Asaad, Sameh W.
AU - Bellofatto, Ralph
AU - Brezzo, Bernard
AU - Brunheroto, Jose
AU - Chen, Dong
AU - Cher, Chen-Yong
AU - Chung, I-Hsin
AU - Coteus, Paul W.
AU - Eichenberger, Alexandre E.
AU - Gschwind, Michael K.
AU - Gunnels, John A.
AU - Haring, Ruud A.
AU - Haymes, Charles L.
AU - Heidelberger, Philip
AU - Janssen, Geert
AU - Kapur, Mohit
AU - Nair, Indira
AU - Ohmacht, Alda S.
AU - Ohmacht, Martin
AU - Parker, Benjamin J.
AU - Roewer, Thomas
AU - Saha, Proshanta K.
AU - Sugavanam, Krishnan
AU - Takken, Todd
AU - Tsao, Michael M.
AU - Christ, Norman H.
AU - Boyle, Peter A.
AU - Gara, Alan
AU - Tierno, Jose A.
AU - Wisniewski, Robert W.
PY - 2013
Y1 - 2013
N2 - Major architectural innovations in the compute node have been introduced in the IBM Blue Gene (R)/Q, including programmable Level 1 (L1) cache data prefetching units to hide memory access latency, hardware support for transactional memory (TM) and speculative execution (SE), an enhanced five-dimensional integrated torus network, and a high-performance quad floating-point SIMD (single-instruction, multiple-data) unit. In this paper, we present the tools and methodology that we used to model, co-design, and validate these new features from early concept phase through design implementation. Early in the design cycle, we made extensive use of an architectural simulator, BGQSim, capable of executing unmodified binary Blue Gene/Q code for single as well as multiple nodes. As the hardware description language for the chip implementation became available, we complemented BGQSim with a cycle-accurate and cycle-reproducible, large-scale field-programmable gate array-based platform, Twinstar, to validate the implementation against performance targets and functional specifications. Through specific examples, we show the effectiveness of these tools in co-developing the hardware and software of Blue Gene/Q, allowing us to meet the design targets at an aggressive project schedule.
AB - Major architectural innovations in the compute node have been introduced in the IBM Blue Gene (R)/Q, including programmable Level 1 (L1) cache data prefetching units to hide memory access latency, hardware support for transactional memory (TM) and speculative execution (SE), an enhanced five-dimensional integrated torus network, and a high-performance quad floating-point SIMD (single-instruction, multiple-data) unit. In this paper, we present the tools and methodology that we used to model, co-design, and validate these new features from early concept phase through design implementation. Early in the design cycle, we made extensive use of an architectural simulator, BGQSim, capable of executing unmodified binary Blue Gene/Q code for single as well as multiple nodes. As the hardware description language for the chip implementation became available, we complemented BGQSim with a cycle-accurate and cycle-reproducible, large-scale field-programmable gate array-based platform, Twinstar, to validate the implementation against performance targets and functional specifications. Through specific examples, we show the effectiveness of these tools in co-developing the hardware and software of Blue Gene/Q, allowing us to meet the design targets at an aggressive project schedule.
U2 - 10.1147/JRD.2012.2227577
DO - 10.1147/JRD.2012.2227577
M3 - Article
SN - 0018-8646
VL - 57
JO - Ibm journal of research and development
JF - Ibm journal of research and development
IS - 1-2
M1 - 6
ER -