Abstract / Description of output
This paper presents a new graph theoretic approach to minimising
the number of barriers in parallelised programs. A
simple procedure to reduce the complexity of barrier placement,
without athscting optimality, is developed. A new
algorithm is then presented which places provably the minimal
number of barriers in perfect loop nests. This technique
is extended so as to place the minimal number of barriers
in certain imperfect loop nest structures. This scheme is
generalised to accept entire programs and implemented in a
prototype parallelising compiler where it has been applied
to several well-known benchmarks and shown to place significantly
fewer synchronisation points than an existing commercial
compiler.
Original language | English |
---|---|
Title of host publication | ICS '97 Proceedings of the 11th international conference on Supercomputing |
Publisher | ACM |
Pages | 156-163 |
Number of pages | 8 |
ISBN (Print) | 0-89791-902-5 |
DOIs | |
Publication status | Published - 1997 |