Summary: Automatic Thread Extraction with Decoupled Software Pipelining
Guilherme Ottoni, Ram Rangan, Adam Stoler, David I. August
Departments of Computer Science and Electrical Engineering
Princeton University
{ottoni, ram, astoler, august}@princeton.edu
Abstract
Until recently, a steadily rising clock rate and other
uniprocessor microarchitectural improvements could be relied
upon to consistently deliver increasing performance for
a wide range of applications. Current difficulties in maintaining
this trend have led microprocessor manufacturers
to add value by incorporating multiple processors on a chip.
Unfortunately, since decades of compiler research have not
succeeded in delivering automatic threading for prevalent
code properties, this approach demonstrates no improvement
for a large class of existing codes.
To find useful work for chip multiprocessors, we propose
an automatic approach to thread extraction, called Decoupled
Software Pipelining (DSWP). DSWP exploits the fine-grained
pipeline parallelism lurking in most applications to