Measuring the parallelism available for very long instruction word architectures
Long instruction word architectures, such as attached scientific processors and horizontally microcoded CPUs, are a popular means of obtaining code speedup via fine-grained parallelism. The falling cost of hardware holds out the hope of using these architectures to exploit much more parallelism. That hope has been dampened, however, by experiments measuring how much parallelism is available in the code to begin with; they implied that, even with infinite hardware, long instruction word architectures could not speed up real programs by more than a factor of 2 or 3. Those experiments measured only the parallelism within basic blocks. Given the machines that prompted them, it made no sense to measure anything else. Now it does: a recently developed code compaction technique, called trace scheduling, can exploit parallelism among operations even hundreds of blocks apart. Does such parallelism exist? In this paper we show that it does. We ran analogous experiments but disregarded basic block boundaries, and we found huge amounts of parallelism available. Our measurements were made on standard Fortran programs in common use. The programs tested averaged about a factor of 90 parallelism, ranging from about a factor of 4 to virtually unlimited amounts, restricted only by the size of the data.
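The measurement the abstract describes can be sketched in a few lines. This is an illustrative reconstruction, not the paper's actual instrumentation: under an ideal-machine assumption (infinite functional units, unit latency, only true data dependences constrain ordering), "available parallelism" is the total number of operations in an execution trace divided by the length of the critical path through the trace's data-dependence graph. The `trace` format and function name here are hypothetical.

```python
def available_parallelism(trace):
    """Estimate ideal-machine parallelism for an execution trace.

    trace: list of (dest, sources) pairs in execution order, where dest is
    the value an operation writes and sources are the values it reads.
    Returns total operations / critical-path length, assuming unit latency
    and unlimited hardware (only true dependences serialize operations).
    """
    ready = {}           # value name -> earliest cycle that value is available
    critical_path = 0
    for dest, sources in trace:
        # An operation may issue once all of its inputs are ready.
        start = max((ready.get(s, 0) for s in sources), default=0)
        ready[dest] = start + 1
        critical_path = max(critical_path, ready[dest])
    return len(trace) / critical_path if critical_path else 0.0
```

Three independent operations yield a parallelism of 3.0, while a three-operation dependence chain yields 1.0; note that because the trace crosses branches freely, this metric is not limited by basic block boundaries.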
- Research Organization:
- Department of Computer Science, Cornell University, Ithaca, NY
- OSTI ID:
- 6422509
- Journal Information:
- IEEE Trans. Comput. (United States), Vol. C-33:11; CODEN: ITCOB
- Country of Publication:
- United States
- Language:
- English
Similar Records
Enhancing instruction scheduling with a block-structured ISA
Computer systems architecture at Yale: the enormous longword instruction (ELI) machine, progress and research plans