Title: Parallelization and performance analysis of the Cooley-Tukey FFT algorithm for shared-memory architectures

Journal Article · IEEE Trans. Comput.; (United States)

The authors present a study of the parallelization of the Cooley-Tukey radix-2 FFT algorithm for MIMD (nonvector) architectures. Parallel algorithms are presented for one-dimensional and multidimensional Fourier transforms. From instruction traces obtained by executing Fortran kernels derived from their algorithms, they determined the precise instructions to be executed by each processor in the parallel system. They used these instruction traces to predict the performance of the IBM Research Parallel Processing Prototype (RP3) when computing FFTs. The performance results are presented as graphs in the paper.
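
For readers unfamiliar with the technique, the following is a minimal Python sketch of why the radix-2 Cooley-Tukey FFT suits MIMD machines: every stage consists of n/2 butterflies that touch disjoint pairs of elements, so they can be divided among processors with a synchronization point between stages. It is not the authors' Fortran kernel and does not model the RP3. The names `fft_radix2`, `num_procs`, and `dft_naive` are hypothetical, the cyclic assignment of butterflies to processors is an assumption chosen for simplicity, and the "processors" are simulated by an ordinary sequential loop.

```python
import cmath


def bit_reverse_permute(x):
    """Return a copy of x reordered by bit-reversed index."""
    n = len(x)
    bits = n.bit_length() - 1
    out = [0j] * n
    for i in range(n):
        rev = int(format(i, f"0{bits}b")[::-1], 2) if bits else 0
        out[rev] = x[i]
    return out


def fft_radix2(x, num_procs=4):
    """Iterative radix-2 decimation-in-time FFT.

    Within a stage the n/2 butterflies are independent, so they are
    dealt out cyclically to num_procs hypothetical processors. The
    processors are simulated one after another here; on a real
    shared-memory machine each would run concurrently, with a barrier
    at the end of every stage.
    """
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    a = bit_reverse_permute([complex(v) for v in x])
    half = 1  # half the butterfly group size in the current stage
    while half < n:
        for p in range(num_procs):  # simulated processor id
            for k in range(p, n // 2, num_procs):  # butterflies owned by p
                group, j = divmod(k, half)
                i = group * 2 * half + j
                w = cmath.exp(-2j * cmath.pi * j / (2 * half))  # twiddle factor
                t = w * a[i + half]
                a[i + half] = a[i] - t
                a[i] = a[i] + t
        # a barrier between stages would go here on a parallel machine
        half *= 2
    return a


def dft_naive(x):
    """Direct O(n^2) DFT, used only as a reference for checking."""
    n = len(x)
    return [
        sum(x[m] * cmath.exp(-2j * cmath.pi * k * m / n) for m in range(n))
        for k in range(n)
    ]
```

Because each butterfly reads and writes only its own pair of elements, the result does not depend on `num_procs`; comparing `fft_radix2(x, 1)` with `fft_radix2(x, 4)` and with `dft_naive(x)` is a quick sanity check.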

Research Organization: IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598
OSTI ID: 6595452
Journal Information: IEEE Trans. Comput.; (United States), Vol. C-36:5
Country of Publication: United States
Language: English