Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Performance analysis of the FFT algorithm on a shared-memory parallel architecture

Journal Article · · IBM J. Res. Dev.; (United States)
DOI:https://doi.org/10.1147/rd.314.0435· OSTI ID:5912399
This paper presents a model for the performance prediction of FFT algorithms executed on a shared-memory parallel computer consisting of N processors and the same number of memory modules. The model applies a deterministic analysis to estimate the communication delay through the interconnection network by assuming that all requests arrive at the network in bursts. The results indicate that the communication delay is significantly affected by the method applied to allocate data to memory modules. For the case in which all data items referenced by a processor during an iteration are allocated to a single memory module. The authors present the best and worst case.
Research Organization:
Digital Equipment Corp., 85 Swanson Road, Boxborough, MA 01719
OSTI ID:
5912399
Journal Information:
IBM J. Res. Dev.; (United States), Journal Name: IBM J. Res. Dev.; (United States) Vol. 31:4; ISSN IBMJA
Country of Publication:
United States
Language:
English