
- POLYSHIFT Communications Software for the Connection Machine System
- Computer Science 6365 February 7 and 12, 2008 Lecture #8 and 9: Interconnection Networks
- An Efficient Communication Strategy for Finite Element Methods on the
- Scientific Software Libraries for Scalable Architectures
- DPF: A Data Parallel Fortran Benchmark Suite
- A Data--Parallel Implementation of O(N ) HierarchicalN--body Methods
- High Performance, Scalable Scientific Software Libraries
- Language and Compiler Issues in Scalable High Performance Scientific Libraries
- ADMISSION CONTROL AND RESOURCE RESERVATION FOR QUALITY OF SERVICE PROVISIONING IN CELLULAR MOBILE WIRELESS NETWORKS
- AN ADAPTIVE SOFTWARE LIBRARY FOR FAST FOURIER TRANSFORMS
- Computer Science 6365 March, 27 2008 Lecture #20-21: Dense matrix multiplication
- On the Accuracy of Poisson's Formula Based N--Body Algorithms
- Multiplication of Matrices of Arbitrary Shape on a Data Parallel Computer
- Computer Science 6365 January 31, 2008 Lecture #6: Memory Systems Data distribution
- ROMM Routing: A Class of Efficient Minimal Routing Algorithms
- Local Basic Linear Algebra Subroutines (LBLAS) for the CM--5/5E
- An Efficient Communication Strategy for Finite Element Methods on the
- Data Parallel Finite Element Techniques for Compressible Flow Problems
- Data Partitioning for Load--Balance and Communication Bandwidth Preservation
- Data Motion and High Performance S. Lennart Johnsson
- CooleyTukey FFT on the Connection S. Lennart Johnsson
- A Data--Parallel Adaptive N--body Method
- Minimizing the Communication Time for Matrix Multiplication on Multiprocessors
- A Stencil Complier for the Connection Machine Models CM2/200
- A Data Parallel Implementation of Hierarchical N--body Methods
- CMSSL: A Scalable Scientific Software S. Lennart Johnsson
- Performance Modeling of Distributed Memory Architectures
- Finite Element Techniques for Computational Fluid Dynamics on
- Communication Primitives for Unstructured Finite Element Simulations
- Computer Science 6365 February 5, 2008 Lecture #7: Memory Systems-II
- Optimal Communication Channel Utilization for Matrix Transposition and
- Block--Cyclic Dense Linear Algebra Woody Lichtenstein
- Computer Science 6365 April 22, 2008 Lecture #27: Linear Recurrences
- All--to--all Communication Algorithms for Distributed BLAS
- All--to--all Broadcast and Applications on the Connection Machine
- QCD on the Connection Machine: Beyond *LISP
- High Performance Fortran for Highly Irregular Problems
- Massively Parallel Computing: Unstructured Finite Element Simulations
- Massively Parallel Computing: Data distribution and communication
- Massively Parallel Computing: Mathematics and communications
- Index Transformation Algorithms in a Linear Algebra Framework
- Randomized, Oblivious, Minimal Routing Algorithms for Multicomputers
- A DataParallel Implementation of the Geometric Partitioning Algorithm
- Mesh Decomposition and Communication Procedures for Finite Element
- Scalability of Finite Element Applications on DistributedMemory Parallel
- Computer Science 6365 January 24, 2008 Lecture #4: Performance Concepts
- Computer Science 6365 February 21, 2008 Lecture #12: Vector Architectures
- Computer Science 6365 February 26, 2008 Lecture 13: Vectorization
- Computer Science 6365 March 6, 2008 Lecture #16: Parallel Sorting-I
- Computer Science 6365 March 25, 2005 Lecture 19: Sorting II
- Computer Science 6365 April 22, 2008 Lecture #26-2: Fast Fourier Transforms II
- Computer Science 6365 April 28, 2008 Lecture 28: Sparse matrix computations
- MPI-HPF COMMUNICATION TECHNIQUES Presented to
- Parallel implementation of recursive spectral bisection on the Connection
- ROMM Routing on Mesh and Torus S. Lennart Johnsson
- Optimal AlltoAll Personalized Communication with Minimum Span on
- Communication and I/O Libraries Chair: S. Lennart Johnsson
- All--to--All Communication on the Connection Machine CM200
- On the Accuracy of Anderson's Fast N--body Algorithm
- Computer Science 6365 April 17, 2008 Lecture #26: Fast Fourier Transforms I
- Generalized Shuffle Permutations on Boolean Cubes
- Issues in High Performance Computer Position Paper
- Network Related Performance Issues and Techniques for MPPs
- Communication Efficient Multiprocessor S. Lennart Johnsson
- An Efficient Algorithm for Gray--to--Binary Permutation on
- Implementing O(N ) N--body algorithms efficiently in data parallel languages
- A Data Parallel Finite Element Method for Computational Fluid Dynamics on the
- A Data--Parallel Adaptive N--body Method We present a data--parallel formulation of the 3--D Anderson's method
- DATA PARALLEL PERFORMANCE OPTIMIZATIONS USING
- IMPLEMENTATION AND PERFORMANCE ANALYSIS OF A HIGH{ORDER CEM ALGORITHM IN PARALLEL
- On the Conversion between Binary Code and Binary--Reflected Gray Code on
- DPF: A Data Parallel Fortran Benchmark S. Lennart Johnsson
- The Connection Machine Systems CM5 S. Lennart Johnsson
- Local Basic Linear Algebra Subroutines (LBLAS) for Distributed Memory