Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Automatic Parallelization of Numerical Python Applications using the Global Arrays Toolkit

Conference ·

Global Arrays is a software system from Pacific Northwest National Laboratory that enables an efficient, portable, and parallel shared-memory programming interface to manipulate distributed dense arrays. The NumPy module is the de facto standard for numerical calculation in the Python programming language, a language whose use is growing rapidly in the scientific and engineering communities. NumPy provides a powerful N-dimensional array class as well as other scientific computing capabilities. However, like the majority of the core Python modules, NumPy is inherently serial. Using a combination of Global Arrays and NumPy, we have reimplemented NumPy as a distributed drop-in replacement called Global Arrays in NumPy (GAiN). Serial NumPy applications can become parallel, scalable GAiN applications with only minor source code changes. Scalability studies of several different GAiN applications will be presented showing the utility of developing serial NumPy codes which can later run on more capable clusters or supercomputers.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1038396
Report Number(s):
PNNL-SA-84645; KJ0402000
Country of Publication:
United States
Language:
English

Similar Records

Using the Global Arrays Toolkit to Reimplement NumPy for Distributed Computation
Conference · Thu Sep 01 00:00:00 EDT 2011 · OSTI ID:1095469

GAiN: Distributed Array Computation with Python
Thesis/Dissertation · Fri May 01 00:00:00 EDT 2009 · OSTI ID:1006323

Global Arrays Parallel Programming Toolkit
Book · Fri Dec 31 23:00:00 EST 2010 · OSTI ID:1182356