skip to main content
DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: System, methods and apparatus for program optimization for multi-threaded processor architectures

Abstract

Methods, apparatus and computer software product for source code optimization are provided. In an exemplary embodiment, a first custom computing apparatus is used to optimize the execution of source code on a second computing apparatus. In this embodiment, the first custom computing apparatus contains a memory, a storage medium and at least one processor with at least one multi-stage execution unit. The second computing apparatus contains at least two multi-stage execution units that allow for parallel execution of tasks. The first custom computing apparatus optimizes the code for parallelism, locality of operations and contiguity of memory accesses on the second computing apparatus. This Abstract is provided for the sole purpose of complying with the Abstract requirement rules. This Abstract is submitted with the explicit understanding that it will not be used to interpret or to limit the scope or the meaning of the claims.

Inventors:
; ; ; ; ; ;
Issue Date:
Research Org.:
Reservoir Labs, Inc., New York, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1167014
Patent Number(s):
8930926
Application Number:
12/762,281
Assignee:
Reservoir Labs, Inc. (New York, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
FG02-08ER85149
Resource Type:
Patent
Resource Relation:
Patent File Date: 2010 Apr 16
Country of Publication:
United States
Language:
English
Subject:
42 ENGINEERING

Citation Formats

Bastoul, Cedric, Lethin, Richard A., Leung, Allen K., Meister, Benoit J., Szilagyi, Peter, Vasilache, Nicolas T., and Wohlford, David E. System, methods and apparatus for program optimization for multi-threaded processor architectures. United States: N. p., 2015. Web.
Bastoul, Cedric, Lethin, Richard A., Leung, Allen K., Meister, Benoit J., Szilagyi, Peter, Vasilache, Nicolas T., & Wohlford, David E. System, methods and apparatus for program optimization for multi-threaded processor architectures. United States.
Bastoul, Cedric, Lethin, Richard A., Leung, Allen K., Meister, Benoit J., Szilagyi, Peter, Vasilache, Nicolas T., and Wohlford, David E. Tue . "System, methods and apparatus for program optimization for multi-threaded processor architectures". United States. https://www.osti.gov/servlets/purl/1167014.
@article{osti_1167014,
title = {System, methods and apparatus for program optimization for multi-threaded processor architectures},
author = {Bastoul, Cedric and Lethin, Richard A. and Leung, Allen K. and Meister, Benoit J. and Szilagyi, Peter and Vasilache, Nicolas T. and Wohlford, David E.},
abstractNote = {Methods, apparatus and computer software product for source code optimization are provided. In an exemplary embodiment, a first custom computing apparatus is used to optimize the execution of source code on a second computing apparatus. In this embodiment, the first custom computing apparatus contains a memory, a storage medium and at least one processor with at least one multi-stage execution unit. The second computing apparatus contains at least two multi-stage execution units that allow for parallel execution of tasks. The first custom computing apparatus optimizes the code for parallelism, locality of operations and contiguity of memory accesses on the second computing apparatus. This Abstract is provided for the sole purpose of complying with the Abstract requirement rules. This Abstract is submitted with the explicit understanding that it will not be used to interpret or to limit the scope or the meaning of the claims.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {1}
}

Patent:

Save / Share:

Works referenced in this record:

Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
conference, January 2000


A Compiler Framework for Tiling Imperfectly-Nested Loops
book, January 2000


Normalised Givens rotations for recursive least squares processing
conference, January 1995


Flow-insensitive interprocedural alias analysis in the presence of pointers
book, June 2005


Counting Integer Points in Parametric Polytopes Using Barvinok's Rational Functions
journal, February 2007


Array-data flow analysis and its use in array privatization
conference, January 1993

  • Maydan, Dror E.; Amarasinghe, Saman P.; Lam, Monica S.
  • Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '93
  • https://doi.org/10.1145/158511.158515

A fast algorithm for finding dominators in a flowgraph
journal, January 1979


Maximizing parallelism and minimizing synchronization with affine transforms
conference, January 1997


Polyhedral Code Generation in the Real World
book, January 2006


Enabling Loop Fusion and Tiling for Cache Performance by Fixing Fusion-Preventing Data Dependences
conference, January 2005


Optimal weighted loop fusion for parallel programs
conference, January 1997


Register tiling in nonrectangular iteration spaces
journal, July 2002


Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies
journal, June 2006


The Z-polyhedral model
conference, January 2007


Conversion of control dependence to data dependence
conference, January 1983

  • Allen, J. R.; Kennedy, Ken; Porterfield, Carrie
  • Proceedings of the 10th ACM SIGACT-SIGPLAN symposium on Principles of programming languages - POPL '83
  • https://doi.org/10.1145/567067.567085

Efficient string matching: an aid to bibliographic search
journal, June 1975


Manufacturing cheap, resilient, and stealthy opaque constructs
conference, January 1998

  • Collberg, Christian; Thomborson, Clark; Low, Douglas
  • Proceedings of the 25th ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '98
  • https://doi.org/10.1145/268946.268962

Scanning polyhedra with DO loops
journal, July 1991


Static branch frequency and program profile analysis
conference, January 1994


The program dependence graph and its use in optimization
journal, July 1987


Automatic mapping of nested loops to FPGAS
conference, January 2007


Lattice-based memory allocation
conference, January 2003

  • Darte, Alain; Schreiber, Rob; Villard, Gilles
  • Proceedings of the international conference on Compilers, architectures and synthesis for embedded systems - CASES '03
  • https://doi.org/10.1145/951710.951749

Some efficient solutions to the affine scheduling problem. I. One-dimensional time
journal, October 1992


Lattice-Based Memory Allocation
journal, October 2005


Parallel Sparse Supports for Array Intrinsic Functions of Fortran 90
journal, March 2001


Tiling Imperfectly-nested Loop Nests
conference, January 2000


An experimental evaluation of tiling and shackling for memory hierarchy management
conference, January 1999


A Library for Doing Polyhedral Operations
journal, December 2000


Abstract interpretation: a unified lattice model for static analysis of programs by construction or approximation of fixpoints
conference, January 1977


Towards automatic generation of vulnerability-based signatures
conference, January 2006


Distributed Microarchitectural Protocols in the TRIPS Prototype Processor
conference, December 2006

  • Sankaralingam, Karthikeyan; Nagarajan, Ramadass; McDonald, Robert
  • 2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06)
  • https://doi.org/10.1109/MICRO.2006.19

Iterative modulo scheduling: an algorithm for software pipelining loops
conference, January 1994


On the Best Rank-1 and Rank-( R 1 , R 2 ,. . ., R N ) Approximation of Higher-Order Tensors
journal, January 2000


Solving SAT and SAT Modulo Theories: From an abstract Davis--Putnam--Logemann--Loveland procedure to DPLL(
journal, November 2006


Revisiting the decomposition of Karp, Miller and Winograd
conference, January 1995


Early Control of Register Pressure for Software Pipelined Loops
book, January 2003


Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time
journal, December 1992


Supernode partitioning
conference, January 1988


Effective partial redundancy elimination
conference, January 1994


Generation of Efficient Nested Loops from Polyhedra
journal, October 2000


Dataflow analysis of array and scalar references
journal, February 1991


The mapping of linear recurrence equations on regular arrays
journal, October 1989

  • Quinton, Patrice; van Dongen, Vincent
  • Journal of VLSI signal processing systems for signal, image and video technology, Vol. 1, Issue 2, p. 95-113
  • https://doi.org/10.1007/BF02477176

A Geometric Programming Framework for Optimal Multi-Level Tiling
conference, January 2004


Value dependence graphs: representation without taxation
conference, January 1994

  • Weise, Daniel; Crew, Roger F.; Ernst, Michael
  • Proceedings of the 21st ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '94
  • https://doi.org/10.1145/174675.177907

Scalable Tensor Decompositions for Multi-aspect Data Mining
conference, December 2008


Operator strength reduction
journal, September 2001


A practical automatic polyhedral parallelizer and locality optimizer
journal, May 2008


Code generation for multiple mappings
conference, January 1994


Iterated register coalescing
journal, May 1996


Constant propagation with conditional branches
journal, April 1991


Solution and Optimization of Systems of Pseudo-Boolean Constraints
journal, October 2007


Blocking and array contraction across arbitrarily nested loops using affine partitioning
journal, July 2001


Efficient representation scheme for multidimensional array operations
journal, March 2002


Global code motion/global value numbering
journal, June 1995


Undecidability of static analysis
journal, December 1992


An Efficient Inclusion-Based Points-To Analysis for Strictly-Typed Languages
book, January 2002


On the (Im)possibility of Obfuscating Programs
book, January 2001


Symbolic array dataflow analysis for array privatization and program parallelization
conference, January 1995


Partial dead code elimination
journal, June 1994


Configurable string matching hardware for speeding up intrusion detection
journal, March 2005


Memory optimization by counting points in integer transformations of parametric polytopes
conference, January 2006

  • Seghir, Rachid; Loechner, Vincent
  • Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems - CASES '06
  • https://doi.org/10.1145/1176760.1176771

Scanning polyhedra without Do-loops
conference, January 1998


A data locality optimizing algorithm
journal, June 1991


Cache miss equations: a compiler framework for analyzing and tuning memory behavior
journal, July 1999


Scaling to the end of silicon with EDGE architectures
journal, July 2004


Automatic memory layout transformations to optimize spatial locality in parameterized loop nests
journal, March 2000


Deobfuscation: Reverse Engineering Obfuscated Code
conference, January 2005


Verifying safety properties of a class of infinite-state distributed algorithms
book, January 1995


An accurate cost model for guiding data locality transformations
journal, September 2005


A simple graph-based intermediate representation
journal, March 1995


Ultra-fast aliasing analysis using CLA: a million lines of C code in a second
journal, May 2001


Impact of memory hierarchy on program partitioning and scheduling
conference, January 1995

  • Kaplow, W. K.; Maniatty, W. A.; Szymanski, B. K.
  • Twenty-Eighth Annual Hawaii International Conference on System Sciences, Proceedings of the Twenty-Eighth Hawaii International Conference on System Sciences
  • https://doi.org/10.1109/HICSS.1995.375473

Two Fast Algorithms for Sparse Matrices: Multiplication and Permuted Transposition
journal, September 1978