skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: High speed finite element simulations on the graphics card

Abstract

A software package is developed to perform explicit time domain finite element simulations of ultrasonic propagation on the graphical processing unit, using Nvidia’s CUDA. Of critical importance for this problem is the arrangement of nodes in memory, allowing data to be loaded efficiently and minimising communication between the independently executed blocks of threads. The initial stage of memory arrangement is partitioning the mesh; both a well established ‘greedy’ partitioner and a new, more efficient ‘aligned’ partitioner are investigated. A method is then developed to efficiently arrange the memory within each partition. The technique is compared to a commercial CPU equivalent, demonstrating an overall speedup of at least 100 for a non-destructive testing weld model.

Authors:
;  [1]
  1. Department of Mechanical Engineering, Imperial College, London, SW7 2AZ (United Kingdom)
Publication Date:
OSTI Identifier:
22263788
Resource Type:
Journal Article
Resource Relation:
Journal Name: AIP Conference Proceedings; Journal Volume: 1581; Journal Issue: 1; Conference: 40. annual review of progress in quantitative nondestructive evaluation, Baltimore, MD (United States), 21-26 Jul 2013, 10. international conference on Barkhausen noise and micromagnetic testing, Baltimore, MD (United States), 21-26 Jul 2013; Other Information: (c) 2014 AIP Publishing LLC; Country of input: International Atomic Energy Agency (IAEA)
Country of Publication:
United States
Language:
English
Subject:
71 CLASSICAL AND QUANTUM MECHANICS, GENERAL PHYSICS; FINITE ELEMENT METHOD; NONDESTRUCTIVE TESTING; SIMULATION

Citation Formats

Huthwaite, P., and Lowe, M. J. S. High speed finite element simulations on the graphics card. United States: N. p., 2014. Web. doi:10.1063/1.4865070.
Huthwaite, P., & Lowe, M. J. S. High speed finite element simulations on the graphics card. United States. doi:10.1063/1.4865070.
Huthwaite, P., and Lowe, M. J. S. Tue . "High speed finite element simulations on the graphics card". United States. doi:10.1063/1.4865070.
@article{osti_22263788,
title = {High speed finite element simulations on the graphics card},
author = {Huthwaite, P. and Lowe, M. J. S.},
abstractNote = {A software package is developed to perform explicit time domain finite element simulations of ultrasonic propagation on the graphical processing unit, using Nvidia’s CUDA. Of critical importance for this problem is the arrangement of nodes in memory, allowing data to be loaded efficiently and minimising communication between the independently executed blocks of threads. The initial stage of memory arrangement is partitioning the mesh; both a well established ‘greedy’ partitioner and a new, more efficient ‘aligned’ partitioner are investigated. A method is then developed to efficiently arrange the memory within each partition. The technique is compared to a commercial CPU equivalent, demonstrating an overall speedup of at least 100 for a non-destructive testing weld model.},
doi = {10.1063/1.4865070},
journal = {AIP Conference Proceedings},
number = 1,
volume = 1581,
place = {United States},
year = {Tue Feb 18 00:00:00 EST 2014},
month = {Tue Feb 18 00:00:00 EST 2014}
}