Updating incomplete factorization preconditioners for model order reduction
|
journal
|
February 2016 |
Kokkos: Enabling manycore performance portability through polymorphic memory access patterns
|
journal
|
December 2014 |
Ifpack2 User's Guide 1.0
|
report
|
May 2016 |
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs
|
book
|
January 2015 |
Fine-Grained Parallel Incomplete LU Factorization
|
journal
|
January 2015 |
ParILUT---A New Parallel Threshold ILU Factorization
|
journal
|
January 2018 |
Parallel Graph Coloring for Manycore Architectures
|
conference
|
May 2016 |
Iterative Methods for Sparse Linear Systems
|
book
|
January 2003 |
A New Paradigm for Parallel Adaptive Meshing Algorithms
|
journal
|
January 2000 |
Towards Extreme-Scale Simulations for Low Mach Fluids with Second-Generation Trilinos
|
journal
|
December 2014 |
Multithreaded sparse matrix-matrix multiplication for many-core and GPU architectures
|
journal
|
October 2018 |
ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures
|
journal
|
January 2016 |
An overview of the Trilinos project
|
journal
|
September 2005 |
A Distributed-Memory Package for Dense Hierarchically Semi-Separable Matrix Computations Using Randomization
|
journal
|
June 2016 |
ParILUT - A Parallel Threshold ILU for GPUs
|
conference
|
May 2019 |
Reducing communication in algebraic multigrid using additive variants
|
journal
|
February 2014 |
Communication Avoiding ILU0 Preconditioner
|
journal
|
January 2015 |
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers
|
journal
|
March 2018 |
ShyLU: A Hybrid-Hybrid Solver for Multicore Platforms
- Rajamanickam, Sivasankaran; Boman, Erik G.; Heroux, Michael A.
-
2012 IEEE 26th International Parallel and Distributed Processing Symposium (IPDPS)
https://doi.org/10.1109/IPDPS.2012.64
|
conference
|
May 2012 |
Reducing Parallel Communication in Algebraic Multigrid through Sparsification
|
journal
|
January 2016 |
Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods
|
journal
|
January 2012 |
A fast adaptive solver for hierarchically semiseparable representations
|
journal
|
December 2005 |
Improving Multifrontal Methods by Means of Block Low-Rank Representations
|
journal
|
January 2015 |
A new parallel domain decomposition method for the adaptive finite element solution of elliptic partial differential equations
|
journal
|
January 2001 |
An $\mathcal{O}(N \log N)$ Fast Direct Solver for Partial Hierarchically Semi-Separable Matrices: With Application to Radial Basis Function Interpolation
|
journal
|
April 2013 |
Non-Galerkin Coarse Grids for Algebraic Multigrid
|
journal
|
January 2014 |
A Study of Clustering Techniques and Hierarchical Matrix Formats for Kernel Ridge Regression
|
conference
|
May 2018 |
Fast linear algebra-based triangle counting with KokkosKernels
- Wolf, Michael M.; Deveci, Mehmet; Berry, Jonathan W.
-
2017 IEEE High Performance Extreme Computing Conference (HPEC)
https://doi.org/10.1109/HPEC.2017.8091043
|
conference
|
September 2017 |
Designing vector-friendly compact BLAS and LAPACK kernels
- Kim, Kyungjoo; Costa, Timothy B.; Deveci, Mehmet
-
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '17)
https://doi.org/10.1145/3126908.3126941
|
conference
|
January 2017 |
Distance-two interpolation for parallel algebraic multigrid
|
journal
|
January 2008 |
Stencil computations for PDE-based applications with examples from DUNE and hypre
|
journal
|
February 2017 |
Fast Triangle Counting Using Cilk
|
conference
|
September 2018 |
A low-communication, parallel algorithm for solving PDEs based on range decomposition
|
journal
|
March 2016 |
Basker: Parallel sparse LU factorization utilizing hierarchical parallelism and data layouts
|
journal
|
October 2017 |
A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices
|
conference
|
May 2018 |
A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems
|
journal
|
September 2019 |
Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster
- Yamazaki, Ichitaro; Rajamanickam, Sivasankaran; Boman, Erik G.
-
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
https://doi.org/10.1109/SC.2014.81
|
conference
|
November 2014 |
A Parallel Multigrid Preconditioned Conjugate Gradient Algorithm for Groundwater Flow Simulations
|
journal
|
September 1996 |
Robust and Accurate Stopping Criteria for Adaptive Randomized Sampling in Matrix-Free Hierarchically Semiseparable Construction
|
journal
|
January 2019 |
Tacho: Memory-Scalable Task Parallel Sparse Cholesky Factorization
|
conference
|
May 2018 |
Algebraic Multigrid Domain and Range Decomposition (AMG-DD/AMG-RD)
|
journal
|
January 2015 |
An HSS Matrix-Inspired Butterfly-Based Direct Solver for Analyzing Scattering From Two-Dimensional Objects
|
journal
|
January 2017 |
A communication-avoiding 3D sparse triangular solver
|
conference
|
January 2019 |
Distance-two interpolation for parallel algebraic multigrid
|
journal
|
July 2007 |
ParILUT - A parallel threshold ILU for GPUs
|
text
|
January 2019 |