-
Optimal Polynomial Smoothers for Parallel AMG
Authors:
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone,
Stefano Massei,
Stephen Thomas
Abstract:
In this paper, we propose some Chebyshev polynomials of the 1st-kind which produce optimal bound for a polynomial dependent constant involved in the AMG $V$-cycle error bound and do not require information about the spectrum of matrices. We formulate a variant of a minimax problem already proposed in [J. Lottes, Optimal polynomial smoothers for multigrid V-cycles, Numer. Lin. Alg. with Appl., 30 (…
▽ More
In this paper, we propose some Chebyshev polynomials of the 1st-kind which produce optimal bound for a polynomial dependent constant involved in the AMG $V$-cycle error bound and do not require information about the spectrum of matrices. We formulate a variant of a minimax problem already proposed in [J. Lottes, Optimal polynomial smoothers for multigrid V-cycles, Numer. Lin. Alg. with Appl., 30 (2023), p. e2518, https://doi.org/10.1002/nla.2518] and define Chebyshev polynomial of the 1st-kind as acceleration for a weighted-Jacobi smoother; we also describe efficient GPU kernels for the application of the polynomial smoother and compare results with accelerators defined in [J. Lottes, Optimal polynomial smoothers for multigrid V-cycles, Numer. Lin. Alg. with Appl., 30 (2023), p. e2518, https://doi.org/10.1002/nla.2518] on usual benchmarks at very large scales.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
PSCToolkit: solving sparse linear systems with a large number of GPUs
Authors:
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone
Abstract:
In this chapter, we describe the Parallel Sparse Computation Toolkit (PSCToolkit), a suite of libraries for solving large-scale linear algebra problems in an HPC environment. In particular, we focus on the tools provided for the solution of symmetric and positive-definite linear systems using up to 8192 GPUs on the EuroHPC-JU Leonardo supercomputer. PSCToolkit is an ongoing mathematical software p…
▽ More
In this chapter, we describe the Parallel Sparse Computation Toolkit (PSCToolkit), a suite of libraries for solving large-scale linear algebra problems in an HPC environment. In particular, we focus on the tools provided for the solution of symmetric and positive-definite linear systems using up to 8192 GPUs on the EuroHPC-JU Leonardo supercomputer. PSCToolkit is an ongoing mathematical software project aimed at exploiting the extreme computational speed of current supercomputers for relevant problems in Computational and Data Science.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Alya towards Exascale: Algorithmic Scalability using PSCToolkit
Authors:
Herbert Owen,
Oriol Lehmkuhl,
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone
Abstract:
In this paper, we describe an upgrade of the Alya code with up-to-date parallel linear solvers capable of achieving reliability, efficiency and scalability in the computation of the pressure field at each time step of the numerical procedure for solving a Large Eddy Simulation formulation of the incompressible Navier-Stokes equations. We developed a software module in Alya's kernel to interface th…
▽ More
In this paper, we describe an upgrade of the Alya code with up-to-date parallel linear solvers capable of achieving reliability, efficiency and scalability in the computation of the pressure field at each time step of the numerical procedure for solving a Large Eddy Simulation formulation of the incompressible Navier-Stokes equations. We developed a software module in Alya's kernel to interface the libraries included in the current version of PSCToolkit, a framework for the iterative solution of sparse linear systems on parallel distributed-memory computers by Krylov methods coupled to Algebraic MultiGrid preconditioners. The Toolkit has undergone various extensions within the EoCoE-II project with the primary goal of facing the exascale challenge. Results on a realistic benchmark for airflow simulations in wind farm applications show that the PSCToolkit solvers significantly outperform the original versions of the Conjugate Gradient method available in the Alya's kernel in terms of scalability and parallel efficiency and represent a very promising software layer to move the Alya code towards exascale.
△ Less
Submitted 19 December, 2023; v1 submitted 29 October, 2022;
originally announced October 2022.
-
Why diffusion-based preconditioning of Richards equation works: spectral analysis and computational experiments at very large scale
Authors:
Daniele Bertaccini,
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone
Abstract:
We consider here a cell-centered finite difference approximation of the Richards equation in three dimensions, averaging for interface values the hydraulic conductivity $K=K(p)$, a highly nonlinear function, by arithmetic, upstream, and harmonic means. The nonlinearities in the equation can lead to changes in soil conductivity over several orders of magnitude and discretizations with respect to sp…
▽ More
We consider here a cell-centered finite difference approximation of the Richards equation in three dimensions, averaging for interface values the hydraulic conductivity $K=K(p)$, a highly nonlinear function, by arithmetic, upstream, and harmonic means. The nonlinearities in the equation can lead to changes in soil conductivity over several orders of magnitude and discretizations with respect to space variables often produce stiff systems of differential equations. A fully implicit time discretization is provided by \emph{backward Euler} one-step formula; the resulting nonlinear algebraic system is solved by an inexact Newton Armijo-Goldstein algorithm, requiring the solution of a sequence of linear systems involving Jacobian matrices. We prove some new results concerning the distribution of the Jacobians eigenvalues and the explicit expression of their entries. Moreover, we explore some connections between the saturation of the soil and the ill-conditioning of the Jacobians. The information on eigenvalues justifies the effectiveness of some preconditioner approaches which are widely used in the solution of Richards equation. We also propose a new software framework to experiment with scalable and robust preconditioners suitable for efficient parallel simulations at very large scales. Performance results on a literature test case show that our framework is very promising in the advance towards realistic simulations at extreme scale.
△ Less
Submitted 15 July, 2022; v1 submitted 9 December, 2021;
originally announced December 2021.
-
AMG preconditioners for Linear Solvers towards Extreme Scale
Authors:
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone
Abstract:
Linear solvers for large and sparse systems are a key element of scientific applications, and their efficient implementation is necessary to harness the computational power of current computers. Algebraic MultiGrid (AMG) preconditioners are a popular ingredient of such linear solvers; this is the motivation for the present work where we examine some recent developments in a package of AMG precondi…
▽ More
Linear solvers for large and sparse systems are a key element of scientific applications, and their efficient implementation is necessary to harness the computational power of current computers. Algebraic MultiGrid (AMG) preconditioners are a popular ingredient of such linear solvers; this is the motivation for the present work where we examine some recent developments in a package of AMG preconditioners to improve efficiency, scalability, and robustness on extreme-scale problems. The main novelty is the design and implementation of a parallel coarsening algorithm based on aggregation of unknowns employing weighted graph matching techniques; this is a completely automated procedure, requiring no information from the user, and applicable to general symmetric positive definite (s.p.d.) matrices. The new coarsening algorithm improves in terms of numerical scalability at low operator complexity over decoupled aggregation algorithms available in previous releases of the package. The preconditioners package is built on the parallel software framework \texttt{PSBLAS}, which has also been updated to progress towards exascale. We present weak scalability results on one of the most powerful supercomputers in Europe, for linear systems with sizes up to $O(10^{10})$ unknowns.
△ Less
Submitted 2 May, 2021; v1 submitted 29 June, 2020;
originally announced June 2020.
-
Automatic coarsening in Algebraic Multigrid utilizing quality measures for matching-based aggregations
Authors:
Pasqua D'Ambra,
Fabio Durastante,
Salvatore Filippone,
Ludmil Zikatanov
Abstract:
In this paper, we discuss the convergence of an Algebraic MultiGrid (AMG) method for general symmetric positive-definite matrices. The method relies on an aggregation algorithm, named \emph{coarsening based on compatible weighted matching}, which exploits the interplay between the principle of compatible relaxation and the maximum product matching in undirected weighted graphs. The results are bas…
▽ More
In this paper, we discuss the convergence of an Algebraic MultiGrid (AMG) method for general symmetric positive-definite matrices. The method relies on an aggregation algorithm, named \emph{coarsening based on compatible weighted matching}, which exploits the interplay between the principle of compatible relaxation and the maximum product matching in undirected weighted graphs. The results are based on a general convergence analysis theory applied to the class of AMG methods employing unsmoothed aggregation and identifying a quality measure for the coarsening; similar quality measures were originally introduced and applied to other methods as tools to obtain good quality aggregates leading to optimal convergence for M-matrices. The analysis, as well as the coarsening procedure, is purely algebraic and, in our case, allows an \emph{a posteriori} evaluation of the quality of the aggregation procedure which we apply to analyze the impact of approximate algorithms for matching computation and the definition of graph edge weights. We also explore the connection between the choice of the aggregates and the compatible relaxation convergence, confirming the consistency between theories for designing coarsening procedures in purely algebraic multigrid methods and the effectiveness of the coarsening based on compatible weighted matching. We discuss various completely automatic algorithmic approaches to obtain aggregates for which good convergence properties are achieved on various test cases.
△ Less
Submitted 31 July, 2022; v1 submitted 27 January, 2020;
originally announced January 2020.