
A MATRIX-FREE ILU REALIZATION BASED ON SURROGATES
DANIEL DRZISGA†, ANDREAS WAGNER† ∗,AND BARBARA WOHLMUTH†
Abstract.
Matrix-free techniques play an increasingly important role in large-scale simulations.
Schur complement techniques and massively parallel multigrid solvers for second-order elliptic partial
differential equations can significantly benefit from reduced memory traffic and consumption. The
matrix-free approach often restricts solver components to purely local operations, for instance, to the
most basic schemes like Jacobi- or Gauss–Seidel-Smoothers in multigrid methods. An incomplete LU
(ILU) decomposition cannot be calculated from local information and is therefore not amenable to an
on-the-fly computation which is typically needed for matrix-free calculations. It generally requires the
storage and factorization of a sparse matrix which contradicts the low memory requirements in large
scale scenarios. In this work, we propose a matrix-free ILU realization. More precisely, we introduce
a memory-efficient, matrix-free ILU(0)-Smoother component for low-order conforming finite-elements
on tetrahedral hybrid grids. Hybrid-grids consist of an unstructured macro-mesh which is subdivided
into a structured micro-mesh. The ILU(0) is used for degrees-of-freedom assigned to the interior
of macro-tetrahedra. This ILU(0)-Smoother can be used for the efficient matrix-free evaluation of
the Steklov–Poincar´e operator from domain-decomposition methods. After introducing and formally
defining our smoother, we investigate its performance on refined macro-tetrahedra. Secondly, the
ILU(0)-Smoother on the macro-tetrahedrons is implemented via surrogate matrix polynomials in
conjunction with a fast on-the-fly evaluation scheme resulting in an efficient matrix-free algorithm.
The polynomial coefficients are obtained by solving a least-squares problem on a small part of the
factorized ILU(0) matrices to stay memory efficient. The convergence rates of this smoother with
respect to the polynomial order are thoroughly studied.
Key words. ILU-Smoother, multigrid, hybrid grids, polynomial surrogates, matrix-free
AMS subject classifications. 65F55, 65N55
1. Introduction.
The incomplete LU(0)-factorization [34] (ILU) approximates
an LU factorization by retaining the sparsity pattern of the original matrix. For strongly
anisotropic problems in 2D, it is often used as a smoother within multigrid algorithms
since its convergence rates are more stable than the ones of simpler smoothers like
the Gauss–Seidel- or Jacobi-Smoothers [47, Sec. 7.8]. This property carries on to
anisotropic 3D problems in which the coupling in one spatial direction is dominant while
other schemes have to be used if two of the spatial directions are dominant [26]. The
related thresholded ILU-Smoother was recently used for p-multigrid in isogeometric
analysis [41,42] or as a smoother for the wave equation [45].
Besides its usage as a smoother, incomplete factorizations like the ILU are used
as preconditioners [5,40], for instance in problems involving the incompressible Stokes
equation [24] or in electromagnetic scattering [31]. An algorithm for a communication
avoiding ILU(0) preconditioner in the high-performance context was introduced in [22].
Algorithms for the efficient parallel assembly of thresholded ILU preconditioners can
be found in [3] including adaptions to GPUs in [4,30].
Matrix-free methods are becoming increasingly prevalent within finite-element
frameworks [27,29,46]. For instance, large scale mantle-convection simulations typically
operate on scales on which storing the discretization matrices is not always feasible [6].
On the other hand, reducing the memory traffic by not requiring to load a matrix from
memory has the potential to result in faster algorithms on today’s hardware. This
∗Corresponding author.
Funding:
This work was partly supported by the German Research Foundation through grant
WO671/11-1.
†
Lehrstuhl f¨ur Numerische Mathematik, Fakult¨at f¨ur Mathematik (M2), Technische Universit¨at
M¨unchen, Garching bei M¨unchen (drzisga@ma.tum.de,wagneran@ma.tum.de,wohlmuth@ma.tum.de)
1
arXiv:2210.15280v1 [math.NA] 27 Oct 2022