-
GPU-based compressible lattice Boltzmann simulations on non-uniform grids using standard C++ parallelism: From best practices to aerodynamics, aeroacoustics and supersonic flow simulations
Authors:
Christophe Coreixas,
Jonas Latt
Abstract:
Despite decades of research, creating accurate, robust, and efficient lattice Boltzmann methods (LBM) on non-uniform grids with seamless GPU acceleration remains challenging. This work introduces a novel strategy to address this challenge by integrating simple yet effective components: (1) parallel algorithms in modern C++, (2) conservative cell-centered grid refinement, (3) local boundary conditions, and (4) robust collision models. Our framework supports multiple lattices (D2Q9, D2Q13, D2Q21, D2Q37) tailored to various flow conditions. It includes collision models with polynomial and numerical equilibria, a second distribution for polyatomic behavior, a Jameson-like shock sensor, and generalizes Rohde's refinement strategy.
The framework's accuracy and robustness are validated across diverse benchmarks, including lid-driven cavity flows, Aeolian noise, 30P30N airfoil aerodynamics, inviscid Riemann problems, and viscous flows past a NACA airfoil in transonic and supersonic regimes. Modern C++ further enables our framework to reach GPU-native performance, while ensuring high portability, modularity, and ease of implementation. Notably, weakly compressible LBMs achieve state-of-the-art GPU efficiency on non-uniform grids, while fully compressible LBMs benefit from acceleration equivalent to thousands of CPU cores in the most compute-intensive cases. Our advanced performance models incorporate neighbor-list and asynchronous time-stepping effects, providing new insights into the performance decomposition of LB simulations on non-uniform grids.
Overall, this study sets a new standard for portable, tree-based LBMs, demonstrating that a combination of well-chosen components can achieve high performance, accuracy, and robustness across various flow conditions. As a final proof-of-concept, adaptive mesh refinement is proposed for subsonic and supersonic applications.
Submitted 8 April, 2025; v1 submitted 6 April, 2025;
originally announced April 2025.
-
Palabos Turret: A Particle-Resolved Numerical Framework for Settling Dynamics of Arbitrary-Shaped Particles
Authors:
Taraprasad Bhowmick,
Jonas Latt,
Yong Wang,
Gholamhossein Bagheri
Abstract:
Particles transported in fluids are everywhere, occurring for example in indoor air, the atmosphere, the oceans, and engineering applications. In this study, a novel three-dimensional numerical framework, the Palabos Turret, is presented, which allows fully resolved simulations of the settling dynamics of heavy particles with arbitrary shapes over a wide range of particle Reynolds numbers. The numerical solver is based on the lattice Boltzmann method, utilizing an immersed-boundary approach and a recursive-regularized collision model to fully resolve the particle-fluid interactions. A predictor-corrector scheme is applied for the robust time integration of the six-degrees-of-freedom (6DOF) rigid-body motion. Finally, the multi-scale nature arising from the long free-fall distances of a particle is addressed through a dynamic memory allocation scheme allowing for a virtually infinite falling distance. This solver allows for the simulation of particles of arbitrary shape. The proposed framework is validated using analytical and experimental data for freely falling spheres, ellipsoids, and an irregular particle over a wide range of Reynolds numbers between $5\times10^{-1}$ and $4\times10^4$. For the different Reynolds numbers and particle shapes considered, the Palabos Turret shows excellent agreement with theoretical and experimental values, with a median relative deviation of $\pm1.5\%$ and a maximum deviation of $\pm5\%$. The Palabos Turret enables an in-depth analysis of the translational and rotational dynamics of particles with complex geometries.
Submitted 17 October, 2024; v1 submitted 27 August, 2024;
originally announced August 2024.
-
Twist, turn and encounter: the trajectories of small atmospheric particles unravelled
Authors:
Taraprasad Bhowmick,
Yong Wang,
Jonas Latt,
Gholamhossein Bagheri
Abstract:
Every solid particle in the atmosphere, from ice crystals and pollen to dust, ash, and microplastics, is non-spherical. These particles play significant roles in Earth's climate system, influencing temperature, weather patterns, natural ecosystems, human health, and pollution levels. However, our understanding of these particles is largely based on the theories for extremely small particles and experiments conducted in liquid mediums. In this study, we used an innovative experimental setup and particle-resolved numerical simulations to investigate the behaviour of sub-millimetre ellipsoids of varying shapes in the air. Our results revealed complex decaying oscillation patterns involving numerous twists and turns in these particles, starkly contrasting their dynamics in liquid mediums. We found that the frequency and decay rate of these oscillations have a strong dependence on the particle shape. Interestingly, disk-shaped particles oscillated at nearly twice the frequency of rod-shaped particles, though their oscillations also decayed more rapidly. During oscillation, even subtly non-spherical particles can drift laterally up to ten times their volume-equivalent spherical diameter. This behaviour enables particles to sweep through four times more air both vertically and laterally compared to a volume-equivalent sphere, significantly increasing their encounter rate and aggregation possibility. Our findings provide an explanation for the long-range transport and naturally occurring aggregate formation of highly non-spherical particles such as snowflakes and volcanic ash.
Submitted 28 August, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
Modelling Settling-Driven Gravitational Instabilities at the Base of Volcanic Clouds Using the Lattice Boltzmann Method
Authors:
Jonathan Lemus,
Allan Fries,
Paul A. Jarvis,
Costanza Bonadonna,
Bastien Chopard,
Jonas Latt
Abstract:
Field observations and laboratory experiments have shown that ash sedimentation can be significantly affected by collective settling mechanisms that promote premature ash deposition, with important implications for associated impacts. Among these mechanisms, settling-driven gravitational instabilities result from the formation of a gravitationally unstable particle boundary layer (PBL) that grows between volcanic ash clouds and the underlying atmosphere. The PBL destabilises once it reaches a critical thickness, triggering the formation of rapid, downward-moving ash fingers that remain poorly characterised. We simulate this process by coupling a lattice Boltzmann model, which solves the Navier-Stokes equations for the fluid phase, with a Weighted Essentially Non-Oscillatory (WENO) finite-difference scheme, which solves the advection-diffusion-settling equation describing particle transport. Since the physical problem is advection-dominated, the use of the WENO scheme reduces numerical diffusivity and ensures accurate tracking of the temporal evolution of the interface between the layers. We have validated the new model by showing that the simulated early-time growth rate of the instability is in very good agreement with that predicted by linear stability analysis, whilst the modelled late-stage behaviour also successfully reproduces quantitative results from published laboratory experiments.
Submitted 14 June, 2021;
originally announced June 2021.
-
PalaCell2D: A framework for detailed tissue morphogenesis
Authors:
Raphaël Conradin,
Christophe Coreixas,
Jonas Latt,
Bastien Chopard
Abstract:
In silico, cell-based approaches for modeling biological morphogenesis are used to test and validate our understanding of the biological and mechanical processes that are at work during the growth and the organization of multi-cell tissues. As compared to in vivo experiments, computer-based frameworks dedicated to tissue modeling allow us to easily test different hypotheses, and to quantify the impact of various biophysically relevant parameters.
Here, we propose a formalism based on a detailed, yet simple, description of cells that accounts for intra-, inter- and extra-cellular mechanisms. More precisely, cell growth and division are described through the space and time evolution of the membrane vertices. These vertices follow Newtonian dynamics, meaning that their evolution is controlled by different types of forces: a membrane force (spring and bending), an adherence force (intercellular spring), and external and internal pressure forces. In addition to the cell dynamics, our formalism further relies on a lattice Boltzmann method, using the Palabos library, to simulate the diffusion of chemical signals. The latter aims at driving the growth and migration of a tissue by simply changing the state of the cells.
All of this leads to an accurate description of the growth and division of cells, with realistic cell shapes and where membranes can have different properties. While this work is of a methodological nature, we also propose to validate our framework through simple, yet biologically relevant benchmark tests at both single-cell and full-tissue scales. This includes free and chemically controlled cell tissue growth in an unbounded domain. The ability of our framework to simulate cell migration, cell compression and morphogenesis under external constraints is also investigated in a qualitative manner.
Submitted 13 March, 2021; v1 submitted 30 January, 2021;
originally announced February 2021.
-
Palabos-npFEM: Software for the Simulation of Cellular Blood Flow (Digital Blood)
Authors:
Christos Kotsalos,
Jonas Latt,
Bastien Chopard
Abstract:
Palabos-npFEM is a computational framework for the simulation of blood flow with fully resolved constituents. The software resolves the trajectories and deformed state of blood cells, such as red blood cells and platelets, and the complex interaction between them. The tool combines the lattice Boltzmann solver Palabos for the simulation of blood plasma (fluid phase), a finite element method (FEM) solver for the resolution of blood cells (solid phase), and an immersed boundary method (IBM) for the coupling of the two phases. Palabos-npFEM provides, on top of a CPU-only version, the option to simulate the deformable bodies on GPUs, thus the code is tailored for the fastest supercomputers. The software is integrated in the Palabos core library, and is available on the Git repository https://gitlab.com/unigespc/palabos. It offers the possibility to simulate various setups, e.g. several geometries and blood parameters, and due to its modular design, it allows external solvers to readily replace the provided ones.
Submitted 9 November, 2020;
originally announced November 2020.
-
Cross-platform programming model for many-core lattice Boltzmann simulations
Authors:
Jonas Latt,
Christophe Coreixas,
Joël Beny
Abstract:
We present a novel, hardware-agnostic implementation strategy for lattice Boltzmann (LB) simulations, which yields massive performance on homogeneous and heterogeneous many-core platforms. Based solely on C++17 Parallel Algorithms, our approach does not rely on any language extensions, external libraries, vendor-specific code annotations, or pre-compilation steps. Thanks in particular to a recently proposed GPU back-end to C++17 Parallel Algorithms, it is shown that a single code can compile and reach state-of-the-art performance on both many-core CPU and GPU environments for the solution of a given non-trivial fluid dynamics problem. The proposed strategy is applied to six different, commonly used implementation schemes to assess the performance impact of memory access patterns on different platforms. Nine different LB collision models are included in the tests and exhibit good performance, demonstrating the versatility of our parallel approach. This work shows that it is less than ever necessary to draw a distinction between research and production software, as a concise and generic LB implementation yields performance comparable to that achievable in a hardware-specific programming language. The results also highlight the performance gains achieved by modern many-core CPUs and their apparent capability to narrow the gap with the traditionally massively faster GPU platforms. All code is made available to the community in the form of the open-source project "stlbm", which serves both as a stand-alone simulation software and as a collection of reusable patterns for the acceleration of pre-existing LB codes.
Submitted 9 May, 2021; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Compressible lattice Boltzmann methods with adaptive velocity stencils: An interpolation-free formulation
Authors:
C. Coreixas,
J. Latt
Abstract:
Adaptive lattice Boltzmann methods (LBMs) are based on velocity discretizations that self-adjust to local macroscopic conditions such as velocity and temperature. While this feature improves the accuracy and the stability of LBMs for large velocity and temperature fluctuations, it also strongly impacts the efficiency of the algorithm due to space interpolations that are required to get populations at grid nodes. To avoid this defect, the present work proposes new formulations of adaptive LBMs for the simulation of compressible flows which no longer rely on space interpolations, thereby drastically improving their parallel efficiency for the simulation of high-speed compressible flows. To reach this goal, the adaptive phase discretization is restricted to particular states that are compliant with the efficient "collide and stream" algorithm and, as a consequence, does not require additional interpolation steps. The development of proper state-adaptive solvers with on-grid propagation imposes new restrictions and challenges on the discrete stencils, namely the need for an extended operability range allowing for the transition between two phase discretizations. Achieving the minimum operability range for discrete polynomial equilibria requires rather large stencils (e.g. D2Q81, D2Q121) and is therefore not competitive for compressible flow simulations. However, as shown in the article, the use of numerical equilibria can provide for overlaps in the operability ranges of neighboring discrete shifts at acceptable cost using the D2Q21 lattice. Through several numerical validations, the present approach is shown to allow for an efficient realization of discrete state-adaptive LBMs for high Mach number flows even in the low viscosity regime.
Submitted 10 October, 2020; v1 submitted 28 September, 2020;
originally announced September 2020.
-
Enhanced single-node boundary condition for the Lattice Boltzmann Method
Authors:
Francesco Marson,
Yann Thorimbert,
Jonas Latt,
Bastien Chopard
Abstract:
We propose a new way to implement Dirichlet boundary conditions for complex shapes using data from a single node only, in the context of the lattice Boltzmann method. The resulting novel method exhibits second-order convergence for the velocity field and shows similar or better accuracy than the well-established Bouzidi, Firdaouss, and Lallemand (2001) boundary condition for curved walls, despite its local nature. The method also proves to be suitable to simulate moving rigid objects or immersed surfaces either with or without prescribed motion. The core idea of the new approach is to generalize the description of boundary conditions that combine the bounce-back rule with interpolations and to enhance them by limiting the information involved in the interpolation to the close proximity of the boundary.
Submitted 24 September, 2020; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Anomalous Platelet Transport & Fat-Tailed Distributions
Authors:
Christos Kotsalos,
Karim Zouaoui Boudjeltia,
Ritabrata Dutta,
Jonas Latt,
Bastien Chopard
Abstract:
The transport of platelets in blood is commonly assumed to obey an advection-diffusion equation. Here we propose a disruptive view, by showing that the random part of their velocity is governed by a fat-tailed probability distribution, usually referred to as a Lévy flight. Although for small spatio-temporal scales, it is hard to distinguish it from the generally accepted "red blood cell enhanced" Brownian motion, for larger systems this effect is dramatic as the standard approach may underestimate the flux of platelets by several orders of magnitude, compromising in particular the validity of current platelet function tests.
Submitted 16 September, 2021; v1 submitted 21 June, 2020;
originally announced June 2020.
-
Impact of collision models on the physical properties and the stability of lattice Boltzmann methods
Authors:
C. Coreixas,
G. Wissocq,
B. Chopard,
J. Latt
Abstract:
The lattice Boltzmann method (LBM) is known to suffer from stability issues when the collision model relies on the BGK approximation, especially in the zero viscosity limit and for non-vanishing Mach numbers. To tackle this problem, two kinds of solutions were proposed in the literature. They consist in changing either the numerical discretization (finite-volume, finite-difference, spectral-element, etc.) of the discrete velocity Boltzmann equation (DVBE), or the collision model. In this work, the latter solution is investigated in detail. More precisely, we propose a comprehensive comparison of (static relaxation time based) collision models, in terms of stability, and with preliminary results on their accuracy, for the simulation of isothermal high-Reynolds number flows in the (weakly) compressible regime. The study starts by investigating the possible impact of collision models on the macroscopic behavior of stream-and-collide based D2Q9-LBMs, which clarifies the exact physical properties of collision models on LBMs. It is followed by extensive linear and numerical stability analyses, supplemented with an accuracy study based on the transport of vortical structures over long distances. In order to draw conclusions as general as possible, the most common moment spaces (raw, central, Hermite, central Hermite and cumulant), as well as regularized approaches, are considered for the comparative studies. LBMs based on dynamic collision mechanisms (entropic collision, subgrid scale models, explicit filtering, etc.) are also briefly discussed.
Submitted 6 June, 2020; v1 submitted 12 February, 2020;
originally announced February 2020.
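For readers unfamiliar with the BGK collision step on the D2Q9 lattice that the abstract above takes as its baseline, the following is a minimal single-cell sketch. It is not taken from the paper: it assumes the textbook second-order polynomial equilibrium and lattice units with $c_s^2 = 1/3$, and the function names (`moments`, `equilibrium`, `bgk_collide`) are hypothetical.

```python
# D2Q9 lattice: discrete velocities and their standard weights.
C = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1),
     (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4 / 9] + [1 / 9] * 4 + [1 / 36] * 4


def moments(f):
    """Return density and velocity (rho, ux, uy) of one cell's populations."""
    rho = sum(f)
    ux = sum(fi * cx for fi, (cx, _) in zip(f, C)) / rho
    uy = sum(fi * cy for fi, (_, cy) in zip(f, C)) / rho
    return rho, ux, uy


def equilibrium(rho, ux, uy):
    """Second-order polynomial equilibrium (assumed form, c_s^2 = 1/3)."""
    usq = ux * ux + uy * uy
    feq = []
    for (cx, cy), w in zip(C, W):
        cu = cx * ux + cy * uy
        feq.append(w * rho * (1.0 + 3.0 * cu + 4.5 * cu * cu - 1.5 * usq))
    return feq


def bgk_collide(f, tau):
    """Relax every population toward equilibrium with a single time tau."""
    feq = equilibrium(*moments(f))
    return [fi - (fi - fe) / tau for fi, fe in zip(f, feq)]


# Usage: perturb a fluid at rest and apply one collision.
f0 = list(W)
f0[1] += 0.01
f0[5] -= 0.005
f1 = bgk_collide(f0, tau=0.6)
```

Because this equilibrium reproduces the density and momentum of the input populations, the collision leaves both unchanged. The stability problems discussed in the abstract arise when `tau` approaches 1/2 (vanishing viscosity), which this sketch does not attempt to address.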
-
Reducing bias in dual flip angle T1-mapping in human brain at 7T
Authors:
H. Olsson,
M. Andersen,
R. Wirestam,
J. Lätt,
G. Helms
Abstract:
Purpose: To address the systematic bias in whole-brain dual flip angle (DFA) T1-mapping at 7T by optimizing the flip angle pair and carefully selecting RF pulse shape and duration. Theory and Methods: Spoiled gradient echoes can be used to estimate whole-brain maps of T1. This can be accomplished by using only two acquisitions with different flip angles, i.e., a DFA-based approach. Although DFA-based T1-mapping is seemingly straightforward to implement, it is sensitive to bias caused by incomplete spoiling and incidental magnetization transfer (MT) effects. Further bias is introduced by the increased B0 and B1+ inhomogeneities at 7T. Experiments were performed to determine the optimal flip angle pair and appropriate RF pulse shape and duration. Obtained T1 estimates were validated using inversion recovery prepared EPI and compared to literature values. A multi-echo readout was used to increase SNR, enabling quantification of R2* and susceptibility, χ. Results: Incomplete spoiling was observed above a local flip angle of approximately 20 degrees. An asymmetric Gauss-filtered sinc pulse with a constant duration of 700 µs showed a sufficiently flat frequency response profile to avoid incomplete excitation in areas with high B0 offsets. A pulse duration of 700 µs minimized effects from incidental MT. Conclusion: When performing DFA-based T1-mapping one should (i) limit the higher flip angle to avoid incomplete spoiling, (ii) use an RF pulse shape insensitive to B0 inhomogeneities and (iii) apply a constant RF pulse duration, balanced to minimize incidental MT.
Submitted 24 January, 2020;
originally announced January 2020.
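The two-acquisition idea in the abstract above can be illustrated with the idealized spoiled gradient-echo (Ernst) signal model, $S = M_0 \sin\alpha\,(1-E_1)/(1-E_1\cos\alpha)$ with $E_1 = e^{-TR/T_1}$. The sketch below is not the authors' pipeline: it assumes perfect spoiling, exactly known flip angles, and no MT or B1+ effects, which are precisely the bias sources the paper addresses. The function names and the numerical values are hypothetical.

```python
import math


def spgr_signal(m0, t1, tr, alpha):
    """Ideal spoiled gradient-echo signal (Ernst equation); alpha in radians."""
    e1 = math.exp(-tr / t1)
    return m0 * math.sin(alpha) * (1.0 - e1) / (1.0 - e1 * math.cos(alpha))


def dfa_t1(s1, s2, alpha1, alpha2, tr):
    """Estimate (T1, M0) from two signals via the linearized Ernst equation.

    The model rearranges to S/sin(a) = E1 * S/tan(a) + M0 * (1 - E1),
    so two points define a line whose slope is E1.
    """
    y1, y2 = s1 / math.sin(alpha1), s2 / math.sin(alpha2)
    x1, x2 = s1 / math.tan(alpha1), s2 / math.tan(alpha2)
    e1 = (y2 - y1) / (x2 - x1)
    t1 = -tr / math.log(e1)
    m0 = (y1 - e1 * x1) / (1.0 - e1)
    return t1, m0


# Usage with made-up values: TR = 18 ms, T1 = 1.2 s, flip angles 4 and 18 degrees.
tr, true_t1, true_m0 = 0.018, 1.2, 1000.0
a1, a2 = math.radians(4.0), math.radians(18.0)
s1 = spgr_signal(true_m0, true_t1, tr, a1)
s2 = spgr_signal(true_m0, true_t1, tr, a2)
t1_est, m0_est = dfa_t1(s1, s2, a1, a2, tr)
```

On noise-free synthetic data the estimate recovers the input T1; with real data, any error in the assumed flip angles propagates directly into the slope, which is why the choice of flip angle pair and RF pulse matters.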
-
Digital Blood in Massively Parallel CPU/GPU Systems for the Study of Platelet Transport
Authors:
Christos Kotsalos,
Jonas Latt,
Joel Beny,
Bastien Chopard
Abstract:
We propose a highly versatile computational framework for the simulation of cellular blood flow focusing on extreme performance without compromising accuracy or complexity. The tool couples the lattice Boltzmann solver Palabos for the simulation of the blood plasma, a novel finite element method (FEM) solver for the resolution of the deformable blood cells, and an immersed boundary method for the coupling of the two phases. The design of the tool supports hybrid CPU-GPU executions (fluid, fluid-solid interaction on CPUs, the FEM solver on GPUs), and is non-intrusive, as each of the three components can be replaced in a modular way. The FEM-based kernel for solid dynamics outperforms other FEM solvers and its performance is comparable to the state-of-the-art mass-spring systems. We perform an exhaustive performance analysis on Piz Daint at the Swiss National Supercomputing Centre and provide case studies focused on platelet transport. The tests show that this versatile framework combines unprecedented accuracy with massive performance, rendering it suitable for the upcoming exascale architectures.
Submitted 8 November, 2019;
originally announced November 2019.
-
Efficient supersonic flows through high-order guided equilibrium with lattice Boltzmann
Authors:
Jonas Latt,
Christophe Coreixas,
Joel Beny,
Andrea Parmigiani
Abstract:
A double-distribution-function based lattice Boltzmann method (DDF-LBM) is proposed for the simulation of polyatomic gases in the supersonic regime. The model relies on an extended equilibrium state that is constructed to reproduce the first 13 moments of the Maxwell-Boltzmann distribution exactly. This extends the validity of the standard 5-constraint (mass, momentum and energy) approach and leads to the correct simulation of thermal, compressible flows with only 39 discrete velocities in 3D. The stability of this BGK-LBM is reinforced by relying on Knudsen-number-dependent relaxation times that are computed analytically. Hence, high-Reynolds number, supersonic flows can be simulated in an efficient and elegant manner. While the 1D Riemann problem shows the ability of the proposed approach to handle discontinuities in the zero-viscosity limit, the simulation of the flow past a NACA0012 airfoil (Mach number $\mathrm{Ma}=1.5$, Reynolds number $\mathrm{Re}=10^4$) confirms the excellent behavior of this model in a low-viscosity and supersonic regime. The proposed model is substantially more efficient than the previous 5-moment D3Q343 DDF-LBM and opens up a whole new world of compressible flow applications that can be realistically tackled with a purely LB approach.
Submitted 12 February, 2020; v1 submitted 29 October, 2019;
originally announced October 2019.
-
Comprehensive comparison of collision models in the lattice Boltzmann framework: Theoretical investigations
Authors:
C. Coreixas,
B. Chopard,
J. Latt
Abstract:
Over the last decades, several types of collision models have been proposed to extend the validity domain of the lattice Boltzmann method (LBM), each of them being introduced in its own formalism. The present article proposes a formalism that describes all these methods within a common mathematical framework, and in this way allows us to draw direct links between them. Here, the focus is put on single- and multi-relaxation-time collision models in either their raw moment, central moment, cumulant or regularized form. In parallel, several bases (non-orthogonal, orthogonal, Hermite) are considered for the polynomial expansion of populations. General relationships between moments are first derived to understand how moment spaces are related to each other. In addition, a review of collision models further sheds light on collision models that can be rewritten in a linear matrix form. More quantitative mathematical studies are then carried out by comparing explicit expressions for the post-collision populations. Thanks to this, it is possible to deduce the impact of both the polynomial basis (raw, Hermite, central, central Hermite, cumulant) and the inclusion of regularization steps on isothermal LBMs. Extensive results are provided for the D1Q3, D2Q9, and D3Q27 lattices, the latter being further extended to the D3Q19 velocity discretization. Links with the most common two- and multi-relaxation-time collision models are also provided for the sake of completeness. The present work ends by emphasizing the importance of an accurate representation of the equilibrium state, independently of the choice of moment space. As an addition to the theoretical purpose of the present article, general instructions are provided to help the reader with the implementation of the most complicated collision models.
Submitted 30 May, 2019; v1 submitted 29 April, 2019;
originally announced April 2019.
-
Efficient LBM on GPUs for dense moving objects using immersed boundary condition
Authors:
Joel Beny,
Jonas Latt
Abstract:
There is increasing interest in using immersed boundary methods (IBMs) (Peskin 2000) to model moving objects in computational fluid dynamics. Indeed, this approach is particularly efficient, because the fluid mesh does not need to be body-fitted or to adjust dynamically to the motion of the body. Frequently, IBMs are implemented in combination with the lattice Boltzmann method (LBM) (Krüger 2016). They fit elegantly into the framework of this method, and yield impressive parallel performance. It has also become quite common to accelerate LBM simulations with Graphics Processing Units (GPUs) (Tölke 2010), as the underlying algorithm adjusts naturally to the architecture of such platforms. It is not uncommon to observe speedups of an order of magnitude or more, at equal financial cost or energy consumption, compared to classical CPUs. IBM algorithms are however more difficult to adapt to GPUs, because their complex memory access pattern conflicts with a GPU's strategy of broadcasting data to a large number of GPU cores in single memory accesses. In the existing literature, GPU implementations of LBM-IBM codes are therefore restricted to situations in which the immersed surfaces are very small compared to the total number of fluid cells (Valero-Lara 2014), as is often the case in exterior flow simulations around an obstacle. This assumption is however not valid in many other cases of interest.
We propose a new method for implementing an LBM-IBM on GPUs in the CUDA language, which handles substantially larger immersed surfaces than previous implementations while maintaining acceptable performance.
Submitted 27 March, 2019;
originally announced April 2019.
-
Bridging the computational gap between mesoscopic and continuum modeling of red blood cells for fully resolved blood flow
Authors:
Christos Kotsalos,
Jonas Latt,
Bastien Chopard
Abstract:
We present a computational framework for the simulation of blood flow with fully resolved red blood cells (RBCs) using a modular approach that consists of a lattice Boltzmann solver for the blood plasma, a novel finite element based solver for the deformable bodies and an immersed boundary method for the fluid-solid interaction. For the RBCs, we propose a nodal projective FEM (npFEM) solver which has theoretical advantages over the more commonly used mass-spring systems (mesoscopic modeling), such as unconditional stability, versatile material expressivity, and one set of parameters to fully describe the behavior of the body at any mesh resolution. At the same time, the method is substantially faster than other FEM solvers proposed in this field, and has an efficiency that is comparable to that of mesoscopic models. At its core, the solver uses specially defined potential energies, and builds upon them a fast iterative procedure based on quasi-Newton techniques. For a known material, our solver has only one free parameter that requires tuning, related to the body viscoelasticity. In contrast, state-of-the-art solvers for deformable bodies have more free parameters, and the calibration of the models demands special assumptions regarding the mesh topology, which restrict their generality and mesh independence. We also propose a modification to the potential energy proposed by Skalak et al. 1973 for the red blood cell membrane, which enhances the strain hardening behavior at higher deformations. Our viscoelastic model for the red blood cell, while simple enough and applicable to any kind of solver as a post-convergence step, can accurately capture the characteristic recovery time and tank-treading frequencies. The framework is validated using experimental data, and it proves to be scalable for multiple deformable bodies.
Submitted 27 August, 2019; v1 submitted 15 March, 2019;
originally announced March 2019.
-
Whole-brain diffusional variance decomposition (DIVIDE): Demonstration of technical feasibility at clinical MRI systems
Authors:
Filip Szczepankiewicz,
Jens Sjölund,
Freddy Ståhlberg,
Jimmy Lätt,
Markus Nilsson
Abstract:
Purpose: To assess the technical feasibility of whole-brain diffusional variance decomposition (DIVIDE) based on q-space trajectory encoding (QTE) at clinical MRI systems with varying performance. DIVIDE is used to separate diffusional heterogeneity into components that arise due to isotropic and anisotropic tissue structures. Methods: We designed imaging protocols for DIVIDE using numerically optimized gradient waveforms for diffusion encoding. Imaging was performed at systems with magnetic field strengths between 1.5 and 7 T, and gradient amplitudes between 33 and 80 mT/m. Technical feasibility was assessed from signal characteristics and quality of parameter maps in a single volunteer scanned at all systems. Results: The technical feasibility of QTE and DIVIDE was demonstrated at all systems. The system with the highest performance allowed whole-brain DIVIDE at 2 mm isotropic voxels. The system with the lowest performance required a spatial resolution of 2.5×2.5×4 mm³ to yield a sufficient signal-to-noise ratio. Conclusions: Whole-brain DIVIDE based on QTE is feasible at the investigated MRI systems. This demonstration indicates that tissue features beyond those accessible by conventional diffusion encoding may be explored on a wide range of MRI systems.
Submitted 20 December, 2016;
originally announced December 2016.
-
A physical description of the adhesion and aggregation of platelets
Authors:
Bastien Chopard,
Daniel Ribeiro de Sousa,
Jonas Latt,
Frank Dubois,
Catherine Yourassowsky,
Pierre Van Antwerpen,
Omer Eker,
Luc Vanhamme,
David Perez-Morga,
Guy Courbebaisse,
Karim Zouaoui Boudjeltia
Abstract:
The early stages of clot formation in blood vessels involve platelet adhesion and aggregation. Although these mechanisms have been extensively studied, gaps in their understanding still persist. We have performed detailed in-vitro experiments and developed a numerical model to better describe and understand this phenomenon. Unlike previous studies, we took into account both activated and non-activated platelets, as well as the 3D nature of the aggregation process. Our investigation reveals that blood albumin is a major parameter limiting platelet adhesion and aggregation. Our results also show that the well-accepted Zydney-Colton shear-induced diffusivity is much too low to explain the observed deposition rate. Simulations are in very good agreement with observations and provide quantitative estimates of the adhesion and aggregation rates that are hard to measure experimentally.
Submitted 3 November, 2015;
originally announced November 2015.
-
Local mesh refinement sensor for the lattice Boltzmann method
Authors:
Yann Thorimbert,
Daniel Lagrava,
Orestis Malaspinas,
Bastien Chopard,
Christophe Coreixas,
Jose Pedro de Santana Neto,
Ralf Deiterding,
Jonas Latt
Abstract:
A novel mesh refinement sensor is proposed for lattice Boltzmann methods (LBMs) applicable to either static or dynamic mesh refinement algorithms. The sensor exploits the kinetic nature of LBMs by evaluating the departure of distribution functions from their local equilibrium state. This sensor is first compared, in a qualitative manner, to three state-of-the-art sensors: (1) the vorticity norm, (2) the Q-criterion, and (3) spatial derivatives of the vorticity. This comparison shows that our kinetic sensor is the most suitable candidate for proposing tailored mesh structures across a wide range of physical phenomena: incompressible, compressible subsonic/supersonic single phase, and weakly compressible multiphase flows. As a more quantitative validation, the sensor is then used to produce the computational mesh for two existing open-source LB solvers based on inhomogeneous, block-structured meshes with static and dynamic refinement algorithms, implemented in the Palabos and AMROC-LBM software, respectively. The sensor is first used to generate a static mesh to simulate the turbulent 3D lid-driven cavity flow using Palabos. AMROC-LBM is then adopted to confirm the ability of our sensor to dynamically adapt the mesh to reach the steady state of the 2D lid-driven cavity flow. Both configurations show that our sensor successfully produces meshes of high quality and saves computational time.
Submitted 1 March, 2022; v1 submitted 24 July, 2015;
originally announced July 2015.
-
Spatial Coupling of a Lattice Boltzmann fluid model with a Finite Difference Navier-Stokes solver
Authors:
Jonas Latt,
Bastien Chopard,
Paul Albuquerque
Abstract:
In multiscale, multi-physics applications, there is an increasing need for coupling numerical solvers that are each applied to a different part of the problem. Here we consider the case of coupling a Lattice Boltzmann fluid model and a Finite Difference Navier-Stokes solver. The coupling is implemented so that the entire computational domain can be divided into two regions, with the FD solver running on one of them and the LB one on the other.
We show how the various physical quantities of the two approaches should be related to ensure a smooth transition at the interface between the regions. We demonstrate the feasibility of the method on the Poiseuille flow, where the LB and FD schemes are used on adjacent sub-domains.
The same idea can also be developed to couple LB models with Finite Volume or Finite Element calculations.
The motivation for developing such a type of coupling is that, depending on the geometry of the flow, one technique can be more efficient, less memory consuming, or physically more appropriate than the other in some regions (e.g. near the boundaries), whereas the converse is true for other parts of the same system. We can also imagine that a given system solved, say by FD, can be augmented in some spatial regions with a new physical process that is better treated by a LB model. Our approach allows us to only modify the concerned region without altering the rest of the computation.
Submitted 29 November, 2005;
originally announced November 2005.
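
The abstract above states that the physical quantities of the LB and FD descriptions must be related at the interface, but it does not give the relations. As a minimal sketch of one commonly used mapping (an assumption here, not necessarily the authors' formulation), D2Q9 populations can be rebuilt from the FD fields by adding a first-order Chapman-Enskog non-equilibrium term to the equilibrium. The function name `populations_from_macroscopic`, the choice of the D2Q9 lattice, lattice units, and the BGK relaxation time `tau` are all illustrative assumptions.

```python
# Sketch only: FD -> LB mapping on a D2Q9 lattice in lattice units (dx = dt = 1).
# Assumes a BGK relaxation time `tau` and a first-order Chapman-Enskog closure.

# Standard D2Q9 velocities, weights and squared speed of sound.
C = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1), (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4 / 9] + [1 / 9] * 4 + [1 / 36] * 4
CS2 = 1.0 / 3.0


def equilibrium(rho, ux, uy):
    """Second-order polynomial equilibrium for D2Q9."""
    usq = ux * ux + uy * uy
    feq = []
    for (cx, cy), w in zip(C, W):
        cu = cx * ux + cy * uy
        feq.append(w * rho * (1 + cu / CS2 + cu * cu / (2 * CS2 * CS2) - usq / (2 * CS2)))
    return feq


def populations_from_macroscopic(rho, ux, uy, grad_u, tau):
    """Hypothetical helper: build populations from FD fields.

    grad_u = ((dux/dx, dux/dy), (duy/dx, duy/dy)), e.g. from finite differences.
    """
    # Symmetric strain-rate tensor S.
    sxx = grad_u[0][0]
    syy = grad_u[1][1]
    sxy = 0.5 * (grad_u[0][1] + grad_u[1][0])
    f = []
    for (cx, cy), w, fe in zip(C, W, equilibrium(rho, ux, uy)):
        # Q_i : S with Q_i = c_i c_i - cs^2 I
        q_s = (cx * cx - CS2) * sxx + 2 * cx * cy * sxy + (cy * cy - CS2) * syy
        f.append(fe - w * tau * rho / CS2 * q_s)
    return f
```

In a Poiseuille-like setting, only `dux/dy` would be non-zero in `grad_u`. The reverse direction (LB to FD) is simpler under the same assumptions: density and velocity are the zeroth and first moments of the populations.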
-
Lattice Boltzmann Method with regularized non-equilibrium distribution functions
Authors:
Jonas Latt,
Bastien Chopard
Abstract:
A new lattice Boltzmann (LB) model is introduced, based on a regularization of the pre-collision distribution functions in terms of the local density, velocity, and momentum flux tensor. The model dramatically improves the precision and numerical stability for the simulation of fluid flows by LB methods. This claim is supported by simulation results of some 2D and 3D flows.
Submitted 20 June, 2005;
originally announced June 2005.
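
The abstract above describes regularizing the pre-collision populations using only the local density, velocity, and momentum flux tensor, without giving a formula. The following is a minimal sketch of that idea on a D2Q9 lattice, assuming the widely used projection of the non-equilibrium part onto the second-order tensor Q_i = c_i c_i - cs^2 I. The lattice choice and the function names (`equilibrium`, `regularize`) are illustrative assumptions, not taken from the paper.

```python
# Sketch only: regularization of pre-collision populations on D2Q9 (lattice units).

# Standard D2Q9 velocities, weights and squared speed of sound.
C = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1), (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4 / 9] + [1 / 9] * 4 + [1 / 36] * 4
CS2 = 1.0 / 3.0


def equilibrium(rho, ux, uy):
    """Second-order polynomial equilibrium for D2Q9."""
    usq = ux * ux + uy * uy
    feq = []
    for (cx, cy), w in zip(C, W):
        cu = cx * ux + cy * uy
        feq.append(w * rho * (1 + cu / CS2 + cu * cu / (2 * CS2 * CS2) - usq / (2 * CS2)))
    return feq


def regularize(f):
    """Replace f by f_eq plus a non-equilibrium part rebuilt from Pi_neq only."""
    # Local density and velocity (zeroth and first moments).
    rho = sum(f)
    ux = sum(fi * c[0] for fi, c in zip(f, C)) / rho
    uy = sum(fi * c[1] for fi, c in zip(f, C)) / rho
    feq = equilibrium(rho, ux, uy)
    # Non-equilibrium momentum flux tensor Pi_neq = sum_i c_i c_i (f_i - f_i^eq).
    fneq = [a - b for a, b in zip(f, feq)]
    pxx = sum(fn * c[0] * c[0] for fn, c in zip(fneq, C))
    pxy = sum(fn * c[0] * c[1] for fn, c in zip(fneq, C))
    pyy = sum(fn * c[1] * c[1] for fn, c in zip(fneq, C))
    freg = []
    for (cx, cy), w, fe in zip(C, W, feq):
        # Q_i : Pi_neq with Q_i = c_i c_i - cs^2 I
        q_pi = (cx * cx - CS2) * pxx + 2 * cx * cy * pxy + (cy * cy - CS2) * pyy
        freg.append(fe + w / (2 * CS2 * CS2) * q_pi)
    return freg
```

Under these assumptions the regularized populations keep the same density, momentum, and momentum flux tensor as the input, while higher-order non-equilibrium content is discarded; a standard collision step would then be applied to the output of `regularize`.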