Computational & Technology Resources
an online resource for computational,
engineering & technology publications |
|
Civil-Comp Proceedings
ISSN 1759-3433 CCP: 95
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING Edited by:
Paper 56
A Comparison of Different Parallel Techniques Applied to the Solution of the Navier-Stokes Equations J. Cotela, R. Rossi, E. Oñate and P. Dadvand
International Center for Numerical Methods in Engineering (CIMNE), Barcelona, Spain , "A Comparison of Different Parallel Techniques Applied to the Solution of the Navier-Stokes Equations", in , (Editors), "Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering", Civil-Comp Press, Stirlingshire, UK, Paper 56, 2011. doi:10.4203/ccp.95.56
Keywords: parallel computing, Navier-Stokes, OpenMP, MPI, Trilinos, uncoupling, pressure Schur complement.
Summary
In this paper, we implement a parallel solver for the incompressible Navier-Stokes equations using the finite element method. We compare two different parallel programming strategies, OpenMP, which is based on a shared memory model, and MPI, which uses a distributed memory model.
The incompressible Navier-Stokes equations constitute a transient, non-linear system which has well-known stability issues when the standard Galerkin finite element discretization is used. This means that some treatment will be required before we can implement their solution. We implement a solution strategy based on algebraic subgrid scale stabilization and a generalized Newmark method for iteration in time. This results in a linear system which, unfortunately, can be poorly scaled, as it involves both velocity and pressure degrees of freedom. As a result, the system is difficult to solve using iterative methods, which are preferable for large problems. To avoid this situation, we have used an uncoupling approach based on a Schur complement formulation for the pressure, which results in a scheme similar to that of fractional step methods. Two different implementations of this method, one using OpenMP and another based on MPI through the Trilinos library, have been used to solve Ahmed's body, a standard benchmark problem in turbulence which simulates the flow around a bus-like object. Both implementations are compared with each other and with a scalar run of the same example, with emphasis on performance gains provided by the parallelization. In this sense, we have found that the time benefit obtained from the parallelization is affected by the hardware where the simulations are run. In our case, this imposes us two clear restrictions: OpenMP is limited to processors that can access some shared memory, and in general the performance is affected by competition between the processes for memory access, which seems to be an important bottleneck in our test system. In general, it can be said that, when choosing an approach to parallelization, it is important to take into account the available hardware. purchase the full-text of this paper (price £20)
go to the previous paper |
|