Computational & Technology Resources
an online resource for computational,
engineering & technology publications |
|
Civil-Comp Proceedings
ISSN 1759-3433 CCP: 95
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING Edited by:
Paper 36
Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems V. Heuveline1, D. Lukarski1,2, C. Subramanian1 and J.-P. Weiss1,2
1Engineering Mathematics and Computing Lab (EMCL), 2SRG New Frontiers in High Performance Computing,
V. Heuveline, D. Lukarski, C. Subramanian, J.-P. Weiss, "Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems", in , (Editors), "Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering", Civil-Comp Press, Stirlingshire, UK, Paper 36, 2011. doi:10.4203/ccp.95.36
Keywords: parallel preconditioning, iterative solvers, GMRES, multi-colouring, graphics processing units, multi-core CPU, convection-diffusion equation.
Summary
platform-specific implementations aggravate flexible and portable
implementations and impede programmer productivity. Second, increasing core
counts require fine-grained parallelism in the numerical schemes and
algorithms. All software development needs to be designed towards
scalability with respect to high core counts. This situation becomes
particularly apparent for preconditioning techniques, where classical approaches
are typically sequential or only have a limited degree of parallelism.
Our proposed concept of a multi-platform linear algebra toolbox implemented within the framework of the HiFlow3 finite element package [1] is tackling both the aspects described. It provides unified interfaces to highly optimized implementations on diverse hardware platforms including multicore CPUs and GPUs [2]. It allows modular building of solvers for typical numerical applications. It provides cross-platform portability, flexible utilization of resources at run time, and scalable fine-grained parallelism. In this work we detail the concept of our hybrid and portable approach for efficient parallel finite element software designed in the HiFlow3 project. To this end, we examine a typical scenario in numerical simulation by means of a convection-diffusion equation revealing challenges for preconditioning for non-symmetric problems. Based on a two-level library with backends to multi-core CPUs and multi-GPUs, we investigate performance and efficiency of parallel preconditioning techniques for the GMRES solver based on multi-colouring and matrix reorderings for incomplete decompositions and Gauss-Seidel splitting schemes. This work extends our results for preconditioning of symmetric conjugate gradient solvers [3] on single GPUs and multicore CPUs. Our approach proves efficiency and scalability across hybrid multi-core and GPU platforms where parallelism is exploited on the level of basic linear algebra routines. Our multi-coloring schemes show considerable performance improvements, especially for the dual GPU configuration. It is a robust solution and applicable to a large class of problems, in particular to non-symmetric systems. Our methodology can be applied as an out-of-the-box approach for any matrix originating in the context of finite elements. References
purchase the full-text of this paper (price £20)
go to the previous paper |
|