Computational & Technology Resources
an online resource for computational,
engineering & technology publications |
|
Civil-Comp Proceedings
ISSN 1759-3433 CCP: 76
PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON ENGINEERING COMPUTATIONAL TECHNOLOGY Edited by: B.H.V. Topping and Z. Bittnar
Paper 43
An Efficient Algorithm for Parallel Stiffness Matrix Assembling A. Unterkircher and J. Reissner
Institute for Virtual Production, ETH Zürich, Switzerland A. Unterkircher, J. Reissner, "An Efficient Algorithm for Parallel Stiffness Matrix Assembling", in B.H.V. Topping, Z. Bittnar, (Editors), "Proceedings of the Third International Conference on Engineering Computational Technology", Civil-Comp Press, Stirlingshire, UK, Paper 43, 2002. doi:10.4203/ccp.76.43
Keywords: parallel assembling, shared memory, graph theory, metal forming simulation.
Summary
Besides the solution of the linear equation system, assembling of the stiffness
matrix is the most time consuming task in finite element codes. This is especially
true if one uses elements with many integration points and complicated material
laws needed in order to achieve sufficient accuracy. The rapidly decreasing
prices for computer hardware make multi-processor machines (e.g. a dual Intel PC)
affordable even for small companies. Thus we have developed an algorithm for
stiffness matrix assembling on shared memory multi-processor architectures for
our 3D bulk forming simulation code. The algorithm is easy to implement and
nevertheless efficient.
The main problem with parallel assembling of the stiffness matrix is that different elements may contribute to the same entry in the matrix. Several approaches to avoid access conflicts to matrix entries have been proposed including the use of semaphores [1], special renumber and divide techniques [2] and row-wise assembling [3]. Our algorithm is based on graph theory: first we generate the dual graph of the finite element mesh. A recursive maximal independent set algorithm works on the dual graph and delivers indices of elements which do not share any nodes. These elements are being assembled by different processors and the element matrices can be directly added to the global stiffness matrix. The algorithm is quite easy to implement. Furthermore it is independent of dimension, element type and model shape. We incorporated the algorithm into the non-linear FE program PressForm [4]. This code uses an Arbitrary Lagrangian-Eulerian descripton to simulate extrusion and other 3D bulk forming processes. The non-linear equilibrium equations are being solved by the direct iteration method, i.e. within one increment one has to assemble and solve several linear systems. Using real life examples of various size we discuss several performance aspects: a good parallel efficiency as well as the scalability of the construction of the dual graph and the maximal independent set algorithm. References
purchase the full-text of this paper (price £20)
go to the previous paper |
|