|   | Computational & Technology Resources an online resource for computational, engineering & technology publications | 
| Computational Science, Engineering & Technology Series ISSN 1759-3158 CSETS: 27 TRENDS IN PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING Edited by: P. Iványi, B.H.V. Topping Chapter 11 High Performance Strategies for Large Scale Complex Structural Analysis J.Y. Cognard1 and P. Verpeaux2 1Laboratoire Brestois de Mécanique et des Systèmes, ENSIETA, Brest, France  Full Bibliographic Reference for this chapter J.Y. Cognard, P. Verpeaux, "High Performance Strategies for Large Scale Complex Structural Analysis", in P. Iványi, B.H.V. Topping, (Editors), "Trends in Parallel, Distributed, Grid and Cloud Computing for Engineering", Saxe-Coburg Publications, Stirlingshire, UK, Chapter 11, pp 243-268, 2011. doi:10.4203/csets.27.11 Keywords: non-linear computations, parallel strategies, algorithms, large scale problems, load balancing, industrial environment. Summary Reducing the time and cost of mechanical design requires  the non-linearities to be taken into account
when simulating the behaviour of the structures. Unfortunately, these simulations often lead to
numerical costs too high for their use to be widespread in the industry. The joint use of powerful
algorithms and parallel computers is necessary to significantly reduce the cost of these complex
simulations. In order to obtain accurate numerical predictions, especially in respect of the  safety
constraints which are more and more required in high-tech industries, realistic models
have to be used. For such analysis, often the effects of aging, sometimes in severe environments,
can have a great  influence on the mechanical behaviour of the materials which can lead the solution of
coupled problems. Therefore, such studies must take into account more and more accurate
numerical mechanical properties which leads to the model use of the various parts of the  structures studied.
Unfortunately, the numerical simulation of these problems can be difficult, as they generally lead to the
solution of   large scale complex time-dependent non-linear problems.
 Non-linear problems are usually solved using the so-called incremental methods, which split the studied time interval into a series of time increments. Using an estimate for the displacement leads to a time-independent non-linear problem, which is solved by means of Newton type iterative method. This algorithm mainly leads to solving two types of sub-problems, which can be time consuming for a large number of degrees of freedom and for strongly non-linear constitutive laws, i.e. for industrial type problems. On the one hand, linear global problems defined over the whole structure have to be solved, and on the other hand, the integration of the constitutive relationships leads to the solution of local in space non-linear equations (at each integration point). Therefore two main difficulties exist with different mechanical properties. Moreover the iterative resolution process generates a coupling of these two difficulties which has to be taken into account in the numerical implementation. A precise modelling of some structures (composites, assemblies, etc.) requires highly mechanical properties contrasts to be taken into account, and for some industrial complex assemblies the mesh can contain some flattened elements. These two properties can lead to very bad conditioning of the stiffness matrix with an under elastic assumption of the different constituents. The resolution of such complex linear problems, taking into account the various boundary conditions can be accurately done using direct solvers or direct parallel solvers. For tridimensional applications, even using adequate ordering approaches which limit the fill-in effect in the factorization of the matrix in order to reduce the matrix storage, the computational wall-clock time increases very quickly with the number of degree of freedom. Another important limitation is the storage of the stiffness matrix which also increases very quickly with the number of degrees of freedom (d.o.f); the use of out of core storage strategies is thus necessary to solve large scale linear problems. But, it is important to understand that the use of intensive disk storage drastically increases the computational wall-clock time. For instance, for a powerful PC a practical limit is around 10,000,000 d.o.f. with around 100 Gb matrix storage. Various powerful iterative solvers exist, for instance parallel preconditioned conjugate gradient techniques, but the convergence of such approaches is not often assured in the case of very bad stiffness matrix conditioning. Such problems can be encountered for industrial applications, for research analysis and for inverse identification procedures of material models parameters which can be time consuming simulations for complex three-dimensional models. Thus, robust solvers have to be developed, in order to take into account the quality criteria of industrial type software, i.e. the determination of the correct solution of the problem (if the data is correct) with a predictable wall-clock time and without the need for tuning. Therefore, the definition of high performance strategies for large scale complex time-dependent non-linear structural analysis requires the joint use of powerful algorithms and efficient parallel strategies. The aim of this research project was to extend the possibilities of the finite element code CAST3M (developed at CEA, France), where the purpose is to facilitate the development of new algorithms. Moreover, it is important to take into account the experiments over several decades of developments of powerful numerical strategies for non-linear simulations in industrial environments. The challenge is to merge the possibilities of different parallel computers (in particular efficient and economic configurations of multi-core 64 bits computers) with the traditional requirements of an industrial code: robustness and flexibility, ease of use, and predictability of computational resource employment. The proposed parallelisation strategy uses the mechanical properties of these two types of subproblems to be solved. On the one hand, domain decomposition techniques can be used to solve the linear global problems. On the other hand, the CPU time spent to integrate the constitutive laws depends on several parameters: the material behaviour, the position of the integration point in the structure, the history of the loading path, etc. Therefore, for complex simulations, it is nearly impossible to predict the space evolution of the numerical cost of this part with respect to the increments. In order to have a well-balanced load during the integration of the constitutive laws, without communication, we propose the use of a type of second domain decomposition. An optimisation of the communications between the two domain decompositions is necessary to obtain good performance for the simulation of a wide class of non-linear problems for quasi-static response. The starting point is to make use of the mechanical properties of the different types of equations to be solved in order to distribute computations over the different processors of a parallel computer. The approach is based on the use of two domain decompositions where the goal is to balance the computation load by limiting the redistribution of the tasks. A good load balancing of the tasks as well as keeping the communications as low as possible are necessary to obtain an effective parallel algorithm. The implementation of this algorithm is carried out starting from an extension of the possibilities of GIBIANE: the user language of the code CAST3M. We have created a parallel environment language that eases the development of parallel algorithms either at the programming level or at the user level. It is based on the development environment of the finite element code CAST3M. The parallel language developed, which is based on an object-based virtual shared memory system, offers the user the vision of a unique and global address space over the individual memories. It ensures the data coherence and hides data exchanges between processors and much of the sequential code can be directly reused. The system proposed can be implemented on most parallel computers as it is developed with machine-independent programming techniques and it is important to notice that the different concepts can be used in other object-based parallel languages. The purpose of this chapter is to present strategies developed efficiently to solve non-linear mechanical problems with highly contrasting mechanical properties which can lead to very bad stiffness matrix conditioning. Numerical examples, in the case of large scale industrial type problems are presented to validate the proposed parallel approach. purchase the full-text of this chapter (price £20) 
go to the previous chapter | |