Computational & Technology Resources
an online resource for computational,
engineering & technology publications |
|
Civil-Comp Proceedings
ISSN 1759-3433 CCP: 89
PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ENGINEERING COMPUTATIONAL TECHNOLOGY Edited by: M. Papadrakakis and B.H.V. Topping
Paper 4
Reliable Performance Prediction for Parallel Scientific Software in a Multi-Cluster Grid Environment G. Romanazzi, P.K. Jimack and C.E. Goodyer
School of Computing, University of Leeds, United Kingdom G. Romanazzi, P.K. Jimack, C.E. Goodyer, "Reliable Performance Prediction for Parallel Scientific Software in a Multi-Cluster Grid Environment", in M. Papadrakakis, B.H.V. Topping, (Editors), "Proceedings of the Sixth International Conference on Engineering Computational Technology", Civil-Comp Press, Stirlingshire, UK, Paper 4, 2008. doi:10.4203/ccp.89.4
Keywords: parallel distributed algorithms, cluster computing, performance evaluation and prediction, multilevel numerical software.
Summary
This paper shows the results of modelling the performance
of practical engineering numerical software, such as [1],
across different clusters of processors, with both multicore and heterogeneous resources.
The goal of this model is to allow reliable predictions to be
made as to the execution time of a given parallel code on a large number of processors
of a given parallel system, by only benchmarking the code on a small numbers of processors.
When extra memory and processors are available, parallel multilevel
implementations are able to solve problems numerically on finer meshes,
so as to achieve greater accuracy than would be otherwise possible.
The methodology described permits us to estimate the performance prior
to actually running with these very fine meshes on a large numbers of processors.
This is of great potential value in making decisions concerning the scheduling
of jobs within a grid environment.
We describe some recent related work into performance modelling for parallel numerical software, and therefore we introduce the new idea upon which our methodology is based, which is similar to that presented in [2]. Our goal here is to exploit this methodology for predicting the performance of practical multilevel software, that does not necessarily scale linearly with the size of the physical problem. We model the parallel execution time as sum of two terms: the computational time and the parallel overhead, the latter is primarily due to inter-processor communications. We describe the methodology used to model the computational time. In order to accurately predict it, care needs to be taken to preserve the geometric shape of the subdomains for the parallel runs. We have also to consider, and exploit, any heterogeneity and/or the multicore features associated with the processors and parallel architectures used. We describe the methodology used to predict the parallel overheads. In order to predict the communications patterns for large parallel runs, we use only information based on the patterns observed for sequences of runs across small numbers of processors. Finally, a selection for numerical results for our methodology are presented and discussed. We demonstrate that the methodology is robust and accurate across multicore and inhomogeneous parallel architectures. References
purchase the full-text of this paper (price £20)
go to the previous paper |
|