An example of the performance achieved with ParaWise-generated parallel NAS-LU code

The NAS-LU code (sometimes referred to as APPLU or NPB-LU) is one of the NAS Parallel Benchmarks (NPB). It is a lower-upper (LU) CFD application. Despite the name, it does not perform an LU factorization; instead it implements a symmetric successive over-relaxation (SSOR) numerical scheme to solve a regular-sparse, block lower and upper triangular system.

The parallel code is generic: to run it on a different platform, the code is transferred there (via ftp), then compiled and linked with the appropriate version of CAPLib. Only a relatively small number of changes to the original serial code are needed to generate a parallel version. This can be seen in the 1-D partition parallel code, the 1-D partition reduced memory parallel code, the 1-D partition overlapping communication parallel code and the 2-D partition parallel code.
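To make the distinction between SSOR and an LU factorization concrete, here is a minimal sketch of an SSOR iteration in Python. It is not the NAS-LU implementation: it assumes a small dense scalar system rather than the block system that arises in the benchmark, and the names `ssor_solve` and `poisson_1d` and the relaxation factor `omega=1.2` are hypothetical choices made for illustration only.

```python
def poisson_1d(n):
    """Build a small symmetric positive-definite test matrix (assumed toy problem)."""
    a = [[0.0] * n for _ in range(n)]
    for i in range(n):
        a[i][i] = 2.0
        if i > 0:
            a[i][i - 1] = -1.0
        if i < n - 1:
            a[i][i + 1] = -1.0
    return a


def ssor_solve(a, b, omega=1.2, tol=1e-10, max_sweeps=500):
    """Solve a x = b by SSOR: a forward sweep followed by a backward sweep."""
    n = len(b)
    x = [0.0] * n

    def relax(i):
        # Over-relaxed Gauss-Seidel update of unknown i using the latest values.
        sigma = sum(a[i][j] * x[j] for j in range(n) if j != i)
        gauss_seidel = (b[i] - sigma) / a[i][i]
        x[i] = (1.0 - omega) * x[i] + omega * gauss_seidel

    for sweep in range(1, max_sweeps + 1):
        for i in range(n):              # forward sweep (lower-triangular part)
            relax(i)
        for i in reversed(range(n)):    # backward sweep (upper-triangular part)
            relax(i)
        # Stop once the largest residual component is below the tolerance.
        residual = max(
            abs(b[i] - sum(a[i][j] * x[j] for j in range(n))) for i in range(n)
        )
        if residual < tol:
            return x, sweep
    return x, max_sweeps


if __name__ == "__main__":
    a = poisson_1d(5)
    exact = [1.0, 2.0, 3.0, 4.0, 5.0]
    b = [sum(a[i][j] * exact[j] for j in range(5)) for i in range(5)]
    solution, sweeps = ssor_solve(a, b)
    print(solution, sweeps)
```

No factors of the matrix are ever formed; each iteration only sweeps through the unknowns in one direction and then the other. Each update uses values already computed earlier in the same sweep, so any partitioned parallel version has to respect that ordering.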
SGI Origin 2000
Results for NAS-LU (Ver. NPB2.3) on a 64x64x64 (Class A) grid on the SGI Origin 2000 for ParaWise one-dimensional and two-dimensional partitioning, and NASA manual two-dimensional parallelization (NPB2.2).
Results for NAS-LU (Ver. NPB2.3) on a 102x102x102 (Class B) grid on the SGI Origin 2000 for ParaWise one-dimensional and two-dimensional partitioning, and NASA manual two-dimensional parallelization (NPB2.2).
Cray T3D
* Lower-bound estimate of the serial time (i.e. the speed-up is also a lower bound)
Speed Up Graph for NAS-LU for 32x32x32 and 64x64x64 problems on the Cray T3D.
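The footnote about lower-bound serial times follows from how speed-up is defined: serial time divided by parallel time. If the serial time used is only a lower bound, the resulting speed-up is a lower bound too. The sketch below illustrates this; the function names and the timings in the usage example are hypothetical and are not measurements from this page.

```python
def speedup(serial_time, parallel_time):
    """Speed-up = serial run time / parallel run time (both in the same units)."""
    if serial_time <= 0 or parallel_time <= 0:
        raise ValueError("times must be positive")
    return serial_time / parallel_time


def efficiency(serial_time, parallel_time, processors):
    """Parallel efficiency = speed-up / number of processors."""
    if processors <= 0:
        raise ValueError("processors must be positive")
    return speedup(serial_time, parallel_time) / processors


if __name__ == "__main__":
    # Hypothetical timings, for illustration only.
    serial_lower_bound = 100.0   # seconds; the true serial time is at least this
    parallel_time = 12.5         # seconds on 16 processors
    print(speedup(serial_lower_bound, parallel_time))         # at least this speed-up
    print(efficiency(serial_lower_bound, parallel_time, 16))  # at least this efficiency
```

Because the parallel time is measured directly, any underestimate of the serial time can only make the reported speed-up smaller than the true value, never larger.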
Transtech Paramid
Speed Up Graph for NAS-LU for a 32x32x32 problem on the Transtech Paramid.
Timing Graph for NAS-LU for a 32x32x32 problem on the Transtech Paramid.
Parsys SN9500
Parsytec GC/PP
* Lower-bound estimate of the serial time (i.e. the speed-up is also a lower bound)
Speed Up Graph for NAS-LU for 32x32x32 and 64x64x64 problem sizes on the Parsytec GC/PP.