hama-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hama Wiki] Update of "PerformanceEvaluation" by udanax
Date Thu, 27 Nov 2008 12:39:29 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/hama/PerformanceEvaluation

------------------------------------------------------------------------------
  ||Trunk 718158 || Mult ||2 node ||300 ||300 ||2||2||12 seconds ||1,464,484 || 2,929,092||
  ||Trunk 720735 || Mult ||2 node ||1,000 ||1,000 ||2||2||20 seconds || 16,166,452 || 32,333,028||
  
+ {{{
+ NOTE: The following numbers are obtained by using poe+ on the entire code, including minimal I/O and matrix construction.
+ 
+ Matrix-Matrix Multiply of 5,000 by 5,000 dense matrix
+ 
+ Mflip/s  Wall sec   Library
+ -------  --------   -------------------------------------------
+  8,300       30     PESSL PDGEMM (16 processors)
+  7,900       32     ScaLAPACK routine PDGEMM (16 processors)
+  7,900       32     ESSL-SMP routine DGEMM (16 threads)
+  7,900       32     NAG-SMP routine F01CKF (16 threads)
+  1,200      213     ESSL routine DGEMM
+ 
+ Matrix-Matrix Multiply of 20,000 by 20,000 dense matrix
+ 
+ Mflip/s  Wall sec   Library and configuration
+ -------  --------   -------------------------------------------
+ 158,900     100     ScaLAPACK PDGEMM (256 proc, 16 nodes) 
+ 146,200     110     PESSL PDGEMM (256 proc, 16 nodes) 
+ 105,400     150     ScaLAPACK PDGEMM (144 proc, 9 nodes, block 128) 
+ 100,960     160     PESSL PDGEMM (144 proc, 9 nodes, block 128) 
+  79,400     200     PESSL PDGEMM (144 proc, 9 nodes, block 1024) 
+  74,800     214     ScaLAPACK PDGEMM (144 proc, 9 nodes, block 1024) 
+  55,000     290     PESSL PDGEMM (64 proc, 4 nodes) 
+  50,000     320     ScaLAPACK PDGEMM (64 proc, 4 nodes) 
+  27,160     590     PESSL PDGEMM (32 proc, 2 nodes) 
+  25,630     625     ScaLAPACK PDGEMM (32 proc, 2 nodes) 
+  15,800   1,010     PESSL PDGEMM (16 proc, 1 node)
+  15,600   1,025     ScaLAPACK PDGEMM (16 proc, 1 node)
+ }}}
+ ----
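
As a rough cross-check of the rates listed above, the sketch below estimates a rate from the matrix size and the wall time. It assumes the conventional count of 2*n^3 floating-point operations for a dense n-by-n multiply; the page does not say how its rates were derived, and poe+ reports measured hardware counts rather than this formula, so small differences are expected. The function name `dense_matmul_mflops` is hypothetical and used only for illustration.

```python
def dense_matmul_mflops(n: int, wall_seconds: float) -> float:
    """Estimate millions of floating-point operations per second for an
    n-by-n dense matrix multiply.

    Assumption: the multiply costs 2 * n**3 operations (one multiply and
    one add per inner-product term). This is a model, not a measurement.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    operations = 2.0 * n ** 3
    return operations / wall_seconds / 1e6


if __name__ == "__main__":
    # (label, matrix dimension, wall seconds) taken from the tables above.
    rows = [
        ("PESSL PDGEMM (16 processors), n=5,000", 5_000, 30),
        ("ScaLAPACK PDGEMM (256 proc, 16 nodes), n=20,000", 20_000, 100),
        ("PESSL PDGEMM (16 proc, 1 node), n=20,000", 20_000, 1_010),
    ]
    for label, n, seconds in rows:
        print(f"{label}: {dense_matmul_mflops(n, seconds):,.0f}")
```

For the 5,000 by 5,000 case at 30 seconds this model gives roughly 8,333, close to the 8,300 listed; for the 20,000 by 20,000 case at 100 seconds it gives 160,000 against the listed 158,900.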
+ 
   * Dense LU factorization
   * Transpose
   * Matrix tridiagonalization, for eigenvalue computations of symmetric matrices.
