Publication

Computation of Large Covariance Matrices by SAMMY on Graphical Processing Units and Multicore CPUs...

by Goran Arbanas, Michael E Dunn, Dorothea A Wiarda
Publication Type
Conference Paper
Conference Name
International Conference on Mathematics and Computational Methods Applied to Nuclear Science and Engineering (M&C 2011)
Conference Location
Rio de Janeiro, Brazil

The computational power of Graphical Processing Units (GPUs) and multicore CPUs was harnessed by the nuclear data evaluation code SAMMY to speed up the computation of large Resonance Parameter Covariance Matrices (RPCMs). This was accomplished by linking SAMMY to vendor-optimized implementations of the matrix-matrix multiplication subroutine of the Basic Linear Algebra Subprograms (BLAS) library, which performs the most time-consuming step. The U-235 RPCM, previously computed using a triple-nested loop, was recomputed using the NVIDIA implementation of the subroutine on a single Tesla Fermi GPU, and also using the Intel Math Kernel Library implementation on two different multicore CPU systems. A multiplication of two matrices of dimensions 16,000 x 20,000, which had previously taken days, took approximately one minute on the GPU. Similar performance was achieved on a dual six-core CPU system. The magnitude of the speed-up suggests that these, or similar, combinations of hardware and libraries may be useful for large matrix operations in SAMMY. The uniform interfaces of standard linear algebra libraries make them promising candidates for the programming framework of a new generation of SAMMY for emerging heterogeneous computing platforms.
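
To make the scale of the computation concrete, the following is a minimal sketch, not SAMMY's actual code. It shows the kind of triple-nested loop that a single vendor-optimized BLAS matrix-matrix multiplication call would replace, together with the standard operation count for such a product. The function names are hypothetical, and the shapes used in the estimate (a 16,000 x 20,000 matrix multiplied by a 20,000 x 16,000 matrix) are an assumption, since the abstract does not state how the two matrices are oriented.

```python
def matmul_triple_loop(a, b):
    """Reference C = A * B with a triple-nested loop (hypothetical helper).

    In a BLAS-linked build, this whole loop nest would be replaced by one
    call to the library's matrix-matrix multiplication subroutine.
    """
    m, k = len(a), len(a[0])
    if len(b) != k:
        raise ValueError("inner dimensions do not match")
    n = len(b[0])
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):          # rows of A
        for j in range(n):      # columns of B
            acc = 0.0
            for p in range(k):  # shared inner dimension
                acc += a[i][p] * b[p][j]
            c[i][j] = acc
    return c


def gemm_flops(m, n, k):
    """Operation count for an (m x k) by (k x n) product:
    one multiply and one add per inner-loop step."""
    return 2 * m * n * k


def sustained_gflops(m, n, k, seconds):
    """Average throughput in GFLOP/s if the product takes `seconds`."""
    return gemm_flops(m, n, k) / seconds / 1e9


if __name__ == "__main__":
    # Tiny correctness check of the loop nest.
    print(matmul_triple_loop([[1.0, 2.0], [3.0, 4.0]],
                             [[5.0, 6.0], [7.0, 8.0]]))
    # ASSUMPTION: (16,000 x 20,000) times (20,000 x 16,000), done in 60 s.
    print(gemm_flops(16000, 16000, 20000))
    print(round(sustained_gflops(16000, 16000, 20000, 60.0), 1))
```

Under the assumed shapes, the product needs about 1.0e13 floating-point operations, so finishing in approximately one minute corresponds to a sustained rate on the order of 170 GFLOP/s. A plain loop nest like the one above does nothing to exploit caches, vector units, or multiple cores, which is the gap that optimized library routines are designed to close.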