Parallel implementation of the sparse QR decomposition for rectangular upper quasi triangular matrix with ND-type sparsity
Authors: Kharchenko S.A., Yushchenko A.A.
Section: Computational mathematics
Published in: Issue 2, Vol. 5, 2016.
Free access
The paper considers a parallel MPI+threads+SIMD implementation of an algorithm for computing the sparse QR decomposition of a specially ordered rectangular matrix. The decomposition is based on block sparse Householder transformations. The algorithm starts with independent parallel QR decompositions of sets of matrix rows; then, following the computation tree, QR decompositions are performed for matrices assembled from elements of the R factors of the row decompositions. The results of numerical experiments on test problems show the efficiency of the parallel implementation. The algorithm can also be implemented efficiently on heterogeneous cluster architectures with GPGPU accelerators.
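The two-stage scheme described in the abstract (independent QR of row sets, then QR of stacked R factors along a tree) can be illustrated with a minimal dense sketch. This is not the authors' implementation: it ignores sparsity, the nested dissection ordering, block Householder storage, and MPI/threads/SIMD parallelism. The binary tree shape, the function names `householder_r` and `tree_qr_r`, and the `n_blocks` parameter are assumptions made for illustration only.

```python
import math


def householder_r(rows):
    """Return the R factor of a dense matrix given as a list of rows."""
    a = [list(r) for r in rows]  # work on a copy
    m, n = len(a), len(a[0])
    for k in range(min(m, n)):
        # Norm of column k on and below the diagonal.
        norm = math.sqrt(sum(a[i][k] ** 2 for i in range(k, m)))
        if norm == 0.0:
            continue  # column is already zero, nothing to eliminate
        # Choose the sign that avoids cancellation in v[0].
        alpha = -norm if a[k][k] >= 0 else norm
        v = [a[i][k] for i in range(k, m)]
        v[0] -= alpha
        vtv = sum(x * x for x in v)
        # Apply H = I - 2 v v^T / (v^T v) to columns k..n-1.
        for j in range(k, n):
            s = 2.0 * sum(v[i] * a[k + i][j] for i in range(len(v))) / vtv
            for i in range(len(v)):
                a[k + i][j] -= s * v[i]
        # Entries below the diagonal are zero up to rounding; set them exactly.
        for i in range(k + 1, m):
            a[i][k] = 0.0
    return a[:min(m, n)]


def tree_qr_r(rows, n_blocks=4):
    """R factor via leaf QRs of row blocks merged along a binary tree (assumed shape)."""
    size = -(-len(rows) // n_blocks)  # ceiling division
    # Leaf level: each row block is factored independently (parallelizable).
    level = [householder_r(rows[i:i + size]) for i in range(0, len(rows), size)]
    # Upper levels: stack neighbouring R factors and factor the stacked matrix.
    while len(level) > 1:
        level = [householder_r(sum(level[i:i + 2], []))
                 for i in range(0, len(level), 2)]
    return level[0]
```

For a matrix of full column rank, the R factor returned by `tree_qr_r` should agree with the R factor of a direct factorization up to the signs of its rows, because both satisfy R^T R = A^T A. Only R is computed here; recovering Q would require keeping the Householder vectors from every tree node.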
QR decomposition, MPI, SIMD, sparse rectangular matrix, upper quasi triangular matrix, nested dissection, Householder transformations, multithreading
Short URL: https://sciup.org/147160590
IDR: 147160590 | DOI: 10.14529/cmse160203