Parallel implementation of the sparse QR decomposition for rectangular upper quasi triangular matrix with ND-type sparsity

Бесплатный доступ

The paper considers parallel MPI+threads+SIMD implementation of the algorithm for computing sparse QR decomposition of a specially ordered rectangular matrix. Decomposition is based on block sparse Householder transformations. The algorithm starts with independent parallel QR decompositions for sets of matrix rows; and then, according to the computations tree, the QR decomposition is performed for matrices, combined with elements of R factors of rows decompositions. The results of numerical experiments for test problems show efficiency of the parallel implementation. The algorithm can also be efficiently implemented on heterogeneous cluster architectures with GPGPU accelerators.

Еще

Qr разложение, mpi, simd, sparse rectangular matrix, upper quasi triangular matrix, nested dissection, qr decomposition, householder transformations, multithreading

Короткий адрес: https://sciup.org/147160590

IDR: 147160590   |   DOI: 10.14529/cmse160203

Статья научная