Sparse matrix-matrix multiplication in CUDA using cuSPARSE

We are providing a fully worked code implementing matrix-matrix sparse multiplication using CSR format. To avoid any ambiguity on sparse matrix format, the code starts from dense matrices and uses cusparse<t>dense2csr to convert the matrix format from dense to CSR. The two matrices involved in the code are A and B. Matrix B is a permutation matrix. The code, available on our GitHub website, calculates C = B * A.