

cusparseCsrmvEx has been deprecated in favor of cusparseSpMV.cusparseConstrainedGeMM has been deprecated in favor of cusparseSDDMM.All routines support NVTX annotation for enhancing the profiler time line on complex applications.Better accuracy of cusparseAxpby, cusparseRot, cusparseSpVV for bfloat16 and half regular/complex data types.New routine for Sampled Dense Matrix - Dense Matrix Multiplication (cusparseSDDMM) which deprecated cusparseConstrainedGeMM and provides better performance.

New algorithm (CUSPARSE_SPMM_CSR_ALG3) for Sparse Matrix - Matrix Multiplication (cusparseSpMM) with better performance especially for small matrices.Support for deterministic and non-deterministic computation.Support for mixed regular-complex data type computation.Support for regular/complex bfloat16 data types for both uniform and mixed-precision computation.Extended functionalities for cusparseSpMV:.New algorithms for CSR/COO Sparse Matrix - Vector Multiplication (cusparseSpMV) with better performance.New Tensor Core-accelerated Block Sparse Matrix - Matrix Multiplication (cusparseSpMM) and introduction of the Blocked-Ellpack storage format.The user has to link libcusolver.so with the correct version of libcublas.so. However, it breaks backward compatibility. This reduces the binary size of libcusolver.so. libcusolver.so no longer links libcublas_static.a instead, it depends on libcublas.so.GESVDR computes partial spectrum with random sampling, an order of magnitude faster than GESVD. New singular value decomposition (GESVDR) is added.
Cuda 7.5 driver for osx code#
Cuda 7.5 driver for osx update#
This section summarizes the changes in CUDA 11.2.1 (11.2 Update 1) since the 11.2.0 GA release.

