Search results: resource list
Cholesky_MPI
- MPI source code for a parallel matrix Cholesky decomposition algorithm.
cannon
- Two MPI implementations of Cannon's parallel matrix multiplication algorithm for Linux: one using blocking operations, the other using non-blocking operations.
mysumma
- MPI implementation of SUMMA parallel matrix multiplication that does not require the matrix row and column counts to be perfect squares.
newsumma
- Another MPI implementation of SUMMA parallel matrix multiplication; the broadcast transfers data one block at a time, which makes it noticeably faster.
MPI_Bcast
- Source code for an MPI parallel implementation of the LU algorithm for matrix multiplication, using contiguous row partitioning.
Matrix-multiplication
- CUDA implementation of matrix multiplication using shared memory, as one form of parallel implementation.
cannon
- Parallel implementation of Cannon's matrix multiplication algorithm using MPI.
juzhenxiangcheng
- Parallel matrix multiplication algorithms: one partitions the matrix by rows, another partitions it by columns.
MatMultop
- OpenMP parallel matrix multiplication program; the number of threads is configurable.
Matrix
- Parallel matrix multiplication program written in C/C++; the implementation is complete.
stripMatrixMultiplus
- MPI matrix multiplication with striped partitioning, with parallel file reads and writes through MPI.
Gauss
- Parallel Gaussian elimination program; measures the running time for coefficient matrices of different sizes and for different numbers of processor cores.
MatrixMultiplication_OpenCL
- Parallel matrix multiplication based on the OpenCL general-purpose computing interface.
matrix
- MPI source code for a parallel matrix multiplication algorithm; uses parallelism to speed up the computation.
spmv_csr
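- The DIA/ELLPACK/COO/CSR/HYB representations of sparse matrices, with CUDA implementations of sparse matrix-vector multiplication (large sparse matrix × vector) for each representation. When the number of nonzero elements per row is fairly uniform, ELLPACK performs best, followed by HYB (ELL+COO). CUDA™ is a general-purpose parallel computing architecture introduced by NVIDIA that enables GPUs to solve complex computational problems. It includes the CUDA instruction set architecture (ISA) and the parallel compute engine inside the GPU. Developers can write programs for the CUDA™ architecture in C.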
cengci_bingxing
- A parallel improvement of a hierarchical clustering algorithm, programmed with MPI; the triangular matrix computation is parallelized.
summa
- Parallel implementation of the SUMMA matrix multiplication algorithm, including a demonstration of experimental results.
my_ludep
- Parallel implementation of matrix LU decomposition; the accompanying document gives experimental results for different matrix sizes and processor counts.
SWAP
- A parallel matrix swap program provided by NVIDIA; the platform is OpenCL, and the code is commented in detail.
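
Note on the spmv_csr entry: the CSR representation it mentions stores only the nonzero values, their column indices, and one offset per row. The following is a minimal serial sketch of sparse matrix-vector multiplication over that layout, written in Python for readability. It is not taken from the listed package (which is a CUDA implementation); the function name `csr_spmv` and the array names `values`, `col_idx`, and `row_ptr` are assumptions chosen for illustration.

```python
def csr_spmv(values, col_idx, row_ptr, x):
    """Compute y = A @ x for a matrix A stored in CSR form.

    values  : nonzero entries of A, row by row
    col_idx : column index of each entry in `values`
    row_ptr : row_ptr[i]..row_ptr[i+1] is the slice of `values` for row i
    (All names are illustrative, not from the listed package.)
    """
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        # Each row is independent of the others, which is what a
        # GPU version can exploit by assigning rows to threads.
        for k in range(row_ptr[row], row_ptr[row + 1]):
            y[row] += values[k] * x[col_idx[k]]
    return y


# A = [[4, 0, 0],
#      [0, 0, 2],
#      [1, 3, 0]]
values = [4.0, 2.0, 1.0, 3.0]
col_idx = [0, 2, 0, 1]
row_ptr = [0, 1, 2, 4]
print(csr_spmv(values, col_idx, row_ptr, [1.0, 2.0, 3.0]))  # [4.0, 6.0, 7.0]
```

Rows with very different nonzero counts give the inner loop very different lengths, which is consistent with the entry's remark that ELLPACK suits matrices whose rows have a similar number of nonzeros.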