CLBLAST是一个现代的、轻量级的、性能良好的、可调的OpenCL BLAS库,用C++ 11编写。它旨在充分利用来自不同供应商的各种OpenCL设备的全部性能潜力,包括台式机和笔记本电脑gpu、嵌入式gpu和其他加速器。CLBlast实现BLAS例程:在向量和矩阵上操作的基本线性代数子程序。有关各种设备的性能报告以及最新的CLBlast新闻,请访问CLBlast网站。
CLBlast is a modern, lightweight, performant and tunable OpenCL BLAS library written in C++11. It is designed to leverage the full performance potential of a wide variety of OpenCL devices from different vendors, including desktop and laptop GPUs, embedded GPUs, and other accelerators. CLBlast implements BLAS routines: basic linear algebra subprograms operating on vectors and matrices. See the CLBlast website for performance reports on various devices as well as the latest CLBlast news. (2020-07-21, C/C++, 893KB, 下载1次)
阿贡国家实验室并行计算中心对于mpi,openmp的介绍和例程,简单易学,是并行计算入门的最佳资料。
The best introducing materials for parallel computing (mpi, openmp). (2011-11-21, C/C++, 5530KB, 下载69次)