用CUDA编写的GPU和高性能计算程序的集合,用于CS89.25 189.3,这是达特茅斯大学的研究生级CS课程。,
A collection of GPU and high-performance computing programs written in CUDA for CS89.25 189.3, a graduate-level CS course at Dartmouth., (2020-06-18, Cuda, 0KB, 下载0次)
固有顺序算法的并行实现黑客。随机数生成器-加法LFG和GFSR-使用连续子序列技术和蛙跳技术用NVIDIA CUDA实现。在以色列特拉维夫2022年国际人工智能会议上提交的论文。
Parallel implementation hack of inherently sequential algorithms. Random Number Generators - Additive LFG and GFSR - implemented with NVIDIA CUDA using Continuous Subsequence Technique and Leap Frog Technique. Paper presented in the International AI Conference 2022, Tel Aviv, Israel. (2022-10-24, Cuda, 0KB, 下载0次)
威斯康星大学麦迪逊分校CS 759高性能计算课程的最终项目
Final project for CS 759 High Performance Computing Class taken in UW-Madison (2017-12-24, Cuda, 3889KB, 下载0次)
基于cublasHgemm的特斯拉P100和V100 GPU上测试原生float16矩阵乘法性能的代码
Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm (2019-08-20, Cuda, 6KB, 下载0次)