GPU-based implementation of a Cholesky-decomposition-based linear solver using CUDA C++, Thrust, and cuBLAS, also featuring Eigen for the … (2022-01-12, CUDA, 0 KB, 0 downloads)
Artifact for OSDI '23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… (2024-01-13, CUDA, 0 KB, 0 downloads)
Gallatin is a general-purpose memory manager for CUDA that allows threads to quickly malloc and free memory of arbitrary size inside kernels. (2023-12-02, CUDA, 0 KB, 0 downloads)
A simple basic-illumination ray tracer written in C/CUDA. Spheres, cylinders, cones, planes, and triangles are implemented, along with a checkered texture; all colors are defined with adjustable specular values. Still working on making mirrors work on the GPU. (2023-10-15, CUDA, 0 KB, 0 downloads)
Implementation of a forward Runge-Kutta 4 solver and a backward adjoint gradient method in CUDA. (2023-07-25, CUDA, 0 KB, 0 downloads)
Distributed MPI-based heterogeneous GPU solver for Markov Decision Processes (MDP). (2019-10-17, CUDA, 0 KB, 0 downloads)
Transferring a small chunk of data from the Torch model to CUDA memory for re-initializing elements, then returning it. (2021-11-02, CUDA, 0 KB, 0 downloads)
A tool for the evolutionary design of image filters that improve the quality of damaged images, using Cartesian Genetic Programming. (2015-04-29, CUDA, 3167 KB, 0 downloads)
A simple 2D convolution written in CUDA C that uses shared memory for better performance. (2018-04-12, CUDA, 1 KB, 0 downloads)
Graphiler is a compiler stack built on top of DGL and TorchScript that compiles GNNs defined using user-defined functions (UDFs) into efficient execution plans. (2022-10-03, CUDA, 322 KB, 0 downloads)
A parallel Sudoku solver based on CUDA C. Project for NYU CSCI-GA 3033-004 Graphics Processing Units (GPUs): Architecture and Programming. (2019-03-25, CUDA, 191 KB, 0 downloads)
A smoothed-particle hydrodynamics (SPH) solver that uses NVIDIA's CUDA API for hardware-accelerated parallel processing on the GPU. (2020-07-12, CUDA, 2354 KB, 0 downloads)
culayers is a C++ wrapper over CUDA for easily deploying applications that run on CUDA without having to handle GPU management. (2020-03-18, CUDA, 36 KB, 0 downloads)
A simple GPU-accelerated particle simulator, based on CUDA, that simulates the movement of particles. The computational domain is split between the GPU and the CPU to share the workload. (2020-11-15, CUDA, 25 KB, 0 downloads)
A simple ray tracer written with CUDA that saves its output in a .ppm file; a CPU version is included for reference. (2023-01-04, CUDA, 661 KB, 0 downloads)
Bachelor's project: Implementation of a Digital Down Converter (DDC) on NVIDIA GP-GPU at the Giant Metrewave Radio Telescope (GMRT), India. (2021-03-27, CUDA, 6 KB, 0 downloads)
Record the GPU memory accesses of a CUDA program and visualize the access pattern in a browser. (2020-11-17, CUDA, 20677 KB, 0 downloads)
Mars is a MapReduce framework on graphics processors (GPUs). My research project at HKUST, 2008-2010. (2016-09-28, CUDA, 454 KB, 0 downloads)
Combined solution from Matter Labs and Yrrid, based on their respective submissions for the Z-Prize category "Accelerating MSM Operations on GPU/FPGA". (2023-02-20, CUDA, 72 KB, 0 downloads)
svox_t: a differentiable dynamic feature-level octree and renderer implementation as a PyTorch CUDA extension for ARTEMIS. (2022-09-09, CUDA, 78 KB, 0 downloads)