SystemVerilog implementation of Nvidia's SIMT CUDA, Hybrid-Precision Tensor Core, and Google's Systolic Array TPU MXU GEMM Operations. These modules are by no means really emulating the actual microarchitecture executing CUDA/Tensor Core instructions, instead they're simply performing the same operation for direct usage in FPGA designs.
-
Notifications
You must be signed in to change notification settings - Fork 1
SystemVerilog Implementation of Nvidia's CUDA/Tensor Core GEMM Operations
License
NikhilRout/TheGEMMCoreProject
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
SystemVerilog Implementation of Nvidia's CUDA/Tensor Core GEMM Operations
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published