Quick Explainer: GPU Programming with CUDA and Triton
What is CUDA and Triton? CUDA (Compute Unified Device Architecture) was introduced by NVIDIA to allow developers like us to directly program the GPUs. CUDA provides a low level C/C++ API for writing programs that execute on the GPU. We wrap the code to be executed on the GPU inside a function, this function is called a kernel. Not every machine learning person is an expert in using low-level programming languages supported by CUDA....