Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples
Abstract
CUDA Threads, Tools, Simple Examples
Topics:
- A Running Example of Matrix Multiplication
- Memory Layout of a Matrix in C
- Compiling a CUDA Program
- Device Emulation Mode Pitfalls
- Floating Point
- CUDA Threads
- Matrix Multiplication Using Multiple Blocks
- Transparent Scalability
Credits
These lectures were breezed by Carl Pearson and Daniel Borup and then reviewed, edited, and uploaded by Omar Sobh.
Sponsored by
NCN@illinois
Cite this work
Researchers should cite this work as follows: