Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples
Recommendations
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 4: CUDA Threads - Part 2
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 5: CUDA Memories
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 2: The CUDA Programming Model
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 1: Introduction
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 6: CUDA Memories - Part 2
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 1: The CUDA Programming Model
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 8: Threading Hardware in G80
- Illinois ECE 498AL: Programming Massively Parallel Processors
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 12: Structuring Parallel Algorithms
- Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 10: Control Flow
Abstract
CUDA Threads, Tools, Simple Examples
Topics:
- A Running Example of Matrix Multiplication
- Memory Layout of a Matrix in C
- Compiling a CUDA Program
- Device Emulation Mode Pitfalls
- Floating Point
- CUDA Threads
- Matrix Multiplication Using Multiple Blocks
- Transparent Scalability
Credits
These lectures were breezed by Carl Pearson and Daniel Borup and then reviewed, edited, and uploaded by Omar Sobh.
Sponsored by
NCN@illinois
Cite this work
Researchers should cite this work as follows:
- Wen-Mei W Hwu (2009), "Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples," https://nanohub.org/resources/7232.