Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples

By Wen-Mei W Hwu

University of Illinois at Urbana-Champaign


CUDA Threads, Tools, Simple Examples


  • A Running Example of Matrix Multiplication
  • Memory Layout of a Matrix in C
  • Compiling a CUDA Program
  • Device Emulation Mode Pitfalls
  • Floating Point
  • CUDA Threads
  • Matrix Multiplication Using Multiple Blocks
  • Transparent Scalability


These lectures were breezed by Carl Pearson and Daniel Borup and then reviewed, edited, and uploaded by Omar Sobh.

