Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples

By Wen-Mei W Hwu

University of Illinois at Urbana-Champaign

Published on

Abstract

CUDA Threads, Tools, Simple Examples

Topics:

  • A Running example of Matrix Multiplication
  • Memory Layout of a Matrix in C
  • Compiling a CUDA Program
  • Device Emulation Mode Pitfalls
  • Floating Point
  • CUDA Threads
  • MAtrix Multiplication Using Multiple Blocks
  • Transparent Scalability

Credits

These lecture were breezed by Carl Pearson and Daniel Borup and then reviewed, edited ,and Uploaded by Omar Sobh.

Sponsored by

NCN@illinois

Cite this work

Researchers should cite this work as follows:

  • Wen-Mei W Hwu (2009), "Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples," http://nanohub.org/resources/7232.

    BibTex | EndNote

Tags