-
Mathematica for CUDA and OpenCL Programming
07 Mar 2011 | Online Presentations | Contributor(s): Abdul Dakkak
In the latest release of Mathematica 8, a large number of programming tools for GPU computing are available. In this presentation, new tools for CUDA and OpenCL programming will be explored. Several …
http://nanohub.org/resources/10940
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 14: Application Case Study - Quantitative MRI Reconstruction
30 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Quantitative MRI Reconstruction Topics: Reconstructing MR Images An exciting revolution: Sodium Map of the Brain Least Squares reconstruction Q vs. FhD Algorithms to Accelerate From C …
http://nanohub.org/resources/7406
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 15: Kernel and Algorithm Patterns for CUDA
30 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Kernel and Algorithm Patterns for CUDA Topics: Reductions and Memory Patterns Reduction Patterns in CUDA Mapping Data into CUDA's Memories Input/Output Convolution Generic Algorithm …
http://nanohub.org/resources/7442
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 13: Reductions and their Implementation
15 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Reductions and their Implementation Topics: Parallel Reductions Parallel Prefix Sum Relevance of Scan Application of Scan Scan on the CPU First attempt Parallel Scan Algorithm Work …
http://nanohub.org/resources/7376
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 12: Structuring Parallel Algorithms
15 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Structuring Parallel Algorithms Topics: Key Parallel Programming Steps Algorithms Choosing Algorithm Structure Mapping a Divide and Conquer algorithm Tiled Algorithms Increased work …
http://nanohub.org/resources/7372
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 11: Floating Point Considerations
15 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Floating Point Considerations Topics: GPU Floating Point Features Normalized Representation Exponent Representation Representable Numbers Flush to Zero Denormalization Runtime Math …
http://nanohub.org/resources/7338
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 10: Control Flow
01 Sep 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Control Flow Topics: Terminology Review How Thread Blocks are Partitioned Control Flow Instructions Parallel Reduction A Vector Reduction Example A simple Implementation Vector …
http://nanohub.org/resources/7304
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 7: GPU as part of the PC Architecture
30 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
GPU as part of the PC Architecture Topics: Typical Structure of a CUDA Program Bandwidth: Gravity of Modern Computer Systems (Original) PCI Bus Specification PCI as Memory Mapped I/O PCI …
http://nanohub.org/resources/7266
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 8: Threading Hardware in G80
30 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Threading Hardware in G80 Topics: Single Program Multiple Data (SPMD) Grids and Blocks CUDA Thread Block: Review GeForce-8 Series Hardware Overview CUDA Processor Terminology Stream …
http://nanohub.org/resources/7272
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 9: Memory Hardware in G80
30 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Memory Hardware in G80 Topics: CUDA Device Memory Space Parallel Memory Sharing SM Memory Architecture SM Register File Programmer view of Register File Matrix Multiplication Example …
http://nanohub.org/resources/7277
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 6: CUDA Memories - Part 2
20 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
CUDA Memories Part 2 Topics: Tiled Multiply Breaking Md and Nd into Tiles Tiled Matrix Multiplication Kernel CUDA Code - Kernel Execution Configuration First Order Size considerations in …
http://nanohub.org/resources/7247
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 5: CUDA Memories
20 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
CUDA Memories Topics: G80 Implementation of CUDA Memories CUDA Variable Type Qualifiers Where to Declare Variables Variable Type Restrictions A Common Programming Strategy GPU Atomic …
http://nanohub.org/resources/7243
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 4: CUDA Threads - Part 2
14 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
CUDA Threads Part 2 Topics: CUDA Thread Block Transparent Scalability G80 CUDA Mode, A Review Executing Thread Blocks Thread Scheduling Block Granularity Considerations More Details …
http://nanohub.org/resources/7236
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 3: CUDA Threads, Tools, Simple Examples
14 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
CUDA Threads, Tools, Simple Examples Topics: A Running example of Matrix Multiplication Memory Layout of a Matrix in C Compiling a CUDA Program Device Emulation Mode Pitfalls Floating …
http://nanohub.org/resources/7232
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 2: The CUDA Programming Model
14 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
CUDA Programming Model Topics: What is GPGPU? CUDA An Example of Physical Reality Behind CUDA Parallel computing on a GPU CUDA - C With no shader limitations CUDA Devices and Threads …
http://nanohub.org/resources/7206
-
Illinois ECE 498AL: Programming Massively Parallel Processors, Lecture 1: Introduction
14 Aug 2009 | Online Presentations | Contributor(s): Wen-Mei W Hwu
Programming Massively Parallel Processors Topics: Introduction, Grading, Outline Lab Equipment UIUC/NCSA QP Cluster UIUC/NCSA AP Cluster ECE498AL Development History Why Program …
http://nanohub.org/resources/7226
-
Illinois ECE 498AL: Programming Massively Parallel Processors
11 Aug 2009 | Courses | Contributor(s): Wen-Mei W Hwu
Spring 2009 Virtually all semiconductor market domains, including PCs, game consoles, mobile handsets, servers, supercomputers, and networks, are converging to concurrent platforms. There are two …
http://nanohub.org/resources/7225