Instructor: John Reppy (Ryerson 256)
Lecture: Mondays, 3-5pm (Ry 255)
SS-106
The focus of this seminar is high-level languages and models for programming GPUs. We will begin by looking at the architectural features that make GPUs both very fast and very difficult to program. With that background in place, we will read and discuss recent (and some not-so-recent) papers on languages and models for GPUs.
Note: The seminar was originally scheduled to meet twice a week, but we will instead meet once a week for two hours. The new meeting time and location are Mondays, 3pm to 5pm, in Ry 255.
For week 2, please read the paper Parallel Prefix Sum (Scan) with CUDA, which appeared as Chapter 39 of GPU Gems 3.
For week 3, we will look at an approach for handling tree traversals in GPU programs that has been developed by researchers at Purdue. There are two papers:
More discussion of the techniques from last week, plus one additional paper:
This week we will look at ray tracing on GPUs and the use of persistent threads as an implementation technique. There are several papers:
Understanding the Efficiency of Ray Traversal on GPUs (Proceedings of High Performance Graphics 2009). This paper describes the difficulties with implementing ray tracing on a GPU and possible solutions, including the use of persistent threads.
GPU Ray Tracing (CACM 2013). This paper describes the OptiX system from NVIDIA; the original version was presented at SIGGRAPH 2010.
A Study of Persistent Threads Style GPU Programming for GPGPU Workloads (Innovative Parallel Computing 2012).
This week we will look at a couple of low-level languages that have been designed for GPU programming.
HiDP: A Hierarchical Data Parallel Language (CGO 2013).
NOVA: A Functional Language for Data Parallelism (ARRAY '14).
Size Slicing: A Hybrid Approach to Size Inference in Futhark (FHPC '14).
This week we will look at several papers on flattening nested data parallelism.
This week we will look at more papers on flattening nested data parallelism:
Flattening Trees (EuroPar 1998).
On the Distributed Implementation of Aggregate Data Structures by Program Transformation (11th IPPS/SPDP 1999).
Nepal – Nested Data Parallelism in Haskell (EuroPar 2001).
This week we will finish up our discussion of flattening and then look at some papers on piecewise execution of NDP programs.
This week we will look at some optimization techniques for NDP.
Vectorisation Avoidance (Haskell Workshop 2012).
Fusing Filters with Integer Linear Programming (FHPC 2014).