SCHEDULE: NOV 16-22, 2013

Debugging and Optimizing MPI and OpenMP Applications Running on CUDA, OpenACC®, and Intel® Xeon Phi Coprocessors with TotalView®

SESSION: Debugging and Optimizing MPI and OpenMP Applications Running on CUDA, OpenACC®, and Intel® Xeon Phi Coprocessors with TotalView®

EVENT TYPE: Tutorials

TIME: 8:30AM - 5:00PM

Presenter(s): Mike Ashworth, Vince Betro, Sandra Wienke, Nikolay Piskun, Chris Gottbrath

ROOM: 405

ABSTRACT:
With High-Performance Computing trends heading towards increasingly heterogeneous solutions, scientific developers face challenges adapting software to leverage these new systems. For instance, many systems feature nodes that couple multi-core processors with GPU-based computational accelerators, like the NVIDIA® Kepler, or many-core coprocessors, like the Intel® Xeon Phi coprocessor. In order to utilize these systems, scientific programmers need to leverage as much parallelism in applications as possible. Developers also need to juggle technologies including MPI, OpenMP, CUDA, and OpenACC. While troubleshooting, debugging, and optimizing applications are an expected part of porting, they become even more critical with the introduction of so many technologies. This tutorial provides an introduction to parallel debugging and optimization. Debugging techniques covered include: MPI and subset debugging, process and thread sets, reverse and comparative debugging, and techniques for CUDA, OpenACC, and Intel Xeon Phi coprocessor debugging. Participants will have the opportunity to do hands-on CUDA and Intel Xeon Phi coprocessor debugging using TotalView on a cluster at RWTH Aachen University and on Keeneland and Beacon at NICS. Therefore, it is recommended that participants bring a network-capable laptop to the session. Optimization techniques will include profiling, tracing, and cache memory optimization. Examples will use ThreadSpotter and vendor-supplied tools.

Chair/Presenter Details:

Mike Ashworth - Science and Technology Facilities Council

Vince Betro - National Institute for Computational Sciences

Sandra Wienke - RWTH Aachen University

Nikolay Piskun - Rogue Wave Software

Chris Gottbrath - Rogue Wave Software
