SCHEDULE: NOV 16-22, 2013
When viewing the Technical Program schedule, on the far righthand side is a column labeled "PLANNER." Use this planner to build your own schedule. Once you select an event and want to add it to your personal schedule, just click on the calendar icon of your choice (outlook calendar, ical calendar or google calendar) and that event will be stored there. As you select events in this manner, you will have your own schedule to guide you through the week.
SESSION: FTI: Advanced Application-level Checkpointing Library for HPC Applications
EVENT TYPE: Emerging Technologies
TIME: 3:10PM - 3:30PM
Presenter(s):Leonardo Gomez
ROOM:Booth 3947
ABSTRACT:
High Performance Computing is changing the way scientists make discoveries. Large scale simulations performed in supercomputers are allowing researches of all domains to better understand and study complex natural phenomena. Extreme scale computing promises great opportunities to the entire scientific community, motivating the design and development of always larger supercomputers. However, at extreme scale, component failures start compromising the usability of these systems. Multiple techniques can be used to guarantee the successful completion of the simulation, checkpoint/restart being the most popular of them. This method has been useful for several decades, but it is starting to show some limitations: while the computational power of supercomputers has been increasing exponentially, the I/O system has been growing linearly. This causes a bottleneck while writing large amounts of data to the File System. Fault Tolerance Interface (FTI) is a library that aims to provide researchers easy and scalable multilevel checkpointing.
Chair/Presenter Details:
Leonardo Gomez - Argonne National Laboratory
Click here to download .ics calendar file
