Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Main Topics

Schedule

Speakers

Types of presentation

Titles (tentative)

 

 

 

 

 

Workshop Day 1 (Auditorium)

Monday Nov. 22cd

 


 

Welcome and Introduction

08:30

Franck Cappello, INRIA & UIUC, France and Thom dunning, NCSA, USA

Background

Workshop details

Post PetaScale and Exascale Systems 

08:45

Mitsuhisa Sato, U. Tsukuba, Japan

Trends in HPC

Challenges on Programming Models and Languages for Post-Petascale Computing -- from Japanese NGS project "The K computer" to Exascale computing --

 

09:15

Marc Snir, UIUC, USA

Trends in HPC

Toward Exascale

 

09:45

Wen Mei Wu, UIUC, USA

Trends in HPC


Extreme-Scale Heterogeneous Computing

 

10:15

Arun Rodrigues, Sandia, USA

Trends in HPC

The UHPC X-Caliber Project

 

10:45

Break

 

 

Post Petascale Applications  and System Software

11:15

Pete Beckman, ANL, USA

Trends in HPC

Exascale Sofware Center

 

11:45

Michael Norman, SDSC, USA

Trends in HPC

Extreme Scale AMR for Hydrodynamic Cosmology

 

12:15

Eric Bohm, UIUC, USA

Trends in HPC

NAMD

 

12:30

Lunch

 

 

 

 

 

 

 

 

 

 

 

 

BLUE WATERS

14:00

Bill Kramer, NCSA, USA

Overview

Blue Waters: A Super-System to Explore the Expanse and Depth of 21st Century Science

Collaborations on System Software

14:30

Ana Gainaru, NCSA, USA

Early Results

Framework for Event Log Analysis in HPC

 

15:00

Thomas Ropars, INRIA, France

Results

Latest Progresses on Rollback-Recovery Protocols for Send-Deterministic Applications

 

15:30

Esteban Menese, UIUC, USA

Early Results

Clustering Message Passing Applications to Enhance Fault Tolerance Protocols

 

16:00

Break

 

 

Collaborations on System Software

16:30

Leonardo Bautista, Titech, Japan

Results/International collaboration with Japan

Transparent low-overhead checkpoint for GPU-accelerated clusters

 

17:00

Gabriel Antoniu, INRIA/IRISA, France

Results

Concurrency-optimized I/O for visualizing HPC simulations: An Approach Using Dedicated I/O cores

 

17:30

Mathias Jacquelin, INRIA/ENS Lyon

Results

Comparing archival policies for BlueWaters

 

18:00

Olivier Richard, Joseph Emeras, INRIA/U. Grenoble, France

Early Results

Studying the RJMS, applications and File System triptych: a first step toward experimental approach

 

 

 

 

 

Workshop Day 2 (Auditorium)

Tuesday Nov. 23rd

 

 

 

 

 

 

 

 

Collaborations on System Software

08:30

Torsten Hoefler, NCSA, USA

Potential collaboration

Application Performance Modeling on Petascale and Beyond

 

09:00

Frederic Viven, INRIA/ENS Lyon, France

Potential collaboration

On Scheduling Checkpoints of Exascale Application

Collaborations on Programming models

09:30

Thierry Gautier

Early Results

TBA

 

10:00

Jean François Méhaut, INRIA/U. Grenoble, France

Early Results

Charm++ on NUMA Platforms: the impact of SMP Optimizations and a NUMA-aware Load Balancing

 

10:30

Break

 

 

 

11:00

Raymon Namyst, INRIA/U. Bordeaux, France

Early Results

TBA

 

11:30

Brian Amedo, INRIA/U. Nice, France

Potential collaboration

Improving asynchrony in an Active Object model

 

12:00

Christian Perez, INRIA/ENS Lyon, France

Early Results

High Performance Component with Charm++ and OpenAtom

 

12:30

Lunch

 

 

Collaborations on Numerical Algorithms and Libraries

14:00

Luke Olson, Bill Gropp, UIUC, USA

Early Results

On the status of algebraic (multigrid) preconditioners

 

14:30

Simplice Donfac, INRIA/U. Paris Sud, France

Early Results

Improving data locality in communication avoiding LU and QR factorizations

 

15:00

Desiré Nuentsa, INRIA/IRISA, France

Early Results

Parallel Implementation of deflated GMRES in the PETSc package

 

15:30

Sebastien Fourestier, INRIA/U. Bordeaux, France

Early Results

Graph repartitioning with Scotch and other on going work

 

16:00

Break

 

 

 

16:30

Marc Baboulin, INRIA, U. Paris Sud, France

Early Results

Accelerating linear algebra computations with hybrid GPU-multicore systems

 

17:00

Daisuke Takahashi, U. Tsukuba, Japan

Results/International collaboration with Japan

Optimization of a Parallel 3-D FFT with 2-D Decomposition

 

17:30

Alex Yee, UIUC, USA

Early Results

A Single-Transpose implementation of the Distributed out-of-order 3D-FFT

 

17:50

Jeongnim Kim, NCSA, USA

Early Results

Toward petaflop 3D FFT on clusters of SMP

 

 

 

 

 

 

 

 

 

 

Workshop Day 3 (Auditorium)

Wednesday Nov 24th

 

 

 

 

 

 

 

 

Break out sessions introduction

8:30

Cappello, Snir

Overview

Objectives of Break-out, expected results
Collaborations mechanisms (internship, visits, etc.)

Topics

 

Participants

Other NCSA participants

 

Break out session 1

9:00-10:30

 

 

 

Routing, topology mapping, scheduling, perf. modeling

 

Snir, Hoefler, Vivien, Gautier, Jeannot, Kale

 

Room

3D-FFT

 

Cappello, Takahashi, Yee, Jeongnim

 

Room

Libraries

 

Gropp, Baboulin, Désiré, Simplice, Sébastien, Fourestier

 

Room

 

 

 

 

 

 

10:15

Break

 

 

Break out session 2

10:30-12:00

 

 

 

Resilience

 

Kramer, Cappello, Gainaru, Ropars, Menese, Beautista,

 

Room

Programing models / GPU

 

Kale, Méhaut, Namyst, Wu, Amedro, Perez, Hoefler, Jeannot

 

Room

I/O

 

Snir, Viven, Jaquelin, Antoniu, Richard

 

 

Break out session report

12:00

Speakers: Snir, Cappello, Gropp, Kramer, Kale

 

Auditorium

Closing

12:30

Cappello, Snir

 

Auditorium

 

13:00

Lunch

 

 

...

In a High Performance Computing infrastructure, it is particularly difficult to master the architecture as a whole. With the physical infrastructure, the platform management software and the users' applications, understanding the global behavior and diagnosing problems is quite challenging. And it is even more true in a petascale context with thousands of compute nodes to manage and a high occupation rate of the resources. A global study of the platform will thus consider the Resource and Job Management System (RJMS), the File System and the Applications triptych as a whole. Studying their behavior is complicated because it means having some knowledge of the applications requirements in terms of physical resources and access to the File System. In this presentation, we propose a first step toward an experimental approach that mix the use of Jobs Workloads patterns and File System access patterns that, once combined, will give a full set of jobs behaviors. These synthetic jobs will then be used to test and benchmark infrastructure, considering the RJMS and the File System.

Anchor
Torsten_A
Torsten_A

Torsten Hoefler, NCSA

Application Performance Modeling on Petascale and Beyond

Performance modeling of parallel application is gaining more importance. It can not only help to predict scalability and find performance bottlenecks but it can also help to understand trade-offs in the design space of computing systems and drive hardware-software co-design of future computing systems. We will discuss established performance modeling techniques and propose a mixed approach to analytic application performance modeling. We then discuss open problems and possible future research directions.

Anchor
Vivien_A
Vivien_A

Frédéric Viven, INRIA/ENS Lyon

...