...
Main Topics | Schedule | Speakers | Types of presentation | Topic | Download |
Sunday Nov. 18th | Dinner | Giordano's | http://www.giordanos.com/ |
| |
Workshop Day 1 (Room 1416, TCS conference center) | Monday Nov. 19th |
|
|
| |
| 07:30-8:30 | Transportation: Guest House to TCS (building 240) |
| (Entrance of the conference center) |
|
| 08:00 | Contiental Breakfast and Registration |
| Food available in Room 1407, Lunch seating in room 1416 (second half) |
|
Welcome and Introduction | 08:30 | Franck Cappello, INRIA & UIUC, Marc Snir ANL | Opening | Welcome, formal opening and workshop details |
|
| 08:40 | Marc Snir | Opening | ANL presentation and vision of the collaboration |
|
| 08:50 | Bill Gropp | Opening | UIUC/NCSA update and vision of the collaboation |
|
| 09:00 | Frederic Desprez | Opening | INRIA update on HPC strategy and vision of the collaboration |
|
Big Apps, Big DATA - Big I/O | 09:15 | Robert Jacob | Trends in HPC | Climate simulation at extreme scale | |
| 09:45 | Rob Ross, ANL | Trends in HPC | Trends in HPC I/O and File systems |
|
| 10:15 | Break |
|
|
|
| 10:45 | Rob Pennington, NCSA | Trends in HPC | Big Data | |
| 11:15 | Andrew Chien, ANL | Potential collaboration | Presto/Blockus: Towards a Scalable R Programming System | |
| 11:45 | Matthieu Dorier, INRIA | Joint Results | I/O and in-situ visualization: recent results with the Damaris approach | |
| 12:15 | Lunch |
|
|
|
Programming Models/Runtime chair: Sanjay Kale | 13:30 | Wen-Mei Hwu, UIUC | TBA | Scalability, Performance, and Numerical Stability of Many-core GPU Algorithms - A Case Study of Tri-diagonal Solvers | |
| 14:00 | Pavan Balaji, ANL | Potential collaboration | MPI3 and Unified Runtime | |
| 14:30 | Andra Hugo, Raymond Namyst, INRIA | Potential collaboration | Composing multiple StarPU applications over heterogeneous machines: a supervised approach | |
| 15:00 | Jean-François Mehaut, INRIA | Potential collaboration | Optimizations for modern NUMA |
|
| 15:30 | Break |
|
|
|
Numerical algorithms and Methods | 16:00 | Barry Smith, ANL | Trend | Performance Issues in DOE PDE Simulations | |
16:30 | Laura Grigori | Results | Communication avoiding | ||
| 17:00 | Bill Gropp, UIUC | Results | Hybrid Scheduling | |
| 17:30 | Laurent Hascoet, INRIA | Early Results | The Data-Dependence graph of Adjoint Codes | |
18:00 | Adjourn |
| |||
19:00 | Dinner | Jameson's |
| ||
|
|
|
|
|
|
Workshop Day 2 (Main room) | Tuesday Nov. 20th |
|
|
|
|
|
|
|
|
|
|
Big Systems | 08:30 | Pete Beckman, ANL | Trends | New Directions in Extreme-Scale Operating Systems and Runtime Software |
|
| 09:00 | Bill Kramer, UIUC/NCSA | Trends | Blue Waters update |
|
Cloud | 09:30 | Ian Foster, ANL | Potential collaboration TBA | Big Process for Big Data | |
| 10:00 | Christine Morin, INRIA | Potential collaboration | Contrial | |
| 10:30 | Break |
|
|
|
11:00 | Frederic Desprez, INRIA | Potential collaboration | TBA | ||
Resilience: | 11:30 | Yves Robert | Early Result | Performance modeling of checkpointing under failure prediction | |
| 12:00 | Rinku Gupta, ANL | Potential collaboration | CIFTS: An infrastructure for coordinated and comprehensive system-wide fault tolerance. |
|
| 12:30 | Ana Gainaru, UIUC | Early Results | Coupling failure prediction, proactive and preventive checkpoint for current production HPC systems. |
|
| 13:00 | Lunch |
| Food buffet in Room 1407, Lunch seating in room 1416 (second half) |
|
|
|
|
| Parallel Session |
|
Mini workshop on Numerical libraries | 8:30 | Stefan Wild, ANL | Potential collaboration | Numerical optimization for "automatic" tuning of codes | |
| 09:00 | Bill Gropp, UIUC | Potential collaboration | TBA | |
| 09:30 | Laura Grigori, INRIA | Potential collaboration | TBA | |
| 10:00 | Break | TBA | ||
| 10:30 | Anshu Dubey, ANL | Potential collaboration | Optimizing Scientific Codes While Retaining Portability |
|
| 11:00 | Discussion |
|
|
|
| 12:00 | Adjourn |
|
|
|
| 13:00 | Lunch |
|
|
|
|
|
|
| Parallel Sessions |
|
Mini workshop on Performance Modeling and simulation | 14:30 | Sanjay Kale, UIUC | Early Results | BIG SIM |
|
| 15:00 | Arnaud Legrand, INRIA |
| SimGrid for HPC |
|
| 15:30 | Torsten Hoefler, ETH | Early Results | TBA |
|
| 16:00 | Break |
|
|
|
| 16:30 | Timo Schneider, ETH | Early Results | TBA |
|
| 17:00 | Discussion |
|
|
|
| 18:00 | Adjourn |
|
|
|
| 19:00 | Dinner | Meggaiano's | [http://www.maggianos.com/EN/Oak-Brook_Oak-Brook_IL/Pages/LocationLanding.aspx?AspxAutoDetectCookieSupport=1 |
|
|
|
|
|
|
|
Mini workshop on Cloud | 14:30 | Kate Keahey, ANL | Potential collaboration | TBA |
|
| 15:00 | Narayan Deai, ANL | Potential collaboration | TBA |
|
| 15:30 | Jonathan Rouzaud, INRIA | Potential collaboration | Provisioning Virtual Machines in Federated Clouds |
|
| 16:00 | Break |
|
|
|
| 16:30 | Michael Wilde | Potential collaboration | Swift: simpler parallel programming for cloud and HPC domains http://www.ci.uchicago.edu/swift (Swift for clouds and clusters) |
|
| 17:00 | Discussion |
|
|
|
| 18:00 | Adjourn |
|
|
|
| 19:00 | Dinner | Meggaiano's | [http://www.maggianos.com/EN/Oak-Brook_Oak-Brook_IL/Pages/LocationLanding.aspx?AspxAutoDetectCookieSupport=1 |
|
|
|
|
|
|
|
Workshop Day 3 (Main room) | Wednesday Nov 21st |
|
|
|
|
|
|
|
| Parallel Sessions |
|
Mini workshop on Programming models/runtime | 08:30 | Emmanuel Jeannot, INRIA | Results | TBA |
|
09:00 | Sanjay Kale, UIUC | Charm++ update |
| ||
09:30 | Christian Perez, INRIA |
| TBA |
| |
10:00 | Break |
|
| ||
10:30 | Jim Dinan |
| A One-Sided View of HPC: Global-View Models and Portable Runtime Systems |
| |
11:00 | Sebastien Fourestier | Potential collaboration | Parallel repartitioning and re-mapping in Scotch |
| |
| 11:30 | Discussion |
|
|
|
| 12:30 | Closing |
|
|
|
| 13:00 | Lunch |
|
|
|
|
|
|
|
|
|
Mini workshop on Resilience | 08:30 | Mohamed Slim Bouguerra | Result | TBA |
|
| 09:00 | Amina Guermouche, INRIA | Result | Unified Model for Assessing Checkpointing Protocols at Extreme-Scale |
|
| 09:30 | Bogdan Nicolae, IBM | Result | I-Ckpt: Leveraging memory access patterns and inline collective deduplication to improve scalability of CR |
|
| 10:00 | Break |
|
|
|
| 10:30 | Tatiana Martsinkevich, INRIA | Result | Fully distributed recovery for send-determinism applications |
|
| 11:00 | Peter Brune, ANL | Trends | Multilevel Resiliency for PDE Simulations |
|
| 11:30 | Xiang Ni, Estaban Menese | Results | Scalable in-memory checkpoint with automatic restart on failure |
|
| 12:00 | Discussion |
|
| |
| 12:30 | Closing |
|
|
|
| 13:00 | Lunch |
| Boxe Lunches |
|
...
All aspects of HPC systems are undergoing change as we move into petascale and towards exascale computing. The traditional "I/O software stack" is no exception: the layers, capabilities, and abstractions in the stack are all in flux as we consider how to best support future HPC applications. This talk will discuss these developmental trends, using ongoing work at Argonne as examples of some directions of study.
Ian Foster
Big Process for Big Data
Large and diverse data result in challenging data management problems that researchers and facilities are often ill-equipped to handle. I propose a new approach to these problems based on the outsourcing of research data management tasks to software-as-a-service providers. I argue that this approach can both achieve significant economies of scale and accelerate discovery by allowing researchers to focus on research rather than mundane information technology tasks. I present early results with the approach in the context of Globus Online.
Andrew Chien
Presto/Blockus: Towards a Scalable R Programming System
...
This talk deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical analysis of Young and Daly in the presence of a fault prediction system, which is characterized by its recall and its precision, and which provides either exact or window-based time predictions. We succeed in deriving the optimal value of the checkpointing period (thereby minimizing the waste of resource usage due to checkpoint overhead) in all scenarios. These results allow to analytically assess the key parameters that impact the performance of fault predictors at very large scale. In addition, the results of this analytical evaluation are nicely corroborated by a comprehensive set of simulations, thereby demonstrating the validity of the model and the accuracy of the results.