Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Leonardo TBDANLG8 ECS and international collaboration toward extreme scale climate simulationLunch

Main Topics

Schedule

            Speaker

Affiliation

Type of presentation

Title (tentative)

Download

 

 

 

 

 

 

 

Sunday Nov. 24th
Dinner Before the Workshop

7:00 PM

(Departure from Hampton Inn at 6:45PM) with mini buses

Only people registered for the dinner

 

 

 

Restaurant:
Silvercreek:

Address: 402 N Race St, Urbana, IL 61801
Phone:(217) 328-3402

 

 

 

 

 

 

 

 

Workshop Day 1

Monday Nov. 25th

 

 

 

 

 

 

 

 

 

 

TITLES ARE TEMPORARY (except if in bold font)

 

Registration

08:00

 

 

 

 

 

Welcome and Introduction

Auditorium 1122

Chair: Franck Cappello

08:30

Marc Snir + Franck Cappello

Co-directors of the joint-lab

 

Background

Welcome, Workshop objectives and organization

 
Opening-10th-Workshop.pdf

 

08:45

Ed. Seidel

Incoming NCSA director

UIUC

Background

NCSA update and vision of the collaboration

(This address has been inverted with the next one due to schedule constraints)

 
 09:00

Peter Schiffer

UIUC Vice Chancellor for Research

UIUCBackgroundWelcome from UIUC Vice Chancellor for Research 

 

09:15

Michel Cosnard

Inria CEO and President

Inria

Background

INRIA updates and vision of the collaboration

 
HPC@Inria-UIUC-nov13-v2.pptx


09:30

Marc Snir

Director of Argonne/ MCS and co-director of the joint-lab

ANL

Background

Argonne updates and vision of the collaboration

 jlpc 11-13 snir.pdf
 09:45

Marc Daumas

Attaché for Science and Technology

Embassy of FranceBackgroundFrance-USA collaboration program updates http://prezi.com/hsggz_30xlqt/2013-jlpc-workshop-ncsa-uiuc-il/

 

9h55

Franck Cappello

Co-director of the Joint-lab

ANL

Background

Joint-Lab, PUF, New Joint-Lab, organization

 Joint-Lab-JLESC-PUF.pdf

 

10:15

Break

 

 

 

 

Extreme Scale Systems and infrastructures

Auditorium 1122

Chair: Yves RobertPavan Balaji

10:45

Pete Beckman

ANL

 

Extreme Scale Computing & Co-design Challenges

 

 

11:15

John Towns

UIUC

 

Applications Challenges in the XSEDE Environment

 XSEDE-Apps-Challenges-for-Joint-Lab.pdf
 11:45Gabriel AntoniuInria  A-Brain and Z-CloudFlow: Scalable Data Processing on Azure Clouds - Lessons Learned in Three Years and Future Directions 2013-11-25-JLPC-Azure-final.pdf

 

12:15

Lunch

 

 

 

 

Chair: Yves Robert

13:45

Bill Kramer

UIUC

Blue Waters

Is Petascale Completely Done?  What Should We Do Now?
 

 
Kramer JLPC November Workshop - v1.pdf


14:15

Torsten Hoefler

ETH

IEEE/ACM SC13 Best Paper

Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided

 hoefler-mpi3rma-slides.pdf

 

14:45

Rob Ross

ANL

 

Thinking Past POSIX: Persistent Storage in Extreme Scale Systems

ross_uiuc-storage-20131125.pdf
 15:15Break    
Chair: Bill Gropp15:1545François PellegriniInria Parallel repartitioning and remeshing : results and prospects 

pellegrini_scotch.pdf

pellegrini_pampa.pdf

 16:15:45BreakPavan Balaji ANL    16:15Pavan BalagiANL Message Message Passing in Massively Multithreaded Environments 2013-11-25-jlpc-threads-pavanbalaji.pptx
 16:45Wen Mei HwuUIUC 

 A New, Portable Algorithm Framework for Parallel Linear Recurrence Problems

 
UIUC_INRIA__Tangram_GPU_2013_Hwu.pdf
 17:15Adjourn    

 

18:45

Bus for Diner

 

 

Diner

(Departure from Hampton Inn at 6:45PM) with mini buses)


 

 

Restaurant:
Kamakura:

Address: 715 S Neil St, Champaign, IL 61820
Phone:(217) 351-9898
 

 

 

 

 

 

 

 

 

Workshop Day 2


Tuesday Nov. 26

 

 

 

 

 

Applications, I/O, Visualization, Big data

Auditorium 1122

Chair: Rob Ross

08:30

Greg BauerUIUC  Applications and their challenges on Blue Waters

 GBAUER-INRIA-NCSA-BW-2013.pdf

 

09:00

Matthieu Dorier

Inria

Joint-result, submitted

CALCioM: Mitigating I/O Interferences in HPC Systems through Cross-Application Coordination

 DORIER-JLPC-November2013.pdf
 

09:30

Dries KempeKimpe

ANL

 

Mercury: Enabling Remote Procedure Call for High-Performance Computing

dkimpe-mercury.pdf
 

10:00

Break

    

Chair: Gabriel Antoniu

10:0030

Venkat Vishwanath

ANL

 

Addressing I/O Bottlenecks and Simulation-Time Analytics at Extreme Scales

 

 

10:30

Break

 

 

 

 VISHWANATH_INRIA_JLPC_DIST.pdf

 

11:00

Babak Behzad

UIUC

ACM/IEEE SC13

Taming Parallel I/O Complexity with Auto-Tuning

 Babak_Slides.pdf

 

11:30

McHenry, Kenton Guadron

UIUC

 

NSF CIF21 DIBBs: Brown Dog

 

 

12:00

Lunch

 

 


 

 

 

 

 

 

 

 

Mini Workshop1

Resilience

Room 1030

Chair: Yves Robert Frederic Vivien

 

 

 

 

 

 

 13:30

Wesley Bland

ANL

Joint-result

Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications

 Fault Tolerant Runtime Research at ANLbland-jlpc.pdf

 

14:00

Tatiana Martsinkevich

Inria

Joint-result

On the feasibility of message logging in hybrid hierarchical FT protocols

 martsinkevich jlpc workshop in ncsa.pdf

 

14:30

Mohamed Slim Bouguera

Inria

Joint-result, submitted

 Failure prediction: what to do with unpredicted failures ?

 
jointlab_ipdps_presentation_v0.pdf

 

15:00

Ana Gainaru

UIUC

Joint-result, submitted

Topology and behaviour aware failure prediction for Blue Waters.

 jlpc13_againaru.pdf

 

15:30

Break

 

 

 

  

Chair: Franck

16:00

Sheng Di

Inria

Joint-result, submitted

 Optimization of Multi-level Checkpoint Model for Large Scale HPC Applications

 
10th-Joint-workshop-UIUC-sdi.ppt

 

16:30

Yves Robert

Inria

Joint-result,

Assessing the impact of ABFT & Checkpoint composite strategies

 joint-lab2013.pdf

 

17h00Weslay Bland

Leonardo Bautista Gomez

ANL

 

Fault Tolerant Runtime Research at ANL

Joint-result

ACM PPoPP 2014

Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications

jlpc10leo.pdf 

 

17H30

Adjourn

 

 

 

 

Diner

(Departure from Hampton Inn at 7PM) with mini buses)


 

 

19Restaurant:00

Bus for Diner

 

 


Ko-Fusion:

Address: 1 Main St #104, Champaign, IL 61820

Phone:(217) 531-1166
 

 

       

Mini Workshop2

Numerical Agorithms

Room 1040

Chair: Bill GroppStefan Wild

 

 

 

 

 

 

 

13:30

Luke Olson

UIUC

 

Toward a more robust sparse solver with some ideas on resilience and scalability  2013_JointLab_NCSA_Olson.pdf
 14:00 Prasanna BalaprakashANL  Active-Learning-based Surrogate Models for Empirical Performance Tuning Balaprakash.pdf

 

14:30

Yushan Wang

Inria

 

Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems.

 JointLab-Urbana.pdf

 

15:00

Jed Brown

ANL

 

 Fast solvers for implicit Runge-Kutta systems

 
20131126-JointLabRungeKutta.pdf

 

15:30

Break

 

 

 

  

Chair: Luke Olson

16:00

Pierre Jolivet

Inria

Best Paper finalist, IEEE, ACM SC13

Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems

 jolivet-ddm.pdf
 16:30Vincent BaudouiTotal&ANLJoint-resultRound-off error propagation and non-determinism in parallel applications baudoui-roundoff_errors.pdf
 17:00Torsten HoeflerEPFLETH Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes htor.pdf

 

17:30

Adjourn

 

 

 

 

       

Diner

(Departure from Hampton Inn at 7PM) with mini buses)


 

 

 

19:00

Bus for diner

 

 

Restaurant:
Ko-Fusion:

Address: 1 Main St #104, Champaign, IL 61820
Phone:(217) 531-1166
 

 

 

 

 

 

 

 

 

Workshop Day 3


Wednesday Nov. 27

 

 

 

 

 

 

 

 

 

 

 

 

Mini Workshop3


 

 

 

 

 

 

 Programming models, compilation and runtime.

Room 1030

Chair: Marc Snir

08:30

Grigori Fursin

Inria

 

Collective Mind: making auto-tuning practical using crowdsourcing and predictive modeling

 Fursin_Slides.pdf

 

09:00

Maria Garzaran

UIUC

 

Optimization by Run-time Specialization for Sparse Matrix-Vector Multiplication

 garzaranNCSA-INRIA.pdf


09:30

Jean-François Mehaut

Inria

 

From Multicores to Manycores Processors: Challenging Programming Issues with the MPPA/KALRAY

 slides_JFM.pdf
 10:00Break    

 

10:30

Frederic Vivien

Inria

 

Scheduling tree-shaped task graphs to minimize memory and makespan 

 

 

11:00

Rafael Tesser

Inria

Joint result PDP 2013

Using AMPI to improve the performance of the Ondes3D seismic wave simulator through dynamic load balancing

 RafaelTessser-WSJLPC-Nov2013.pdf

 

11:3000

Emmanuel Jeannot

Inria

Joint-result, IEEE Cluster2013

Communication and Topology-aware Load Balancing in Charm++ with TreeMatch

 

cluster_slide.pdf

Auditorium 1122

11:30

 

12:00

Closing

 

 

 

 

 

12:3000

Lunch

 

 

 

 

       

 

18:00

Bus for diner

 

 

Diner

(Departure from Hampton Inn at 5:45 PM) with mini buses)


 

 

Restaurant:
Ribeye:

Address: 1701 S Neil St, Champaign, IL 61820
Phone:(217) 351-9115
 

 

Mini Workshop4

Large scale systems and their simulators

Room 1040

Chair: Bill Kramer

 

 

 

 

 

 


08:30

Eric Bohm

UIUC

 

A Multi-resolution Emulation + Simulation Methodology for Exascale

 JLPC_Bigsim-201311.pdf

 

09:00

Arnault Legrand

Inria

 

SMPI: Toward Better Simulation of MPI Applications

smpi_jlpc_13.pdf

 09:30Frederic VivienInria  Scheduling tree-shaped task graphs to minimize memory and makespan  

 

10:00

Break

 

 

 

 


10:30

 Kate Kahey Keahey

ANL

 

 Evaluating Streaming Strategies for Event Processing across Infrastructure Clouds 

 
jointlab-ncsa.pdf

 

11:00

Jeremy HenosEnos

UIUC

 

 Application Runtime Consistency and Performance Challenges on a shared 3D torus.

 smpi_jlpc_13.pdf

Auditorium 1122 

11:30TBD

Closing

 

 

 

 Auditorium 1122


12:00

ClosingLunch

 

 

 

 

 

12:30

       
Diner(Departure from Hampton Inn at 5:45 PM) with mini buses)     

Restaurant:

18

Ribeye:

00
Bus for diner  

Address: 1701 S Neil St, Champaign, IL 61820
Phone:(217) 351-9115
 
 

Abstracts

Kenton McHenry

NSF CIF21 DIBBs: Brown Dog

The objective of this project is to construct a service that will allow for past and present un-curated data to be utilized by science while simultaneously demonstrating the novel science that can be conducted from such data. The proposed effort will focus on the large distributed and heterogeneous bodies of past and present un-curated data, what is often referred to in the scientific community as long-tail data, data that would have great value to science if its contents were readily accessible. The proposed framework will be made up of two re-purposable cyberinfrastructure building blocks referred to as a Data Access Proxy (DAP) and Data Tilling Service (DTS). These building blocks will be developed and tested in the context of three use cases that will advance science in geoscience, biology, engineering, and social science. The DAP will aim to enable a new era of applications that are agnostic to file formats through the use of a tool called a Software Server which itself will serve as a workflow tool to access functionality within 3rd party applications. By chaining together open/save operations within arbitrary software the DAP will provide a consistent means of gaining access to content stored across the large numbers of file formats that plague long tail data. The DTS will utilize the DAP to access data contents and will serve to index unstructured data sources (i.e. instrument data or data without text metadata). Building off of the Versus content based comparison framework and the Medici extraction services for auto-curation the DTS will assign content specific identifiers to untagged data allowing one to search collections of such data. The intellectual merit of this work lies in the proposed solution which does not attempt to construct a single piece of software that magically understands all data, but instead aims at utilizing every possible source of automatable help already in existence in a robust and provenance preserving manner to create a service that can deal with as much of this data as possible. This proverbial “super mutt” of software, or Brown Dog, will serve as a low level data infrastructure to interface with digital data contents and through its capabilities enable a new era of science and applications at large. The broader impact of this work is in its potential to serve not just the scientific community but the general public, as a DNS for data, moving civilization towards an era where a user’s access to data is not limited by a file’s format or un-curated collections.

...

SMPI: Toward Better Simulation of MPI Applications
We will present our last result on the SMPI/SimGrid framework. SMPI now implements all the collective algorithms and selection logics of both OpenMPI and MPICH and even a few other collective algorithms from Star MPI. Together with a flexible network model and topology description mechanisme, this allowed us to obtain almost perfect prediction of NASPB and BigDFT on Ethernet/TCP based clusters. We are currently working on extending this work to other kind of networks as well as on mixing the emulation capability of SMPI with the trace replay mechanism. We are also working on improving the replay mechanism so that it handles seamlessly classical trace formats.
Welsley Wesley Bland
Fault Tolerant Runtime Research at ANL
Fault tolerance has been presented as an emerging problem for decades, with researchers often claiming that the next generation of hardware will introduce new levels of failure rates that will destroy productivity and cause applications to become unusable. While it is true that as machines have scaled, resilience has become more and more of a concern, there are issues already affecting applications at current scales. Process failure remains a concern, though primarily for applications that can run at the largest scales or on very unstable hardware. For smaller applications however, there are other concerns, such as soft errors, performance loss, etc. This talk will cover some of the research being performed in the Programming Models and Runtime Systems group at Argonne National Laboratory to study these phenomena.

...

We propose a novel technique to detect silent data corruption based on low-overhead, localized data monitoring. We implemented our technique on a generic library that allows scientific applications to easily self-analyze during runtime. Using this technique, an application an learn the normal dynamics of its datasets, allowing it to quickly spot anomalies. We evaluate our technique with synthetic enchmarks and large scientific datasets of production-level scientific applications simulating real phenomena. We show that our technique can detect up to 50% of injected errors while incurring only negligible overhead on real scientific applications.

Pavan BalagiBalaji
Message Passing in Massively Multithreaded Environments
Many-core architectures, such as the IBM Blue Gene/Q and Intel Xeon Phi, provide dozens of cores and hundreds of hardware threads.  To utilize such architectures, application programmers are increasingly looking at hybrid programming models (frequently referred to as ``MPI+X’' models), where multiple threads interact with the MPI library.  A common mode of operation for hybrid MPI+threads applications is where multiple threads are used to parallelize the computation, and one or more threads also issues MPI operations.  While such a model is becoming increasingly popular because of the reducing per-core hardware resources available in modern architectures, it poses several challenges for the efficiency of MPI communication in such environments.  In this talk, I’ll describe some of our recent work on optimizing MPI in such environments, either with multiple threads calling MPI operations or a single thread doing so.

...