...
Main Topics | Schedule | Speaker | Affiliation | Type of presentation | Title (tentative) | Download | |
Sunday Nov. 24th | 7:00 PM (departure from Hampton Inn at 6:45 PM by minibus) | Only people registered for the dinner |
| Restaurant: Address: 402 N Race St, Urbana, IL 61801 Phone: (217) 328-3402 |
Workshop Day 1 | Monday Nov. 25th |
| TITLES ARE TEMPORARY (except if in bold font) |
| |
Registration | 08:00 |
Welcome and Introduction, Auditorium 1122, Chair: Franck Cappello | 08:30 | Marc Snir + Franck Cappello, co-directors of the joint-lab | | Background | Welcome, workshop objectives and organization | Opening-10th-Workshop.pdf |
| 08:45 | Ed. Seidel, incoming NCSA director | UIUC | Background | NCSA update and vision of the collaboration (this address was swapped with the next one due to schedule constraints) | |
| 09:00 | Peter Schiffer, UIUC Vice Chancellor for Research | UIUC | Background | Welcome from UIUC Vice Chancellor for Research | |
| 09:15 | Michel Cosnard, Inria CEO and President | Inria | Background | Inria updates and vision of the collaboration | HPC@Inria-UIUC-nov13-v2.pptx |
| 09:30 | Marc Snir, Director of Argonne/MCS and co-director of the joint-lab | ANL | Background | Argonne updates and vision of the collaboration | jlpc 11-13 snir.pdf |
| 09:45 | Marc Daumas, Attaché for Science and Technology | Embassy of France | Background | France-USA collaboration program updates | http://prezi.com/hsggz_30xlqt/2013-jlpc-workshop-ncsa-uiuc-il/ |
| 09:55 | Franck Cappello, co-director of the joint-lab | ANL | Background | Joint-Lab, PUF, New Joint-Lab, organization | |
| 10:15 | Break |
Extreme Scale Systems and Infrastructures, Auditorium 1122, Chair: Pavan Balaji | 10:45 | Pete Beckman | ANL | | Extreme Scale Computing & Co-design Challenges | |
| 11:15 | John Towns | UIUC | | Applications Challenges in the XSEDE Environment | XSEDE-Apps-Challenges-for-Joint-Lab.pdf |
| 11:45 | Gabriel Antoniu | Inria | | A-Brain and Z-CloudFlow: Scalable Data Processing on Azure Clouds - Lessons Learned in Three Years and Future Directions | 2013-11-25-JLPC-Azure-final.pdf |
| 12:15 | Lunch |
Chair: Yves Robert | 13:45 | Bill Kramer | UIUC | Blue Waters | Is Petascale Completely Done? What Should We Do Now? | Kramer JLPC November Workshop - v1.pdf |
| 14:15 | Torsten Hoefler | ETH | IEEE/ACM SC13 Best Paper | Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided | |
| 14:45 | Rob Ross | ANL | | Thinking Past POSIX: Persistent Storage in Extreme Scale Systems | ross_uiuc-storage-20131125.pdf |
| 15:15 | Break |
Chair: Bill Gropp | 15:45 | François Pellegrini | Inria | | Parallel repartitioning and remeshing: results and prospects | |
| 16:15 | Pavan Balaji | ANL | | Message Passing in Massively Multithreaded Environments | 2013-11-25-jlpc-threads-pavanbalaji.pptx |
| 16:45 | Wen-mei Hwu | UIUC | | A New, Portable Algorithm Framework for Parallel Linear Recurrence Problems | UIUC_INRIA__Tangram_GPU_2013_Hwu.pdf |
| 17:15 | Adjourn |
Dinner | (departure from Hampton Inn at 6:45 PM by minibus) |
| Restaurant: Address: 715 S Neil St, Champaign, IL 61820 Phone: (217) 351-9898 |
Workshop Day 2 | Tuesday Nov. 26 |
Applications, I/O, Visualization, Big Data, Auditorium 1122, Chair: Rob Ross | 08:30 | Greg Bauer | UIUC | | Applications and their challenges on Blue Waters | |
| 09:00 | Matthieu Dorier | Inria | Joint-result, submitted | CALCioM: Mitigating I/O Interferences in HPC Systems through Cross-Application Coordination | DORIER-JLPC-November2013.pdf |
| 09:30 | Dries Kimpe | ANL | | Mercury: Enabling Remote Procedure Call for High-Performance Computing | dkimpe-mercury.pdf |
| 10:00 | Break |
Chair: Gabriel Antoniu | 10:30 | Venkat Vishwanath | ANL | | Addressing I/O Bottlenecks and Simulation-Time Analytics at Extreme Scales | VISHWANATH_INRIA_JLPC_DIST.pdf |
| 11:00 | Babak Behzad | UIUC | ACM/IEEE SC13 | Taming Parallel I/O Complexity with Auto-Tuning | |
| 11:30 | Kenton Guadron McHenry | UIUC | | NSF CIF21 DIBBs: Brown Dog | |
| 12:00 | Lunch |
Mini Workshop 1: Resilience, Room 1030, Chair: Frederic Vivien |
| 13:30 | Wesley Bland | ANL | | Fault Tolerant Runtime Research at ANL | bland-jlpc.pdf |
| 14:00 | Tatiana Martsinkevich | Inria | Joint-result | On the feasibility of message logging in hybrid hierarchical FT protocols | martsinkevich jlpc workshop in ncsa.pdf |
| 14:30 | Mohamed Slim Bouguerra | Inria | Joint-result, submitted | Failure prediction: what to do with unpredicted failures? | jointlab_ipdps_presentation_v0.pdf |
| 15:00 | Ana Gainaru | UIUC | Joint-result, submitted | Topology and behaviour aware failure prediction for Blue Waters. | jlpc13_againaru.pdf |
| 15:30 | Break |
Chair: Franck Cappello | 16:00 | Sheng Di | Inria | Joint-result, submitted | Optimization of Multi-level Checkpoint Model for Large Scale HPC Applications | 10th-Joint-workshop-UIUC-sdi.ppt |
| 16:30 | Yves Robert | Inria | Joint-result | Assessing the impact of ABFT & Checkpoint composite strategies | |
| 17:00 | Leonardo Bautista Gomez | ANL | Joint-result, ACM PPoPP 2014 | Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications | jlpc10leo.pdf |
| 17:30 | Adjourn |
Dinner | (departure from Hampton Inn at 7 PM by minibus) |
| Restaurant: Address: 1 Main St #104, Champaign, IL 61820 Phone: (217) 531-1166 |
Mini Workshop 2: Numerical Algorithms, Room 1040, Chair: Stefan Wild |
| 13:30 | Luke Olson | UIUC | | Toward a more robust sparse solver with some ideas on resilience and scalability | 2013_JointLab_NCSA_Olson.pdf |
| 14:00 | Prasanna Balaprakash | ANL | | Active-Learning-based Surrogate Models for Empirical Performance Tuning | Balaprakash.pdf |
| 14:30 | Yushan Wang | Inria | | Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems. | JointLab-Urbana.pdf |
| 15:00 | Jed Brown | ANL | | Fast solvers for implicit Runge-Kutta systems | 20131126-JointLabRungeKutta.pdf |
| 15:30 | Break |
Chair: Luke Olson | 16:00 | Pierre Jolivet | Inria | Best Paper finalist, IEEE/ACM SC13 | Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems | jolivet-ddm.pdf |
| 16:30 | Vincent Baudoui | Total & ANL | Joint-result | Round-off error propagation and non-determinism in parallel applications | baudoui-roundoff_errors.pdf |
| 17:00 | Torsten Hoefler | ETH | | Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes | htor.pdf |
| 17:30 | Adjourn |
Workshop Day 3 | Wednesday Nov. 27 |
Mini Workshop 3: Programming models, compilation and runtime, Room 1030, Chair: Marc Snir |
| 08:30 | Grigori Fursin | Inria | | Collective Mind: making auto-tuning practical using crowdsourcing and predictive modeling | Fursin_Slides.pdf |
| 09:00 | Maria Garzaran | UIUC | | Optimization by Run-time Specialization for Sparse Matrix-Vector Multiplication | garzaranNCSA-INRIA.pdf |
| 09:30 | Jean-François Mehaut | Inria | | From Multicores to Manycores Processors: Challenging Programming Issues with the MPPA/KALRAY | slides_JFM.pdf |
| 10:00 | Break |
| 10:30 | Rafael Tesser | Inria | Joint-result, PDP 2013 | Using AMPI to improve the performance of the Ondes3D seismic wave simulator through dynamic load balancing | RafaelTessser-WSJLPC-Nov2013.pdf |
| 11:00 | Emmanuel Jeannot | Inria | Joint-result, IEEE Cluster 2013 | Communication and Topology-aware Load Balancing in Charm++ with TreeMatch | cluster_slide.pdf |
Auditorium 1122 | 11:30 | Closing |
| 12:00 | Lunch |
Dinner | (departure from Hampton Inn at 5:45 PM by minibus) |
| Restaurant: Address: 1701 S Neil St, Champaign, IL 61820 Phone: (217) 351-9115 |
Mini Workshop 4: Large scale systems and their simulators, Room 1040, Chair: Bill Kramer |
| 08:30 | Eric Bohm | UIUC | | A Multi-resolution Emulation + Simulation Methodology for Exascale | JLPC_Bigsim-201311.pdf |
| 09:00 | Arnaud Legrand | Inria | | SMPI: Toward Better Simulation of MPI Applications | |
| 09:30 | Frederic Vivien | Inria | | Scheduling tree-shaped task graphs to minimize memory and makespan | |
| 10:00 | Break |
| 10:30 | Kate Keahey | ANL | | Evaluating Streaming Strategies for Event Processing across Infrastructure Clouds | jointlab-ncsa.pdf |
| 11:00 | Jeremy Enos | UIUC | | Application Runtime Consistency and Performance Challenges on a shared 3D torus. | smpi_jlpc_13.pdf |
Auditorium 1122 | 11:30 | Closing |
| 12:00 | Lunch |
...
We will present our latest results on the SMPI/SimGrid framework. SMPI now implements all the collective algorithms and selection logic of both OpenMPI and MPICH, and even a few other collective algorithms from Star MPI. Together with a flexible network model and topology description mechanism, this allowed us to obtain almost perfect predictions for NASPB and BigDFT on Ethernet/TCP-based clusters. We are currently working on extending this work to other kinds of networks, as well as on combining the emulation capability of SMPI with the trace replay mechanism. We are also working on improving the replay mechanism so that it seamlessly handles classical trace formats.
Fault tolerance has been presented as an emerging problem for decades, with researchers often claiming that the next generation of hardware will introduce failure rates high enough to destroy productivity and make applications unusable. While resilience has indeed become a growing concern as machines have scaled, some issues already affect applications at current scales. Process failure remains a concern, though primarily for applications that run at the largest scales or on very unstable hardware. For smaller applications, however, there are other concerns, such as soft errors and performance loss. This talk will cover some of the research being performed in the Programming Models and Runtime Systems group at Argonne National Laboratory to study these phenomena.
...