Page History

...

Leonardo Kate KaheyANL Evaluating Streaming Strategies for Event Processing across Infrastructure Clouds

Main Topics	Schedule	Speaker	Affiliation	Type of presentation	Title (tentative)	Download

Sunday Nov. 24th Dinner Before the Workshop	7:00 PM (Departure from Hampton Inn at 6:45PM) with mini buses	Only people registered for the dinner			Restaurant: Silvercreek: Address: 402 N Race St, Urbana, IL 61801 Phone:(217) 328-3402

Workshop Day 1	Monday Nov. 25th
					TITLES ARE TEMPORARY (except if in bold font)
Registration	08:00
Welcome and Introduction Auditorium 1122 Chair: Franck Cappello	08:30	Marc Snir + Franck Cappello Co-directors of the joint-lab		Background	Welcome, Workshop objectives and organization	Opening-10th-Workshop.pdf
	08:45	Ed. Seidel Incoming NCSA director	UIUC	Background	NCSA update and vision of the collaboration (This address has been inverted with the next one due to schedule constraints)
	09:00	Peter Schiffer UIUC Vice Chancellor for Research	UIUC	Background	Welcome from UIUC Vice Chancellor for Research
	09:15	Michel Cosnard Inria CEO and President	Inria	Background	INRIA updates and vision of the collaboration	HPC@Inria-UIUC-nov13-v2.pptx
	09:30	Marc Snir Director of Argonne/ MCS and co-director of the joint-lab	ANL	Background	Argonne updates and vision of the collaboration	jlpc 11-13 snir.pdf
	09:45	Marc Daumas Attaché for Science and Technology	Embassy of France	Background	France-USA collaboration program updates	http://prezi.com/hsggz_30xlqt/2013-jlpc-workshop-ncsa-uiuc-il/
	9h55	Franck Cappello Co-director of the Joint-lab	ANL	Background	Joint-Lab, PUF, New Joint-Lab, organization	Joint-Lab-JLESC-PUF.pdf
	10:15	Break
Extreme Scale Systems and infrastructures Auditorium 1122 Chair: Pavan Balaji	10:45	Pete Beckman	ANL		Extreme Scale Computing & Co-design Challenges
	11:15	John Towns	UIUC		Applications Challenges in the XSEDE Environment		XSEDE-Apps-Challenges-for-Joint-Lab.pdf
		11:45	Gabriel Antoniu	Inria		A-Brain and Z-CloudFlow: Scalable Data Processing on Azure Clouds - Lessons Learned in Three Years and Future Directions	2013-11-25-JLPC-Azure-final.pdf
	12:15	Lunch
Chair: Yves Robert	13:45	Bill Kramer	UIUC	Blue Waters	Is Petascale Completely Done? What Should We Do Now?	Kramer JLPC November Workshop - v1.pdf
	14:15	Torsten Hoefler	ETH	IEEE/ACM SC13 Best Paper	Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided	hoefler-mpi3rma-slides.pdf
	14:45	Rob Ross	ANL		Thinking Past POSIX: Persistent Storage in Extreme Scale Systems	ross_uiuc-storage-20131125.pdf
	15:15	Break
Chair: Bill Gropp	15:45	François Pellegrini	Inria		Parallel repartitioning and remeshing : results and prospects	pellegrini_scotch.pdf pellegrini_pampa.pdf
	16:15	Pavan BalagiBalaji	ANL		Message Passing in Massively Multithreaded Environments	2013-11-25-jlpc-threads-pavanbalaji.pptx
	16:45	Wen Mei Hwu	UIUC		A New, Portable Algorithm Framework for Parallel Linear Recurrence Problems	UIUC_INRIA__Tangram_GPU_2013_Hwu.pdf
	17:15	Adjourn						18:45	Bus for Diner
Diner	(Departure from Hampton Inn at 6:45PM) with mini buses)				Restaurant: Kamakura: Address: 715 S Neil St, Champaign, IL 61820 Phone:(217) 351-9898

Workshop Day 2	Tuesday Nov. 26
Applications, I/O, Visualization, Big data Auditorium 1122 Chair: Rob Ross	08:30	Greg Bauer	UIUC		Applications and their challenges on Blue Waters	GBAUER-INRIA-NCSA-BW-2013.pdf
	09:00	Matthieu Dorier	Inria	Joint-result, submitted	CALCioM: Mitigating I/O Interferences in HPC Systems through Cross-Application Coordination	DORIER-JLPC-November2013.pdf
	09:30	Dries KempeKimpe	ANL		Mercury: Enabling Remote Procedure Call for High-Performance Computing	dkimpe-mercury.pdf
	10:00	Break
Chair: Gabriel Antoniu	10:30	Venkat Vishwanath	ANL		Addressing I/O Bottlenecks and Simulation-Time Analytics at Extreme Scales	VISHWANATH_INRIA_JLPC_DIST.pdf
	11:00	Babak Behzad	UIUC	ACM/IEEE SC13	Taming Parallel I/O Complexity with Auto-Tuning	Babak_Slides.pdf
	11:30	McHenry, Kenton Guadron	UIUC		NSF CIF21 DIBBs: Brown Dog
	12:00	Lunch

Mini Workshop1 Resilience Room 1030 Chair: Luke Olson Frederic Vivien
	13:30	Wesley Bland	ANL	Joint-result	Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications		Fault Tolerant Runtime Research at ANL	bland-jlpc.pdf
	14:00	Tatiana Martsinkevich	Inria	Joint-result	On the feasibility of message logging in hybrid hierarchical FT protocols	martsinkevich jlpc workshop in ncsa.pdf
	14:30	Mohamed Slim Bouguera	Inria	Joint-result, submitted	Failure prediction: what to do with unpredicted failures ?	jointlab_ipdps_presentation_v0.pdf
	15:00	Ana Gainaru	UIUC	Joint-result, submitted	Topology and behaviour aware failure prediction for Blue Waters.	jlpc13_againaru.pdf
	15:30	Break
Chair: Frederic VivienFranck	16:00	Sheng Di	Inria	Joint-result, submitted	Optimization of Multi-level Checkpoint Model for Large Scale HPC Applications	10th-Joint-workshop-UIUC-sdi.ppt
	16:30	Yves Robert	Inria	Joint-result,	Assessing the impact of ABFT & Checkpoint composite strategies	joint-lab2013.pdf
	17h00Weslay Bland	Leonardo Bautista Gomez	ANL		Fault Tolerant Runtime Research at ANL		Joint-result ACM PPoPP 2014	Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications	jlpc10leo.pdf
	17H30	Adjourn
Diner	(Departure from Hampton Inn at 7PM) with mini buses)				19Restaurant:00	Bus for Diner			Ko-Fusion: Address: 1 Main St #104, Champaign, IL 61820 Phone:(217) 531-1166

Mini Workshop2 Numerical Agorithms Room 1040 Chair: Bill GroppStefan Wild
	13:30	Luke Olson	UIUC		Toward a more robust sparse solver with some ideas on resilience and scalability	2013_JointLab_NCSA_Olson.pdf
	14:00	Prasanna Balaprakash	ANL		Active-Learning-based Surrogate Models for Empirical Performance Tuning	Balaprakash.pdf
	14:30	Yushan Wang	Inria		Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems.	JointLab-Urbana.pdf
	15:00	Jed Brown	ANL		Fast solvers for implicit Runge-Kutta systems	20131126-JointLabRungeKutta.pdf
	15:30	Break
Chair: Luke Olson	16:00	Pierre Jolivet	Inria	Best Paper finalist, IEEE, ACM SC13	Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems	jolivet-ddm.pdf
	16:30	Vincent Baudoui	Total&ANL	Joint-result	Round-off error propagation and non-determinism in parallel applications	baudoui-roundoff_errors.pdf
	17:00	Torsten Hoefler	EPFLETH		Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes	htor.pdf
	17:30	Adjourn

Diner	(Departure from Hampton Inn at 7PM) with mini buses)				19Restaurant:00	Bus for diner			Ko-Fusion: Address: 1 Main St #104, Champaign, IL 61820 Phone:(217) 531-1166

Workshop Day 3	Wednesday Nov. 27

Mini Workshop3
Programming models, compilation and runtime. Room 1030 Chair: Marc Snir	08:30	Grigori Fursin	Inria		Collective Mind: making auto-tuning practical using crowdsourcing and predictive modeling	Fursin_Slides.pdf
	09:00	Maria Garzaran	UIUC		Optimization by Run-time Specialization for Sparse Matrix-Vector Multiplication	garzaranNCSA-INRIA.pdf
	09:30	Jean-François Mehaut	Inria		From Multicores to Manycores Processors: Challenging Programming Issues with the MPPA/KALRAY	slides_JFM.pdf
	10:00	Break
	10:30	Rafael Tesser	Inria	Joint result PDP 2013	Using AMPI to improve the performance of the Ondes3D seismic wave simulator through dynamic load balancing	RafaelTessser-WSJLPC-Nov2013.pdf
	11:00	Emmanuel Jeannot	Inria	Joint-result, IEEE Cluster2013	Communication and Topology-aware Load Balancing in Charm++ with TreeMatch	cluster_slide.pdf
Auditorium 1122	11:30	Closing
	12:00	Lunch
								18:00	Bus for diner
Diner	(Departure from Hampton Inn at 5:45 PM) with mini buses)				Restaurant: Ribeye: Address: 1701 S Neil St, Champaign, IL 61820 Phone:(217) 351-9115
Mini Workshop4 Large scale systems and their simulators Room 1040 Chair: Bill Kramer
	08:30	Eric Bohm	UIUC		A Multi-resolution Emulation + Simulation Methodology for Exascale	JLPC_Bigsim-201311.pdf
	09:00	Arnault Legrand	Inria		SMPI: Toward Better Simulation of MPI Applications	smpi_jlpc_13.pdf
	09:30	Frederic Vivien	Inria		Scheduling tree-shaped task graphs to minimize memory and makespan
	10:00	Break
	10:30	Frederic Vivien	Kate Keahey	ANLInria		Scheduling tree-shaped task graphs to minimize memory and makespan Evaluating Streaming Strategies for Event Processing across Infrastructure Clouds	jointlab-ncsa.pdf
	11:00	Jeremy HenosEnos	UIUC		Application Runtime Consistency and Performance Challenges on a shared 3D torus.	smpi_jlpc_13.pdf
Auditorium 1122	11:30	Closing
	12:00	Lunch
								18:00
Diner	(Departure from Hampton Inn at 5:45 PM) with mini buses)	Bus for diner			Restaurant: Ribeye: Address: 1701 S Neil St, Champaign, IL 61820 Phone:(217) 351-9115

Abstracts

Kenton McHenry

NSF CIF21 DIBBs: Brown Dog

The objective of this project is to construct a service that will allow for past and present un-curated data to be utilized by science while simultaneously demonstrating the novel science that can be conducted from such data. The proposed effort will focus on the large distributed and heterogeneous bodies of past and present un-curated data, what is often referred to in the scientific community as long-tail data, data that would have great value to science if its contents were readily accessible. The proposed framework will be made up of two re-purposable cyberinfrastructure building blocks referred to as a Data Access Proxy (DAP) and Data Tilling Service (DTS). These building blocks will be developed and tested in the context of three use cases that will advance science in geoscience, biology, engineering, and social science. The DAP will aim to enable a new era of applications that are agnostic to file formats through the use of a tool called a Software Server which itself will serve as a workflow tool to access functionality within 3rd party applications. By chaining together open/save operations within arbitrary software the DAP will provide a consistent means of gaining access to content stored across the large numbers of file formats that plague long tail data. The DTS will utilize the DAP to access data contents and will serve to index unstructured data sources (i.e. instrument data or data without text metadata). Building off of the Versus content based comparison framework and the Medici extraction services for auto-curation the DTS will assign content specific identifiers to untagged data allowing one to search collections of such data. The intellectual merit of this work lies in the proposed solution which does not attempt to construct a single piece of software that magically understands all data, but instead aims at utilizing every possible source of automatable help already in existence in a robust and provenance preserving manner to create a service that can deal with as much of this data as possible. This proverbial “super mutt” of software, or Brown Dog, will serve as a low level data infrastructure to interface with digital data contents and through its capabilities enable a new era of science and applications at large. The broader impact of this work is in its potential to serve not just the scientific community but the general public, as a DNS for data, moving civilization towards an era where a user’s access to data is not limited by a file’s format or un-curated collections.

...

SMPI: Toward Better Simulation of MPI Applications
We will present our last result on the SMPI/SimGrid framework. SMPI now implements all the collective algorithms and selection logics of both OpenMPI and MPICH and even a few other collective algorithms from Star MPI. Together with a flexible network model and topology description mechanisme, this allowed us to obtain almost perfect prediction of NASPB and BigDFT on Ethernet/TCP based clusters. We are currently working on extending this work to other kind of networks as well as on mixing the emulation capability of SMPI with the trace replay mechanism. We are also working on improving the replay mechanism so that it handles seamlessly classical trace formats.

Welsley Wesley Bland

Fault Tolerant Runtime Research at ANL
Fault tolerance has been presented as an emerging problem for decades, with researchers often claiming that the next generation of hardware will introduce new levels of failure rates that will destroy productivity and cause applications to become unusable. While it is true that as machines have scaled, resilience has become more and more of a concern, there are issues already affecting applications at current scales. Process failure remains a concern, though primarily for applications that can run at the largest scales or on very unstable hardware. For smaller applications however, there are other concerns, such as soft errors, performance loss, etc. This talk will cover some of the research being performed in the Programming Models and Runtime Systems group at Argonne National Laboratory to study these phenomena.

...

We propose a novel technique to detect silent data corruption based on low-overhead, localized data monitoring. We implemented our technique on a generic library that allows scientific applications to easily self-analyze during runtime. Using this technique, an application an learn the normal dynamics of its datasets, allowing it to quickly spot anomalies. We evaluate our technique with synthetic enchmarks and large scientific datasets of production-level scientific applications simulating real phenomena. We show that our technique can detect up to 50% of injected errors while incurring only negligible overhead on real scientific applications.

Pavan BalagiBalaji

Message Passing in Massively Multithreaded Environments
Many-core architectures, such as the IBM Blue Gene/Q and Intel Xeon Phi, provide dozens of cores and hundreds of hardware threads. To utilize such architectures, application programmers are increasingly looking at hybrid programming models (frequently referred to as ``MPI+X’' models), where multiple threads interact with the MPI library. A common mode of operation for hybrid MPI+threads applications is where multiple threads are used to parallelize the computation, and one or more threads also issues MPI operations. While such a model is becoming increasingly popular because of the reducing per-core hardware resources available in modern architectures, it poses several challenges for the efficiency of MPI communication in such environments. In this talk, I’ll describe some of our recent work on optimizing MPI in such environments, either with multiple threads calling MPI operations or a single thread doing so.

...

Child pages

Versions Compared

Old Version 55

New Version Current

Key

TITLES ARE TEMPORARY (except if in bold font)

Abstracts