Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents

Overview

...

A study of cost and performance of the application of cloud computing to Astronomy 1

The performance of three workflow applications with different I/O, memory and CPU requirements are investigated on Amazon EC2 and the performance of cloud are compared with that of a typical HPC (Abe in NCSA).
The goal is to determine which type of scientific workflow applications are cheaply and efficiently run on the Amazon EC2 cloud.
Also the application of cloud computing to the generation of an atlas of periodograms for the 210,000 light curves is described.

...

 

 

Result

Runtimes

Tasks

631,992

 

Mean Task Runtime

6.34 sec

 

Jobs

25,401

 

Mean Job Runtime

2.62 min

 

Total CPU Time

1,113 hr

 

Total Wall Time

26.8 hr

Inputs

Input Files

210,664

 

Mean Input Size

0.084 MB

 

Total Input Size

17.3 GB

Outputs

Output Files

1,263,984

 

Mean Output Size

0.124 MB

 

Total Output Size

76.52 GB

Cost

Compute Cost

$291.58

 

Transfer Cost

$11.48

 

Total Cost

$303.06

...

Seeking Supernovae in the Clouds: A Performance Study

...

2

Summary

Nearby Supernova Factory(SNfactory) experiment measures the expansion history of the Universe to explore the nature of Dark Energy with Type Ia supernovae. SNfactory is a pipeline of serial processes executing various image processing algorithms in parallel on ~10TBs of data. SNfactory is ported to Amazon Web Services environment.

...

  • Need to replicate HPC cluster environment in EC2 or the application must be modified.
  • Mean rate of failure is higher in EC2 than in traditional cluster environments which needs to be handled.
  • Inability to acquire all of the VMI requested because insufficient resources are available, so need to modify the application to adapt this.
  • Transient errors.

Application of Cloud computing to the creation of image mosaic and management of their provenance 3

Summary

Workflow

Data

Cloud platform

Cloud performance

Issues/Gaps

Scientific workflow applications on Amazon EC2 4

Summary

Workflow

Data

Cloud platform

Cloud performance

Issues/Gaps

References

  1. Berriman, G.B. et al. Sixth IEEE International Conference on e-Science, 1-7 (2010)
  2. Jackson, K.R. et al. Proc. ACM Int. Symp. HPDC, 421-429 (2010)
  3. Berriman, G.B. et al. SPIE Conference 7740: Software and Cyberinfrastructure for Astronomy (2010)
  4. Juve, G. et al. Cloud Computing Workshop in Conjunction with e-Science Oxford, UK: IEEE (2009)
  5. Juve, G. et al. SC(2010)
  6. Wiley, K. et al. Publications of the Astronomical Society of the Pacific 123 366-380 (2011)
  7. Gaudet, SJackson, K.R. et al. Proc . ACM Int. Symp. HPDCSPIE (2010)
  8. Ramakrishnan, L. et al. Wands '10 , 421-429 (2010)

Notes and other links

a. Workflow: loosely coupled parallel applications that consist of a set of computational tasks linked by data- and control-flow dependencies.
b. A parallel file system and high-speed interconnect would make dramatic performance upgrades. Recently Amazon released a new resource type including a 10Gb interconnect.
c. There is a movement towards providing academic clouds, such as FutureGrid or Magellan.
d. Only true for intra-zone transfer (before July 1st, 2011). Also the request for data transfer is not free.