Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

C3 is a Java-based data analytics engine designed to make the ingestion and analysis of heterogeneous data sources
as painless as possible. The C3 provides a system to join joins data from multiple sources into a single unified federated data image.
With the federated data image defined, C3 then provides an API to access that data, and in the case of time-series data,
perform numerous transformations and computations all producing normalized time-series data at regular intervals.

C3 also supports R and Python Jupyter notebook analysis of the federated data image. These notebooks provide a
great way for researchers to analyze data close to where the data is stored. While C3 supports many data science
capabilities familiar to the researcher, some expected functionality may be missing.
For these cases, C3 supports
implementing new data processing functions in python and javascript.

Like any other API porting your own workflows will take some care and time to learn properly. Please leverage this guide
to make understanding C3's platform and porting your workflow as quick and easy as possible.

Services available from C3

  • Covid-19 Datalake: This unified federated Datalake includes data from numerous sources.
  • C3 computing platform:
  • C3 Integrated Development Studio
  • C3 Jupyter notebooks

How does C3 differ from traditional HPC systems?

...