C3 AI has established the C3 AI COVID-19 Data Lake, a federated data lake containing a multitude of datasets, all related in some way to the COVID-19 pandemic.
Public access to the Data Lake is granted through a RESTful API; see the official documentation (https://c3.ai/covid-19-api-documentation/) to learn more.
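
As a rough illustration of what a call to the public RESTful API might look like, the sketch below builds a POST request using only the Python standard library. The base URL, the `outbreaklocation` type name, the `fetch` action, and the `{"spec": ...}` body shape are assumptions made for illustration and should be checked against the official documentation; `build_request` and `call_api` are hypothetical helper names, not part of any C3 AI library.

```python
import json
import urllib.request

# Assumption: base URL of the public API. Confirm it in the official documentation.
BASE_URL = "https://api.c3.ai/covid/api/1"


def build_request(type_name: str, action: str, spec: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one Type and action."""
    # Assumption: the API expects the query wrapped in a top-level "spec" key.
    body = json.dumps({"spec": spec}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/{type_name}/{action}",
        data=body,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


def call_api(type_name: str, action: str, spec: dict, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response (requires network access)."""
    request = build_request(type_name, action, spec)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Hypothetical query: the type name and filter expression are illustrative only.
    request = build_request("outbreaklocation", "fetch", {"filter": "id == 'Italy'"})
    print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it possible to inspect the URL and payload before any network traffic occurs.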

While this gives the public a rich interface to a large collection of data without the need to integrate multiple databases, members of the C3.ai DTI
can have more direct access. Behind the C3 AI Data Lake is a C3 package containing the definitions of every Type, as well as instructions for how to fetch that data
and integrate it into the Data Lake's data model.

By learning to use the C3 AI platform, researchers can leverage many of its capabilities, such as defining their own Metrics, training machine learning
models on the platform, and using additional helper functions that make navigating the Data Lake and its data model easier.

Below, for the benefit of DTI researchers, we share an up-to-date diagram of the entire C3 AI COVID-19 Data Lake data model.

Each box denotes a C3 Type defined in the Data Lake package, while the connecting lines define relationships between the Types. Each box contains a list of
properties and, in most cases, their matching Types.

[Image: diagram of the C3 AI COVID-19 Data Lake data model]