C3 has established the C3 AI COVID-19 Data Lake, a federated data lake containing a multitude of different datasets all related in some way to the COVID-19 pandemic. Public access to the Data Lake is granted through a RESTful API (see documentation here: https://c3.ai/covid-19-api-documentation/).
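As a rough illustration of what a public API call might look like, the Python sketch below builds a POST request that asks one Type for a few records. The base URL, the `<type>/fetch` path layout, the `{"spec": ...}` request body, and the `outbreaklocation` Type name are assumptions made for illustration and should be checked against the documentation linked above. The helper names `build_fetch_request` and `fetch` are hypothetical.

```python
import json
import urllib.request

# Assumption: base URL and path layout; confirm against the API documentation.
BASE_URL = "https://api.c3.ai/covid/api/1"


def build_fetch_request(type_name, spec):
    """Build (but do not send) a POST request for a Type's fetch action."""
    url = f"{BASE_URL}/{type_name}/fetch"
    # Assumption: the fetch specification is wrapped in a "spec" object.
    body = json.dumps({"spec": spec}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )


def fetch(type_name, spec, timeout=30):
    """Send the request and decode the JSON response body."""
    request = build_fetch_request(type_name, spec)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Hypothetical usage: request a handful of records from one Type.
    result = fetch("outbreaklocation", {"limit": 5})
    print(json.dumps(result, indent=2))
```

Building the request separately from sending it makes it possible to inspect the URL and payload without touching the network.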

While this gives the public a rich interface to a large collection of data without the need to integrate multiple databases, members of the DTI can have more direct access. Behind the C3 Data Lake is a C3 package containing the definitions of every Type as well as instructions for how to fetch that data and integrate it into the Data Lake's data model.

By learning to use the C3 platform, researchers can leverage many of its capabilities, such as defining their own Metrics, training machine learning models on the platform, and using additional helper functions that make navigating the Data Lake and its data model easier.

Below, for the benefit of DTI researchers, we share an up-to-date diagram of the entire COVID-19 Data Lake data model.

Boxes denote C3 Types defined in the Data Lake package, while the connecting lines denote relationships between the Types. Each box contains a list of properties and, in most cases, their matching types.

...