Skip to content

Transformation and Modelling

We will use dbt for any processig and modelling that will be done to the data. When we curate the data, we will move the curated to an analytical database.

Clickhouse

We will be using clickhouse for the Analytics database. What I have learned is that the OSS does not have an in-built UI but you can use third-party app for this.

TODO: Fine-tuning Clickhouse for larger datasets

DuckDB

We will use Duckdb and Iceberg connected to S3. This is an experiment that can change after the first project trial.