dbt is the transformation layer in the modern data stack, the T in ELT. Instead of hiding SQL in BI tools or scripts, dbt defines versioned, tested models directly in the warehouse, with lineage and documentation.

Do I need dbt, or is SQL in the BI tool enough?

For a single report, SQL in the BI tool is fine. Once multiple sources, teams, and dashboards build on the same definitions, hand-maintained SQL becomes a source of errors. dbt versions the logic in one place and tests it.

dbt Core or dbt Cloud?

dbt Core is open source and runs self-hosted, often orchestrated via Dagster or Airflow. dbt Cloud adds a scheduler, IDE, and hosting. Which one fits depends on your team, governance, and existing orchestration.

How does dbt prevent silent data errors?

Through tests and assertions: unique, not_null, accepted_values, referential checks, and custom rules run on every build. When an assumption breaks, the build fails before a wrong value reaches a dashboard.

Does dbt work with BigQuery?

Yes. dbt runs natively on BigQuery, Snowflake, and other warehouses. The models execute as SQL in the warehouse itself; dbt only orchestrates order, tests, and docs.

dbt: Transformation & Modelling Setup

dbt is the transformation layer that turns raw loads into a clean, traceable data model. Versioned, testable, documented. Exactly the foundation reliable BI and AI build on.

What is dbt?

dbt is the T in ELT. Data lands raw in the warehouse, dbt transforms it there into versioned SQL models, with tests, lineage, and docs. The logic no longer lives scattered across BI tools and scripts, but in one place, under version control.

That sounds like engineering discipline, and that's exactly what it is. The effect: a metric is defined the same way everywhere, an error is traceable, and new team members read the model instead of guessing it.

When dbt fits, and when it doesn't

A fit when:

multiple sources converge into one consistent model
the same definitions feed multiple dashboards
data quality and lineage need to be verifiable
a team brings SQL and git discipline

Less so when:

only a single report is needed
there's no modelled data foundation to transform
nobody maintains the models

Without dbt vs. with dbt

Criterion	SQL in the BI tool	dbt
Definitions	duplicated per dashboard	once, central
Tests	manual, if at all	on every build
Traceability	hard	lineage and docs
Versioning	none	git
Onboarding	knowledge in someone's head	a readable model

What Datascale builds with dbt

We set up the model and keep it maintainable:

project structure, staging and mart layers
models for the core marketing and revenue metrics
tests against silent data errors
orchestration via Dagster or dbt Cloud
lineage and docs for your team
connection to BI and, where it helps, to the AI layer

The full picture lives in Data Reliability & Governance and the Marketing Data Lakehouse. With us, dbt usually runs on BigQuery, fed from sources like funnel.io or Snowplow.

dbt: A Versioned Transformation Layer for Your Warehouse