1 of 38

Dagster Components

Low Code and AI-Native DX Without The Mess

Nick Schrock (@schrockn) - CTO & Founder - 2025-04-16

2 of 38

Data leaders today �face a critical tradeoff between�accessibility �and power

2

3 of 38

Accessibility in a data platform

Without accessibility there is a massive coordination burden that grinds everything to a halt.

What does it mean?

Accessibility means practitioners on the platform can self-serve to build and operate data pipelines.

Why does it matter?

4 of 38

A powerful data platform

A powerful platform can adapt to business needs and technological change through shared tech, process, and workflows—enabled at scale by great engineering and a centralized, leveraged team.

A leveraged data platform team can drive organization-spanning outcomes. E.g:

  • Productivity improvements
  • Enforcing standards
  • Cost reductions
  • Deploying new technologies
  • Cross-cutting policy initiatives.

What does it mean?

Why does it matter?

5 of 38

The Practitioner and the Data (Platform) Engineer

Data Practitioner

  • Subject-matter expert
  • Titles: E.g., Data Scientist, Analytics Engineer, Researcher
  • Typically uses a single, domain-specific tool: E.g. notebook, dbt, ML tool

Data (Platform) Engineer

  • Knows technology and engineering process
  • Typically have limited context on the business domain

5

6 of 38

The Practitioner and the Data (Platform) Engineer

There are very few people who can fulfill both of those roles.

They are unicorns.

The collaboration truly works only with the right systems and abstractions in place.

🦄

Data Practitioner

Data (Platform) Engineer

6

7 of 38

Approaches That

We See To Achieve

Self-Serve Pipelines

Verticalized low-code tool or platform

Engineer-only production data stack

🍰

Build a bespoke DSL and framework over the orchestrator

👨🏽‍💻

🪦

Fragile tooling, maintenance overhead, and subpar experiences for stakeholders

Not powerful and flexible enough

Inaccessible to many SMEs and practitioners

7

8 of 38

Phase One: Low-code

• Demos well, and people happy, at first�

• However, not powerful or flexible enough�

• Engineers had to build parallel infrastructure for the business-critical pipelines�

• Three years in and the situation is untenable. Engineers are bottleneck, practitioners can’t do their work, everyone is unhappy.

A Platform Journey at BigCo

�Siloed, all-in-one tool

SME / Practitioner

8

9 of 38

Phase Two: Airflow

• Now BigCo migrated to Airflow �

• Engineers can get involved and write Python, stack on infrastructure

�• Practitioners tried to participate, project degraded, and most eventually failed, leading to Shadow IT and silo’ing�

• Central data team had to take over and become bottleneck, unhappy, and drowning in toil�

• Two years in, something must be done…

A Platform Journey at BigCo

DAGs / Business logic

Loosely structured project

Infrastructure

Engineer

DevOps/Infra Engineer

9

10 of 38

Phase Three: DSL �over Airflow

• Ambitious engineer has idea to build custom YAML DSL on top of Airflow

• Practitioners happy with demos and initial, simple use cases

• Custom DSL had limited tooling and delivered bad, difficult-to-debug, and eventually unowned experience

• Two years later, central data team had to again take over everything and become bottleneck, drowned in toil

A Platform Journey at BigCo

DAGs / Business logic

Loosely structured project

Infrastructure

Engineer

DevOps/Infra Engineer

SME / Practitioner

YAML

10

11 of 38

There are decades where nothing happens; and there are weeks where decades happen.

, Data Engineer at BigCo

Vladimir Lenin

12 of 38

Excruciating tradeoffs

12

13 of 38

Data leaders are navigating this grid

Powerful

Not Powerful

Not Accessible

Accessible

+

DAG-only

Modern

Low Code

Low Code

DSL

Bespoke DSL

13

14 of 38

Power/Accessibility Frontier

Powerful

Not Powerful

Not Accessible

Accessible

DAG-only

Modern

Low Code

Low Code

Bespoke DSL

Breaking through this barrier requires innovation

14

15 of 38

Goal of Dagster Components ↗️

Powerful

Not Powerful

Not Accessible

Accessible

DAG-only

Modern

Low Code

Low Code

Bespoke DSL

Dagster Components

15

16 of 38

Demo: The Practitioner Journey Onboarding Onto a Components Project

16

17 of 38

Demo Recap

  • Practitioner can onboard easily by cargoculting and scaffolding

  • Robust support with tooling and integrated documentation

  • Tribal knowledge of the platform and workflow spread through scaffolding and built-in documentation

  • Straightforward integration into broader platform

17

18 of 38

Innovation doesn’t happen �in a vacuum

19 of 38

Where do Components fit?

We’re entering a new era.

AI is coming.

20 of 38

The data ecosystem

Big Data Era

Modern Data Era

AI Era

Map-Reduce

Hadoop

Data Lakes

Cloud Data Warehouse

Modern Data Stack

AI-Assisted Codegen

???

Map-Reduce

Hadoop

Data Lakes

Cloud Data Warehouse

Modern Data Stack

AI-Assisted Engineering

Lakehouses

???

2006 - 2016

2016 - 2024

2024 - ???

20

21 of 38

AI Mainstreaming Now in Software Engineering

🤯 AI-assisted engineering exploding

🚀 Model Context Protocol

MCP hit inflection in March. Gamechanger for tools.

Generalized AI tools in software engineering are exploding (Cursor, Co-Pilot 1M+ users)

21

22 of 38

MCP Server 🚀🌙

Dec 1 2024

Mar 30 2025

Source: Google Trends

Mar 1 2025

23 of 38

Shadow AI: Vibe Coders are Coming…

You will have vibe coding stakeholders,

whether you like it or not.

24 of 38

AI Risk in Software Systems

Unconstrained AI can be a

hallucinating demon

🐛 Easy to introduce bugs difficult or impossible to understand

🦠 Technical Debt Superspreader

😏 The wrong person working with AI without guardrails is dangerous

24

25 of 38

Shadow AI Vibecoding Can Be a Bad Time

25

26 of 38

Natural language is an insufficiently precise input to produce reliable software in a completely unsupervised manner.

I said this

CTO, Dagster

27 of 38

27

28 of 38

Components is the first AI-optimized framework for data engineering

Data engineering can have

nice things too

29 of 38

Guardrails

Modular code containers and constrained target language

29

30 of 38

Guardrails

Modular code containers and constrained target language

30

31 of 38

Guardrails are essential

Modular Code Containers:

  • Technical blast radius for AI slop
  • Limits necessary context window
  • More replaceable

Constrained codegen target:

  • E.g. SQL, Yaml, Structured Framework
  • Humans can evaluate correctness
  • Engineers and tools can impose constraints

31

32 of 38

Dagster Components

Stack

Infrastructure

Modular Project Structure and Framework

Engineer

DevOps/Infra Engineer

SME / Practitioner

Tooling & Ecosystem

Components DSL

Components

32

33 of 38

What gives us the right to call ourselves an

AI-optimized framework for data engineering?

Unique hub of context

Guardrails

Visibility Across Dev Lifecycle

Integrations, Source Code, Lineage, Metadata, Operational Data

Modular Components defined using high-level, introspectable DSL

Dev → Test → CI/CD → �Branch → Prod

01

02

03

33

34 of 38

Demo: AI Components

34

35 of 38

Data Platform Uniquely Good AI �Use Case

Diverse Practitioners that need precise outcomes: e.g. Analysts, Data Scientists, Engineers

Wide surface area: Tons of tools and configuration knowable by LLM

Tribal Knowledge: Company-specific integrations, legacy tech, and uneven documentation

35

36 of 38

Conclusion

Data platform teams have historically had to make an excruciating trade offs between leverage and accessibility.

Components increase accessibility over a adaptable, powerful platform and abstraction layer

Components is built for the future, as an AI-optimized framework

36

37 of 38

What’s launching today

Integrations for Embedded ELT, DBT, and Pipes

Early access to Components for select partners

Coming soon: �The Components AI experience

37

38 of 38

Thank you!

Want to learn more?

Request a demo��dagster.io/request-a-demo

Join us on Slack��#dg-components

38