Framework · Astral Forest

The Five Waves of AI in Data Teams

A practical map of how AI moves through a data team, from copilots to autonomous agents. Where most teams are today, and what the next 12 to 18 months actually look like.

By Michał Dębski·10 min read·For CDOs and Heads of Data

The reason most AI transformation programs underperform is that they price themselves on Wave 1 numbers and ship Wave 1 outcomes. Developers get a Copilot subscription, the org claims AI adoption, productivity moves 5 to 15 percent, and the operating model stays exactly what it was. That is the floor of what is possible, and most teams stop there.

Wave 2 is the territory where agents do the work and humans approve. The components exist today: MCP-capable agents like Claude Code, Cursor, and Cline; enterprise ticket systems like Azure DevOps, Jira, and Linear; spec sources like SharePoint, Confluence, and Notion. Most data teams already own all of them. The blocker is process design and approval discipline, not only technology.

Wave 3 is real but further out. The technology is in place; governance maturity is not, and that is the actual work between Wave 2 and Wave 3. Per-repo service principals, audit trails for agent actions, cost controls on autonomous loops: build them, and Wave 3 follows. The realistic horizon is 18 months.

Wave 2 stalls on approval discipline, not only on missing tools.
The framework

Five waves describe how AI moves through a data team

Wave 1 is where most organizations sit today: individual copilots, no change to the team or the lifecycle. Wave 5 is research. The work between Wave 2 and Wave 3 is where the next 12 to 18 months of real productivity gains happen, and that is where this guide spends most of its pages.

Diagram 1 · The Five Waves curve
Wave
Productivity uplift
Team composition change
Lifecycle change
1 · Humans with Copilots
~10%
None
None. Same lifecycle, same handoffs.
2 · Humans with Agents
Up to 30%
None. Same roles, new tools.
Agents execute tasks; humans approve at two checkpoints per task.
3 · Agents with Humans
30 to 50%
Roles shift toward review and governance.
Agents own the loop; humans own the checkpoints.
4 · Agents
TBC
Smaller, more senior teams.
No human in production execution.
5 · Autopoietic Agents
TBC
TBC
Self-modifying agent ecosystems.
Honest framing Wave 4 and 5 are real research directions, but pricing your transformation against them today is fiction. This guide stays inside Wave 1 to Wave 3, which is the territory where both the numbers and the governance are real.
Wave 1 today

Where most teams actually are

Today, everything stays inside the client tenant. No AI in the workflow. The numbers below describe a high-complexity data engineering task, the kind that comes up several times a month in any serious data team.

Diagram 2 · Wave 1, as-is setup

End-to-end time per high-complexity task: 11 person-days.

Developer effort: 8 person-days (~73% of total). Five days to analyze, discuss, implement. Two and a half days of sandbox testing. Half a day for PR and peer review.

Joint-team validation: 3 person-days (~27% of total). UAT, deployment, manual data reload. Unchanged across all waves. Joint responsibility of project manager, key user, and the data and analytics team.

Validation stays roughly constant across all waves. The developer's hands-on time is the lever, and Wave 2 pulls it first.
Wave 2 in practice

Humans with Agents

The same lifecycle as Wave 1, but the agent reads the team's tickets, specs, and code, then proposes and executes work. The developer keeps two clear approval gates: one before any code is written, one before the PR is merged.

Diagram 3 · Wave 2 swimlane

Human in the loop on every decision

Developer triggers the agent, accepts the proposed solution, runs sandbox tests, and approves the PR. Two clear approval gates: before code is written and before merge.

Developer effort minus 75% per task

Sample high-complexity task: 0.5 to 2 person-days vs. 8 person-days baseline. End-to-end 4 to 5 person-days per task, dominated by 3 person-days of joint validation that does not change.

Agent works from the same artefacts

DevOps task and comments, linked epics, SharePoint mappings. The agent reads them all before proposing a solution. Context is built from existing sources of truth, no parallel knowledge base.

Two clear developer checkpoints

Developer accepts the proposed solution before any code is written, then approves the PR before merge. Between those gates, the agent does the work and the developer reviews artefacts.

What you need to be in Wave 2

  • MCP-capable agent (Claude Code, Cursor agent, Cline)
  • Read access for the agent to your ticket system (Azure DevOps, Jira, Linear)
  • Read access to specs (SharePoint, Confluence, Notion)
  • Sandbox environment per developer (existing CI/CD usually suffices)
  • Per-task human approval gate before any code is written
  • Per-PR human approval gate before merge
  • An honest baseline measurement: how many person-days does one high-complexity task take today?
With a global food and ingredients manufacturer, we cut developer effort per high-complexity task from 8 person-days to 0.5 to 2 person-days, moving the team from Wave 1 to Wave 2.
Looking ahead

Wave 3: Agents with Humans

Wave 3 changes who initiates the work. The agent polls Azure DevOps on a schedule, proposes a solution as a task comment, implements on "do it" approval, and opens a draft PR. Two human checkpoints remain: comment approval and PR review.

Diagram 5 · Wave 3 swimlane
Honest framingMost organizations are 12 to 18 months from Wave 3. The work between Wave 2 and Wave 3 is governance maturity: per-repo service principals, agent action audit trails, cost controls on autonomous loops. The technology is already in place. This work cannot be shortcut.
How to start

A 90-day path from Wave 1 to Wave 2

Most Wave 1 to Wave 2 attempts stall because the team tries to transform too much at once. The pattern that works is bounded: one workflow, one ticket system, one developer team. Measure the baseline before you change anything. The 90 days below assume that scope, and they assume you start on Monday.

Diagram 6 · 90-day path
Days 1 to 30

Pick one workflow. Measure the baseline.

Identify one repeating, high-volume, low-risk task type. Measure the actual person-days per task today across three people, three tasks each. This is the number Wave 2 has to beat.

Days 31 to 60

Wire one agent to one source of truth.

Set up MCP for your ticket system. Set up a per-developer sandbox. Codify the CLAUDE.md for that one workflow. Run with the agent on 5 to 10 tasks and measure new person-days per task.

Days 61 to 90

Measure, expand, codify.

Compare new effort to baseline. Calculate uplift honestly. Document what works and what does not. Add a second workflow. Repeat the setup pattern.

Three traps to watch for

Context discipline. Without a curated CLAUDE.md and knowledge base, the agent produces plausible-looking work that does not match your conventions. Treat context as the product.

Approval fatigue. Developers stop reading the agent's proposals carefully after the first week. Build in spot-check audits.

Governance ambiguity. "Who owns this?" is the question that stalls more Wave 2 rollouts than any technical issue. Decide before you start.

The full field guide

Get the 30-page field guide as a PDF

Everything on this page, plus the deeper material: Jira ticket agent vision, self-assessment grid, objection handling for Genie / Cortex / Copilot for Fabric, and a longer walk-through of the 90-day path with worked numbers.

The AI-Augmented Data Product Lifecycle

PDF · 30 pages · For CDOs and Heads of Data · Free

Download the field guide (PDF, 30 pages)

What is in the PDF that is not on this page

  • Jira ticket agent: discovery call to refined backlog in 30 min
  • Wave 2 swimlane: full 15-step worked example
  • Self-assessment grid: 4 dimensions × 5 waves
  • Genie / Cortex / Copilot for Fabric: when and where they fit
  • Wave 2 checklist with measured baselines
  • Three traps and how teams that ship Wave 2 avoid them
Honest answers

Straight answers to the questions your team will ask.

About the framework

What are the Five Waves of AI in data teams?

The Five Waves describe how AI moves through a data team. Wave 1 is Humans with Copilots (individual IDE assistants, around 10 percent productivity uplift). Wave 2 is Humans with Agents (AI agents execute tasks, humans approve at two checkpoints, up to 30 percent). Wave 3 is Agents with Humans (agents own the loop, humans own the checkpoints, 30 to 50 percent). Wave 4 (Agents) and Wave 5 (Autopoietic Agents) are research, TBC at production scale.

Which wave is most data teams in today?

Most data teams are in Wave 1. Developers use copilots in their IDE, the organization claims AI adoption, productivity moves 5 to 15 percent, and the operating model stays unchanged. This is the floor of what is possible with current technology, and most teams stop there.

What is the difference between Wave 1 and Wave 2?

Wave 1 has copilots inside the IDE only, with no change to the team or the lifecycle. Wave 2 has agents reading the team's tickets, specs, and code, then proposing and executing work while humans approve at two checkpoints per task. The lifecycle changes; team composition does not.

Is Wave 3 realistic in 2026?

Most organizations are 12 to 18 months from Wave 3. The technology is already in place. The work between Wave 2 and Wave 3 is governance maturity. This work cannot be shortcut.

How is this different from Databricks Genie or Snowflake Cortex?

Genie, Cortex, and Copilot for Fabric are natural-language interfaces sitting inside the data platform. They are complementary, not competitive. The agent pattern in the Five Waves framework sits across the full toolchain (tickets, specs, code, tests, PR), in the layer where dbt code, dbt tests, and merged PRs are produced.

About getting to Wave 2

How long does Wave 1 to Wave 2 take?

Eight to twelve weeks for one bounded workflow on existing infrastructure. The 90-day path: Days 1 to 30 pick one workflow and measure the baseline. Days 31 to 60 wire one agent to one source of truth. Days 61 to 90 measure, expand, and codify into a playbook.

Do I need new tools to reach Wave 2?

Most data teams already own everything they need. The components are: an MCP-capable agent (Claude Code, Cursor agent, Cline), an enterprise ticket system (Azure DevOps, Jira, Linear), and a spec source (SharePoint, Confluence, Notion). The blocker is process design and approval discipline, not tooling.

What governance changes are needed for Wave 2?

Two clear human approval gates per task: one before code is written (developer accepts the proposed solution) and one before merge (developer approves the PR). The agent has read-only access to the team's sources of truth. For Wave 3, governance maturity extends to per-repo service principals, agent action audit trails, and cost controls on autonomous loops.

If you want help getting there

Wave 1 to Wave 2 with your team in one day

The one-day workshop is the same framework, applied to your stack, with your team in the room. We measure your baseline, design one Wave 2 workflow live, and ship a working setup by 16:30.

See the workshop
MD

Michał Dębski

Co-Founder · Astral Forest

Michał works with CDOs and Heads of Data who want a peer in the room, someone who builds, not only presents. Astral Forest is a boutique data consultancy. We architect, we build, and we hand the keys over.