From prompt to PR: sandbox-validated dbt changes before human review
A five-person data warehouse team. 100+ concurrent requests. 5,000+ active ETLs. We tried the shortcut: let Claude generate SQL, have engineers review it. It failed: code generation without validation just shifts the burden. Engineers still had to mentally execute every model.
The real answer was full-stack sandbox validation. From prompt to PR, every AI-generated model clears four layers before a human sees it: SQL compilation, dbt tests (uniqueness, range, relationships), orchestration via auto-generated Airflow DAG, and lineage integrity in OpenMetadata. All four must pass. Failures loop back to the agent, not the engineer.
Guardrails are structural: read-only prod service account, macro-enforced dev schema isolation, ephemeral Kyuubi compute per run. In production, 45% of new models carry the "ai-generated" tag.
You'll leave with the full-stack sandbox architecture, the four-layer validation blueprint, structural guardrails, and a model for governed AI promotion via explicit PR reviews.
Check out more sessions
- View sessionTraining add-on
Becoming a dbt Architect
- Breakout session
Beyond standard metrics: extending dbt semantic models for high-accuracy Text-to-SQL
Junya Morita / Recruit Co.,ltd.View session - Peer exchange
Beyond the bottleneck: position your Analytics Engineering team as a strategic force
Kasey Mazza / HubSpotView session
