Why Are Data Integration Practitioners Feeling the Ground Shift?
If your job includes being on call, you’ve already learned an important rule:
Anything that only works in the demo will eventually page you.
Usually at 2:47am…right after a “minor” upstream change…with the extremely helpful alert message:
“job running”
AI hasn’t changed that rule. It just made the blast radius bigger.
Most AI failures don’t explode loudly. They quietly deliver outputs built on late, partial, or misunderstood data—and do it with complete confidence. When someone finally asks “can we trust this?”, the answer usually traces back to the same place it always has: the data pipelines everyone assumed were fine.
If you’re responsible for CDC-powered data integration, whether or not you operate (or manage) Rocket® Data Replicate and Sync (RDRS) specifically, this should feel uncomfortably familiar. AI isn’t creating new problems. It’s dragging existing ones into the spotlight.
This post isn’t a checklist or a tutorial. It’s a framing conversation you’re likely already having as AI starts depending on the same systems you’ve been keeping upright at night for years.
CDC Does What It’s Designed to Do — and That’s the Point
Change Data Capture isn’t the weak link in most failures. In many cases, CDC is doing exactly what it was configured to do. The issues practitioners keep running into live around CDC, not inside it:
- Schema changes that replicate cleanly but alter meaning
- Fields that get repurposed without downstream awareness
- Mapping assumptions that stay static while sources evolve
- Context that exists at capture time but is lost as data fans out
Well-constructed CDC will faithfully deliver those changes. The question data integration owners have to answer is whether the right context, controls, and expectations traveled with the change.
That responsibility doesn’t end at the CDC tool boundary—it never has.
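To make that concrete, here’s a minimal, purely hypothetical sketch; the table, the field names, and the cents-to-dollars switch are invented for illustration. Both versions of the event pass a structural check, nothing pages anyone, and every downstream sum is quietly off by a factor of 100.
```python
# Hypothetical example: a replicated field keeps its name and passes every
# structural check, but the source team quietly changes its meaning
# (an amount stored in cents becomes an amount stored in dollars).

EXPECTED_FIELDS = {"order_id": int, "amount": (int, float)}

def structurally_valid(event: dict) -> bool:
    """The only check many pipelines actually perform."""
    return all(
        field in event and isinstance(event[field], expected)
        for field, expected in EXPECTED_FIELDS.items()
    )

# Before the upstream change: 'amount' carries cents.
before = {"order_id": 1001, "amount": 4999}    # $49.99
# After the change: same field, still numeric, now dollars.
after = {"order_id": 1002, "amount": 49.99}    # also $49.99

for event in (before, after):
    print(event, "valid:", structurally_valid(event))
# Both print valid: True. CDC delivered exactly what it was told to deliver;
# the meaning shifted underneath every consumer that sums this column.
```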
Context Isn’t Academic — It’s What Decides Whether You Get Paged
As practitioners, you don’t debate semantics. You debug outcomes. Context is the stuff you only notice when it’s missing:
- Two systems disagree and nobody knows which one wins.
- A column changes type and nothing crashes…but everything lies.
- A structurally valid pipeline produces data that’s no longer trustworthy.
Those problems existed before AI, but when data is reused for analytics, automation, or AI, that missing context doesn’t raise alarms—it just quietly corrupts outcomes.
In RDRS environments, that context shows up in very concrete, technical forms:
- Imported metadata and copybooks.
- Keys, field mappings, CCSIDs, and structure.
- Captured DDL events (where supported) that record what actually changed.
That technical context is a prerequisite for data to survive change without silently misleading downstream consumers—including analytics and AI systems that don’t pause to ask questions.
And yet, most pager incidents don’t start inside CDC. They start where replicated data meets assumptions: business logic layered on top, downstream models that encode yesterday’s meaning, or consumers that never expected “valid but different.”
RDRS preserves reality. The hard part is making sure reality stays understood as data moves further away from the source.
What Is the Demo-to-Production Wall?
It’s where your sleep goes to die. Demos won’t page you at 3am. Production will. Demos don’t deal with:
- Production restart decisions
- Partial failures and lag
- Schema drift over time
- Downstream consumers that assume stability
Production does, and the failure modes are familiar:
- “Trust” lives in Slack threads.
- Every pipeline is “slightly unique”.
- Governance is a post-incident slide.
- Schema changes turn into archaeology.
If you’ve ever chased an issue where nothing is broken but everything is off, you already know why AI makes this harder—not easier:
- AI doesn’t hesitate when data is questionable.
- It doesn’t slow down when signals are incomplete.
It Just. Keeps. Going.
What Does “AI-Ready” Mean to People Carrying the Pager?
It doesn’t mean new tools or smarter models. It means you can explain what happened without guessing. What’s showing up now isn’t new theory—it’s pressure intensifying on systems you already own:
- Trust stops being a vibe: In production, trust is what lets you sleep through the night.
For teams running RDRS, that usually comes from:
- CDC restart points you actually believe
- Repository state you can interrogate
- The ability to answer “how far behind are we?” without squinting
Those same signals tend to be the first things AI and analytics teams ask for—usually right after their first incident review.
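As a sketch of what “without squinting” can look like, here’s a minimal lag check. It assumes you can obtain the source-commit timestamp of the last applied change from somewhere in your environment (logs, a monitoring endpoint, a metrics export); the function names and threshold are placeholders, not RDRS APIs.
```python
# Minimal sketch: answer "how far behind are we?" with a number instead of a
# squint. How you obtain last_applied_source_ts is environment-specific; the
# threshold and alert path below are placeholders.

from datetime import datetime, timezone

LAG_THRESHOLD_SECONDS = 300  # assumption: five minutes is acceptable for these consumers

def lag_seconds(last_applied_source_ts: datetime) -> float:
    """Lag = now minus the source-commit timestamp of the last change applied at the target."""
    return (datetime.now(timezone.utc) - last_applied_source_ts).total_seconds()

def check_feed(name: str, last_applied_source_ts: datetime) -> None:
    lag = lag_seconds(last_applied_source_ts)
    status = "ALERT" if lag > LAG_THRESHOLD_SECONDS else "OK"
    # Replace print() with whatever actually wakes someone up (or doesn't).
    print(f"{status} {name}: {lag:.0f}s behind source (threshold {LAG_THRESHOLD_SECONDS}s)")
```
The exact number matters less than the fact that a machine, not a tired human, is the one watching it.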
- Pipelines are feeding AI whether you planned it or not: AI systems don’t politely wait for curated tables. They consume whatever is connected. If you’re already:
- Streaming CDC into Kafka or Event Hub
- Writing JSON or Avro with explicit schemas
- Fanning replicated data out to multiple targets
…then you’re already running part of the upstream data supply chain that AI systems consume. The deciding factor isn’t speed—it’s whether those feeds are labeled, observable, and resilient when things change.
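If labeling is the deciding factor, one low-tech version of a label is an explicit envelope around each change. The sketch below is assumption-heavy; the envelope fields, table name, and version string are invented, and this is not a format defined by RDRS, Kafka, or Event Hub. The point is only that a feed that announces its own schema identity is much easier for analytics and AI consumers to treat skeptically.
```python
# One way (of many) to label a CDC event before it fans out: wrap the row image
# in an envelope that carries its own schema identity and provenance. The field
# names and envelope shape are illustrative only.

import json
from datetime import datetime, timezone

def wrap_change(source_table: str, schema_version: str, op: str, row: dict) -> bytes:
    envelope = {
        "source_table": source_table,      # where the change came from
        "schema_version": schema_version,  # bumped when structure or meaning changes
        "op": op,                          # insert / update / delete
        "captured_at": datetime.now(timezone.utc).isoformat(),
        "payload": row,
    }
    return json.dumps(envelope).encode("utf-8")

msg = wrap_change("ORDERS", "v7", "update", {"order_id": 1002, "amount": 49.99})
# msg is what you would hand to your Kafka or Event Hub producer; downstream
# consumers can quarantine anything that arrives with an unexpected schema_version.
```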
- Governance shows up at runtime, not in a wiki: Nobody reads governance docs at 3am.
What matters in that moment:
- Who is this agent?
- Is this connection secure right now?
- What changed since yesterday?
RDRS contributes the mechanics—agent identity, secured connections (TLS / AT‑TLS where configured), explicit source‑to‑target paths. Practitioners still own the surrounding controls, expectations, and enforcement downstream.
That boundary isn’t a flaw. It’s how production systems stay survivable.
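For the downstream half of that boundary, a small admission gate in front of a feed can turn the three 3am questions into yes/no checks. Everything in this sketch is an assumption: the agent allowlist, the per-table expected schema version, and the idea that your transport layer records whether a hop was TLS-protected.
```python
# Sketch of a downstream admission gate: refuse to pass data along unless the
# three runtime questions have good answers. The allowlist, expected schema
# versions, and tls_protected flag are placeholders for whatever your
# environment actually records.

KNOWN_AGENTS = {"agent-prod-01", "agent-prod-02"}    # assumed inventory of replication agents
EXPECTED_SCHEMA = {"ORDERS": "v7"}                   # assumed "what we believed yesterday"

def admit(event: dict) -> bool:
    if event.get("agent_id") not in KNOWN_AGENTS:                  # who is this agent?
        return False
    if not event.get("tls_protected", False):                      # is this connection secure right now?
        return False
    table = event.get("source_table")
    if event.get("schema_version") != EXPECTED_SCHEMA.get(table):  # what changed since yesterday?
        return False
    return True
```
Whether a failed check quarantines, alerts, or blocks is a local decision; the point is that the questions get asked by a machine instead of by whoever happens to be awake.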
- Change is inevitable. Chaos isn’t: Every practitioner knows the law: change will happen. The only question is whether it wakes you up. RDRS environments are already designed for:
- Evolving structures
- DDL capture and replication where supported
- Preserving change order and restartability at the mechanics level
That doesn’t eliminate change pain—but it creates a foundation that lets pipelines bend instead of snap when upstream teams ship on Friday at 5pm.
- If you can’t see it, AI will break it faster: AI doesn’t add observability. It weaponizes the lack of it. If your current state is:
- “Job running” dashboards
- Lag discovered by customers
- Errors noticed in hindsight
AI won’t fix that. It will amplify it. The same signals practitioners already rely on—lag, errors, throughput, restartability—become non‑negotiable when AI depends on those feeds.
Architecture Debates Don’t Solve the Problem
Mesh vs. fabric won’t help you at 2am. What helps in that moment is being able to answer:
- Can we trace this?
- Can we stop it safely?
- Can we restart it without guessing?
- Can we say what changed since yesterday?
Practitioners already know this. The work hasn’t changed—who depends on it has.
What Does This Mean for Data Integration Teams Running RDRS?
If you manage or operate Rocket® Data Replicate and Sync, none of this should feel like a revelation. You already:
- Run agent‑based systems in production
- Depend on CDC that must restart cleanly
- Live with schema and structural change
- Trust runtime signals over optimistic dashboards
- Know that observability beats hope
The shift isn’t learning something new. It’s realizing that AI is now downstream of everything you keep stable. That makes your work more visible—and more valuable.
Your Turn
No action items. No Jira tickets. Just shop talk.
- What wakes you up today: lag, silent failures, schema changes, or sheer manual ops?
- Which capability helps you sleep better: restartability, metadata, monitoring, or security?
- What’s the next thing leadership wants to plug into your pipelines—analytics, streaming, AI—and what worries you about it?
Say the quiet part out loud. You won’t be alone. Your forum squad will appreciate it.
More coming.
As I mentioned, this post is just the framing conversation. What follows over the next several weeks will dig into the specific shifts data integration managers and practitioners are making as AI becomes another downstream consumer of CDC—one shift at a time, rooted in production reality.
