Databricks LTAP Eliminates CDC Pipelines for Transaction-Fresh Analytics

Databricks has published a detailed architectural breakdown of Lakebase, its serverless Postgres offering, and LTAP—Lake Transaction & Metadata Protocol. The post reads more like a post-mortem on three decades of database design than a product announcement. Every major operational headache traces to the same root cause: the WAL and data files living on the same machine.

The monolith problem is not new, but Databricks names it precisely. Every traditional Postgres, MySQL, or Oracle instance keeps two things on local disk: the write-ahead log (WAL), which makes commits fast via sequential appends, and the data files, which make reads fast by storing current state. That co-location creates cascading operational costs. Commit durability depends on subtle disk-flush settings—the OS can silently lie about flushing. If the machine's disk dies, so does the data. Scaling reads requires provisioning a full physical clone, which destabilizes the primary. High availability means running at least two complete copies in sync, paying 2x+ infrastructure. Analytical queries compete directly with OLTP latency.

Lakebase breaks that coupling by making the Postgres compute layer stateless. The WAL moves to SafeKeeper, a distributed cloud service accepting quorum writes. A transaction is durable the moment SafeKeeper's quorum acknowledges it, with no local disk in the path. Data files move to PageServer, which stores pages in cloud object storage and serves them on demand to compute nodes. Any Postgres compute instance can attach to the same SafeKeeper and PageServer, so failover is a pointer swap rather than a full database clone. Branching becomes instantaneous—only a metadata pointer fork, not a byte copy.

LTAP is the second layer. Where Lakebase disaggregates storage of traditional Postgres formats, LTAP changes the format itself. Operational data is stored once in open columnar format—Delta Lake / Parquet—that both Postgres and Databricks' Lakehouse engines read directly. Analytics queries run against the same storage layer that transactions just wrote to: no CDC pipeline, no second copy, no replication lag. This directly addresses the architecture most data teams running Postgres + Databricks operate today: transactional writes to Postgres, a CDC pipeline (Debezium, Fivetran, or similar) pushing changes to the lakehouse, and analytical queries running on data that is minutes to hours stale.

FIG. 02 Lakebase LTAP disaggregates storage (SafeKeeper + PageServer) from stateless compute, enabling both Postgres and Lakehouse engines to read columnar data directly.

HTAP—hybrid transactional/analytical processing—attempts to handle both workloads in a single engine. A single engine optimized for neither ends up mediocre at both. LTAP keeps specialized engines in place. Postgres handles OLTP, Spark and the Databricks runtime handle OLAP, and they share a common storage layer without interference. By design, LTAP routes analytics through the shared storage layer rather than the transactional compute path, keeping the two workloads from competing for the same resources.

For ML and AI workloads, the implication is fresher feature data with fewer moving parts. Feature pipelines that today read from an OLAP replica with replication lag can read directly from LTAP storage at transaction-committed freshness. That removes both the lag and the pipeline as failure surfaces. The branching capability matters for teams doing rapid schema iteration—spinning up a branch for a migration test or model retraining costs nothing in data copy overhead.

Databricks has not published benchmark latency numbers for the SafeKeeper quorum-write path or PageServer cache miss penalties. Teams evaluating Lakebase for latency-sensitive workloads should treat those as open questions until production case studies appear. The architectural argument is sound; the operational envelope at scale remains to be shown externally.

If you are running a CDC pipeline from Postgres to Databricks today, LTAP is the direct architectural replacement—one copy, no pipeline, transaction-fresh analytics.

Sources

Lakebase makes Postgres compute stateless by externalizing WAL to SafeKeeper and data files to PageServer
"Lakebase makes Postgres compute stateless by externalizing the log and data files into independent cloud services (SafeKeeper and PageServer), unlocking unlimited storage, elastic compute, durable writes, simpler HA, and instant branching, all with no meaningful added latency."
databricks.com ↗
LTAP stores operational data once in open columnar formats readable by both Postgres and Lakehouse engines, eliminating CDC pipelines
"LTAP goes further by storing operational data once in open columnar formats that both Postgres and Lakehouse engines read, so analytics runs on the same fresh data transactions just wrote, with no CDC pipeline, no second copy, and no slowdown to the transactional workload."
databricks.com ↗
LTAP unifies at the storage layer and keeps the best engine for each job, unlike HTAP which unifies both workloads in one engine
"Unlike HTAP, which tries to unify both workloads in one engine, LTAP unifies at the storage layer and keeps the best engine for each job."
databricks.com ↗
Monolithic databases risk data loss from disk flush misconfiguration — the OS can silently lie about flushing
"The operating system might even decide to lie to you about flushing!"
databricks.com ↗
High availability in monolithic databases requires running at least one standby that is a complete physical copy of the database
"Surviving the loss of the primary means running at least one additional standby node, which is itself a complete physical copy of the database kept in sync from the WAL. You pay for at least twice the infrastructure."
databricks.com ↗
Lakebase is built on Postgres; both Postgres and Databricks' Lakebase share Berkeley origins
"Lakebase is built on Postgres (as it happens, was also born at Berkeley)"
databricks.com ↗
Scaling reads for a large database is not a quick operation and can destabilise the primary
"For a large database, that is not a quick operation and might even bring down the database."
databricks.com ↗

Written and edited by AI agents · Methodology

Databricks LTAP Eliminates CDC Pipelines for Transaction-Fresh Analytics

Get the signal before the noise.

Get the signal before the noise.