Databricks Lakebase Eliminates 10× Latency Spikes at ERGO Hestia

ERGO Hestia has moved its pricing stack of more than 100 models to Databricks Lakebase and Mosaic AI Model Serving Endpoints. According to the Databricks case study, the move resolves the batch-window latency spikes and off-peak deployment restrictions that previously hindered real-time B2C pricing. The Polish insurer's platform, which draws on more than 1,000 variables to produce millisecond-level quotes, previously exported data from a Databricks medallion pipeline to an external Azure PostgreSQL database, then passed it through a custom adapter and caching layer, and finally into the pricing engine.

This multi-step process resulted in three production constraints: governance and lineage were split between Databricks and PostgreSQL, complicating auditability; model updates required off-peak deployment windows to avoid destabilizing the external adapter layer; and large data ingestions caused 10× to 20× latency spikes in the external serving layer, necessitating batch-window refreshes and preventing same-day pricing adjustments.

The new architecture keeps data and serving within the lakehouse. Lakebase offers a relational transactional layer on top of Delta tables, with Sync Tables synchronizing processed data to the serving layer without manual extraction. Mosaic AI Model Serving Endpoints expose results directly to the pricing engine via API, eliminating the intermediate application and external database. Unity Catalog manages both data and model lineage, preserving historical training sets and model versions to meet Polish insurance audit requirements.
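To make the "endpoints expose results directly via API" step concrete, here is a minimal sketch of how a pricing engine might build a scoring request for a Databricks Model Serving endpoint. The `/serving-endpoints/<name>/invocations` path and the `dataframe_records` payload follow Databricks' documented REST interface. The workspace host, the endpoint name `pricing-model`, and the two feature names are hypothetical and do not come from the case study. The request is built but deliberately not sent.

```python
import json
import urllib.request


def build_quote_request(host: str, endpoint: str, token: str,
                        features: dict) -> urllib.request.Request:
    """Build (but do not send) a scoring request for a single quote."""
    url = f"https://{host}/serving-endpoints/{endpoint}/invocations"
    # One record per quote; keys are the model's input feature names.
    body = json.dumps({"dataframe_records": [features]}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Hypothetical values for illustration only.
req = build_quote_request(
    host="example-workspace.azuredatabricks.net",
    endpoint="pricing-model",
    token="REDACTED",
    features={"vehicle_age": 7, "driver_age": 41},
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would issue the call. A real pricing stack with 1,000-plus variables would presumably send far wider records than this two-field example.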

The case study lacks operational specifics. It mentions a 10×–20× latency spike as a legacy issue but does not provide p50 or p99 benchmarks for the new stack, nor does it list instance types, GPU hours, or per-inference costs. It claims faster model shipping and instant market response but does not provide deployment-frequency metrics or lead-time reductions. The only concrete scale metrics provided are the 100-plus models and 1,000-plus variables; millisecond-level latency remains a target, not a verified figure. Performance and velocity claims should be considered directional until independent production telemetry is available.
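As an illustration of the telemetry that would turn these claims into verifiable figures, the sketch below computes p50 and p99 latency with a nearest-rank percentile and expresses a spike as a ratio of p99 values. The sample latencies are invented for the example and are not ERGO Hestia measurements. Comparing p99 to p99 is also an assumption, since the case study does not say which statistic its 10x to 20x figure refers to.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of
    all samples at or below it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def latency_report(baseline_ms, during_ingest_ms):
    """Summarize steady-state latency and the spike seen during ingestion."""
    base_p99 = percentile(baseline_ms, 99)
    ingest_p99 = percentile(during_ingest_ms, 99)
    return {
        "p50_ms": percentile(baseline_ms, 50),
        "p99_ms": base_p99,
        "ingest_p99_ms": ingest_p99,
        "spike_ratio": ingest_p99 / base_p99,
    }


# Invented sample data, in milliseconds.
baseline = [4, 5, 5, 6, 6, 7, 7, 8, 9, 12]
during_ingest = [v * 15 for v in baseline]
report = latency_report(baseline, during_ingest)
print(report)
```

Publishing a report of this shape for both the legacy and the Lakebase stacks, measured under comparable load, would let readers check the latency claim directly.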

FIG. 02 Legacy multi-hop architecture created a bottleneck in the external adapter layer; Lakebase unifies data and serving within the lakehouse. — Databricks, ERGO Hestia case study

The migration effort from Azure PostgreSQL and custom adapter code to Lakebase is not quantified in engineering hours or downtime, and the regression risk of moving more than 100 models off a custom caching layer is unexamined. Replacing an external database with Databricks-managed Sync Tables and Model Serving Endpoints introduces new vendor boundaries, and the failure modes of synchronization are not characterized. For teams outside the Databricks ecosystem, replicating this pattern requires adopting Unity Catalog's governance model or building an equivalent audit bridge, an integration cost the case study does not address. The architecture is described as positioning the team for real-time B2C pricing, but the case study does not make clear whether real-time pricing is already running in production.
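One synchronization failure mode a team could characterize for itself is staleness: how far the serving copy trails the source table. The sketch below is generic and assumes the team can obtain two timestamps, the latest source commit and the latest completed sync. It does not use any Databricks API, and the function names and the 60-second budget are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def sync_lag(source_commit: datetime, last_synced: datetime) -> timedelta:
    """How far the serving copy trails the source; zero once caught up."""
    return max(source_commit - last_synced, timedelta(0))


def is_stale(source_commit: datetime, last_synced: datetime,
             budget: timedelta) -> bool:
    """True when the lag exceeds the freshness budget."""
    return sync_lag(source_commit, last_synced) > budget


# Illustrative timestamps only.
commit = datetime(2025, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
synced = datetime(2025, 1, 1, 11, 58, 30, tzinfo=timezone.utc)
lag = sync_lag(commit, synced)
print(lag.total_seconds(), is_stale(commit, synced, timedelta(seconds=60)))
```

Tracking this lag over time, especially during large ingestions, would show whether the new serving layer avoids the stale-data windows that the batch-refresh approach imposed.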

What an architect might consider adopting: replacing fragmented extract-transform-load serving chains with a managed relational layer co-located with the lakehouse and fronted by direct model endpoints, provided that the added vendor dependency is acceptable and that the catalog's audit primitives satisfy the applicable compliance framework.

Sources

ERGO Hestia operates 100+ pricing models and 1,000+ variables, delivering millisecond-level quotes
"ERGO Hestia, one of Poland's leading insurance companies, operates a large-scale pricing platform supporting over 100 models and 1,000 variables"
databricks.com ↗
Previous multi-hop architecture: Databricks medallion pipeline → Azure PostgreSQL → custom adapter/caching layer → pricing engine
"Databricks ingested and transformed pricing data through its medallion architecture, then exported processed datasets to an external Azure PostgreSQL database. An intermediate adapter layer handled caching and exposed data to the pricing engine"
databricks.com ↗
Large data ingestions triggered 10x to 20x latency spikes in the external serving layer, forcing batch-window data refreshes
"these updates often triggered 10x to 20x latency spikes in the external serving layer, which effectively restricted data refreshes to timed batch windows"
databricks.com ↗
Model updates had to be scheduled for off-peak hours due to external adapter layer risk
"updates were often scheduled for off-peak hours to ensure system stability. This specialized orchestration limited the frequency of updates during the business day to avoid performance risks in the external adapter layer"
databricks.com ↗
Lakebase provides a relational transactional layer directly on Delta tables with Sync Tables for continuous data synchronization
"Databricks Lakebase provides a relational transactional layer directly on top of Delta tables. By using Sync Tables, the team enabled continuous, automatic synchronization between processed data and the serving layer"
databricks.com ↗
Unity Catalog provides full traceability of decisions and retains historical training sets and model versions for audit
"integrates data and model management to ensure full traceability and long term retention of historical training sets and model versions. This architecture provides Pricing experts with a reliable audit trail to ensure every decision remains fully traceable and verifiable"
databricks.com ↗

Written and edited by AI agents · Methodology
