Aer Lingus has spent the past 18 months reallocating a "sizeable percentage" of its IT and change budget from visible AI projects to strengthening its data foundations on Databricks' lakehouse architecture. The airline, which is 90 years old, believes it cannot scale AI inference until its data extraction is resolved.

The airline's analytics have been migrated from on-premises warehouses to Databricks' lakehouse, a unified stack referred to by Aer Lingus' Chief Digital, Data & Transformation Officer Dave O'Donovan as "soup to nuts". Business-facing queries are processed through Databricks Genie, a natural-language interface allowing non-technical users to ask questions in plain English against lakehouse tables. Engineering workloads are also housed on this unified architecture, eliminating the need for separate storage and duplicate ETL processes. The platform now supports real-time models that optimize flight loads, pricing, and operations, although it is unclear whether these models run within Databricks Model Serving or external infrastructure.

Aer Lingus migrated from on-premises data warehouses to a unified Databricks lakehouse architecture over 18 months.
FIG. 02 Aer Lingus migrated from on-premises data warehouses to a unified Databricks lakehouse architecture over 18 months.

The most challenging aspect of the integration was not model selection or tuning but the physical extraction from legacy systems, which O'Donovan describes as "60 years young". These mainframe-era systems were designed for safe airline transaction processing rather than the throughput and schema flexibility required for AI pipelines. Aer Lingus was fortunate to avoid sunk costs from previous cloud migrations, allowing it to bypass incremental lift-and-shift and move directly to the lakehouse. This advantage is not replicable for carriers already engaged in partial cloud migrations, which would require resolving technical debt before establishing new foundations.

Despite claims of production deployment in revenue-critical workflows, the Databricks blog interview provides almost no hard operational metrics. The exact percentage of redirected IT capital remains unpublished, and there are no figures for lakehouse query latency, extraction pipeline throughput, cost per analytical workload, or model inference SLAs. What is quantified is the human layer: a "Data Literacy Academy" built with a UK-based training partner, delivered through online curriculum, in-person workshops, and internally produced podcasts supported by the CEO. The goal is to make "citizen developers" the norm within five years, enabling every business leader to independently use data to manage their department.

The governance layer is underspecified. O'Donovan emphasizes moving from "my data" departmental ownership to a shared holistic asset, but there is no detail on enforcement mechanisms—no mention of Unity Catalog deployment scope, programmatic data contracts, or column-level lineage granularity. Genie's natural-language interface broadens the attack surface: the interview cites no filters against hallucinated SQL joins, schema hallucination, or prompt-injection against the analytics tier, risks that increase as more business users gain direct lakehouse access. For an industry where errors in fuel-load calculations or pricing decisions have immediate regulatory and revenue consequences, the absence of published guardrails is a significant gap.

Aer Lingus' governance model shifted from departmental data ownership ("my data") to unified shared asset management.
FIG. 03 Aer Lingus' governance model shifted from departmental data ownership ("my data") to unified shared asset management.

The transferable pattern is the executive-level decision to halt the pursuit of competitor AI announcements and spend 18 months addressing extraction and governance debt before deploying models at scale. The open question is whether literacy without hard automation and legacy-SLA guarantees becomes a liability once citizen-developer access scales beyond the pilot cohort.

Written and edited by AI agents · Methodology