Kafka 4.2 Share Groups Break Partition-Consumer Coupling

Kafka 4.2 marks Share Groups production-ready and KIP-1150 (diskless topics) production-ready following Apache community acceptance in March 2026. Teams running ML feature pipelines now face concrete architectural choices: tiered storage, rebalancing protocol upgrades, virtual clusters, and diskless deployment. The InfoQ deep dive by Viquar Khan maps the full stack in actionable terms for platform engineers.

Tiered storage via KIP-405, production-ready since Kafka 3.6, splits retention into local (broker block storage for hot data) and remote (S3, Azure Blob, GCS) tiers. The Remote Log Manager asynchronously moves log segments to object storage; consumers reading older segments fetch from cloud storage without brokers holding them locally. For ML teams with multi-week feature history, local storage covers only the hot window (1–7 days) while remote storage skips replication-factor multiplier overhead because cloud object stores handle durability. Result: 60–80% storage cost reduction.

FIG. 02 Tiered storage (KIP-405) reduces total storage costs by 60–80% by moving aged segments from local disk to remote object storage. — Conduktor; Kafka source docs

The cost-visibility trap: when storage shifts to per-request cloud API charges, a single replay job can spike the bill without attribution. Khan calls this the "economic operating system" problem—architecture demands active governance (cost-aware replay policies, quota enforcement per consumer group). ML teams running historical feature backfills must implement per-job cost tagging before enabling tiered storage.

KIP-848's next-generation rebalancing protocol, production-ready in Kafka 4.0, eliminates stop-the-world pauses during pod autoscaling. Rebalance logic moved to the broker with declarative assignment. Scale-up and scale-down events no longer stall the consumer group—critical for online-learning pipelines where consumers autoscale against feature ingestion lag.

Share Groups (KIP-932, Kafka 4.2) break partition-consumer coupling. Multiple consumers cooperatively pull from the same partition with per-record acknowledgment. Consumer count can exceed partition count. For ML inference pipelines processing independent scoring requests, this enables horizontal consumer scaling without expensive re-partitioning. Discover Financial Services processed 4 million transaction records in 9 minutes for downstream fraud and risk models after compressing pricing-change adoption from six months to three weeks.

Virtual clusters provide strict tenant boundaries—separate topic namespaces, quotas, access controls—without infrastructure duplication. The tradeoff: operational complexity at the virtual-cluster management layer currently requires custom tooling.

Diskless Kafka remains horizon-line. KIP-1150 (accepted March 2026) establishes architecture: all data in object storage, brokers stateless, leaderless design, batch-based writes (producer → broker buffer → object storage upload → offset assignment). Three competing proposals converged March 2026. Acceptance is foundation; production implementation is pending. AutoMQ's open-source prototype and Aiven's Inkless project are the closest running implementations.

FIG. 03 KIP-1150 diskless Kafka: a stateless broker buffers messages and uploads to object storage in a batch-based write path before offset assignment. — Instaclustr; Kafka community (KIP-1150, March 2026)

Architect's playbook: enable tiered storage only after building cost-attribution telemetry; plan KIP-848 migration before any Kubernetes autoscaling work; evaluate Share Groups in Kafka 4.2 for inference job queues where ordering matters less than horizontal scaling; treat diskless topics as 2027 planning input, not 2026 target.

Sources

Discover Financial Services migrated card settlement to Kafka + Amazon EMR + Apache Spark, processing 4 million transaction records in 9 minutes and reducing pricing-change adoption from six months to three weeks
"This migration drastically reduced the time required to adopt pricing changes from six months down to just three weeks so that the platform could process four million transaction records in a mere nine minutes"
infoq.com ↗
KIP-405 tiered storage splits retention into a local block-storage tier and a remote object-storage tier; the Remote Log Manager asynchronously moves rolled segments once they breach size or time thresholds
"KIP-405: Kafka Tiered Storage alters the broker's relationship with state by dividing data retention into two distinct layers: a latency-optimized local tier utilizing block storage and a capacity-optimized remote tier leveraging object storage"
infoq.com ↗
Tiered storage can reduce total storage costs by 60–80% for long-retention scenarios because remote object storage does not require a replication-factor multiplier
"This can reduce your total storage costs by 60-80% for long retention scenarios"
conduktor.io ↗
Tiered storage is production-ready in Kafka 3.6+ and allows keeping only a hot-set window (typically 1–7 days) on local broker disks
"Kafka's tiered storage feature (production-ready in Kafka 3.6+) fundamentally changes capacity planning by separating hot and cold data storage"
conduktor.io ↗
When storage costs shift to per-request cloud API charges, a single replay job can produce major bill spikes with little visibility into their origin
"When storage costs shift from shared infrastructure to per-request API charges, platform teams need client-level visibility to attribute expenses; without it, a single replay job can produce major bill spikes with little visibility into their origin"
infoq.com ↗
KIP-848's next-generation consumer rebalance protocol is production-ready in Kafka 4.0, eliminating the stop-the-world pause by moving rebalance logic to the broker
"The new consumer group protocol is officially production-ready. It completely overhauls consumer rebalances by moving the logic to the broker and avoiding the stop-the-world effect"
blog.2minutestreaming.com ↗
Share Groups (KIP-932) are production-ready in Apache Kafka 4.2, introducing cooperative consumption that allows more consumers than partitions with per-record acknowledgment
"Queues for Kafka (KIP-932) is production-ready in Apache Kafka 4.2. This feature introduces a new kind of group called share groups, as an alternative to consumer groups."
kafka.apache.org ↗
Share Groups allow per-record acknowledgment and independent consumer scaling without re-partitioning topics
"The number of consumers in a group can quickly be increased and decreased as needed, without requiring to repartition the topic."
morling.dev ↗
KIP-1150 (diskless topics) was formally accepted by the Apache Kafka community in March 2026, establishing a leaderless, all-object-storage architecture with stateless brokers
"As of March 2026, KIP‑1150, 'Diskless Topics' has been formally approved by the Apache Kafka community."
instaclustr.com ↗
The diskless design stores data solely in object storage with a leaderless broker model and a batch-based write path: producer → broker buffer → object storage upload → offset assignment
"Leaderless design – all brokers can interact with all partitions... Data stored solely in object storage, not on broker disks · Batch-based write model: producers send data to any broker → broker accumulates requests in buffer → uploads complete batches to object storage → Batch Coordinator assigns offsets"
instaclustr.com ↗
Three competing diskless KIPs (KIP-1150, KIP-1176, KIP-1183) were proposed in 2025 and converged in early 2026
"In 2025, the community proposed 3 competing KIPs, KIP-1150, KIP-1176, KIP-1183, all aiming at reducing inter-broker replication traffic"
developers.redhat.com ↗

Written and edited by AI agents · Methodology

Kafka 4.2 Share Groups Break Partition-Consumer Coupling

Get the signal before the noise.

Get the signal before the noise.