Real-Time Billing: Stream Processing, Event Sourcing, CQRS

Real-time billing, the ability to meter, rate, and reflect usage charges within seconds of a billable event occurring, has become a competitive requirement for cloud infrastructure, AI APIs, and any SaaS product with consumption-based pricing — including real-time event processing for telecom billing. According to OpenView Partners' 2024 SaaS Benchmarks Report, companies with real-time billing visibility report 18% lower involuntary churn compared to companies that process usage only at end-of-month [1]. This guide covers the three architectural pillars of real-time billing systems: stream processing for high-throughput event handling, event sourcing for immutable billing state, and CQRS for separating write-optimized and read-optimized billing data paths.

What Is Stream Processing in Billing Systems?

Stream processing in billing systems means applying billing logic — metering, aggregation, rating — to usage events as they arrive, rather than accumulating events and processing them in batch jobs. A stream processing architecture ingests events from a message broker (Kafka, Kinesis, Pub/Sub), applies stateful transformations (summing API calls per customer per billing period), and writes results to a billing state store with sub-second latency. This enables real-time usage dashboards, live spend alerts, and hard-limit enforcement that blocks requests the moment a customer exceeds their credit balance. Batch billing systems cannot provide these capabilities because their billing state is hours or days out of date [2].

Apache Kafka is the dominant message broker for real-time billing workloads at scale. Kafka's log-based storage model retains every event for a configurable retention period, enabling consumers to replay events from any point in time — critical for billing systems that need to re-rate events when pricing rules change or correct billing errors discovered after invoice generation. Kafka's consumer group model allows multiple billing consumers to read from the same event stream independently: one consumer updates real-time usage meters, another feeds an analytics data warehouse, and a third triggers alert thresholds. Each consumer maintains its own offset and processes events at its own pace without interfering with others.

How Does Event Sourcing Apply to Billing Systems?

Event sourcing in billing systems stores every state change as an immutable, append-only event rather than updating a current-state record in place. Instead of a subscriptions table with a current_status column that gets updated to "canceled," an event sourcing system appends a SubscriptionCanceled event to an event log. The current state of any billing entity — customer balance, subscription status, invoice total — is derived by replaying the event log from the beginning. This model provides a complete, tamper-proof audit trail by design: every billing change is recorded with a timestamp, the actor who triggered it, and the full before/after state [3].

Event sourcing solves a fundamental problem in billing audits. When a customer disputes an invoice, a traditionally structured billing database can show what the invoice total is but may struggle to explain exactly why it arrived at that number — especially if pricing rules, discounts, or credits were applied at different times. An event-sourced billing system can replay the exact sequence of events that led to each line item, making it straightforward to explain any charge and demonstrate that the billing logic was applied correctly. This capability is increasingly required by enterprise customers conducting vendor audits and by financial regulators reviewing billing practices. For more on what audit-grade billing accuracy requires, see the guide on audit-grade billing accuracy, requirements, and testing.

Snapshots are a performance optimization for event-sourced billing systems. Replaying the full event log from the beginning to derive current state is prohibitively slow when a long-lived customer has years of billing events. Periodic snapshots capture the derived state at a point in time, allowing the system to start from the most recent snapshot and replay only the events since then. For billing systems, snapshots are typically generated at invoice boundaries — once a billing period closes and the invoice is finalized, the state at that point is snapshotted and earlier events can be archived to cold storage.

What Is CQRS and Why Does It Matter for Billing?

CQRS (Command Query Responsibility Segregation) separates the write path (commands that change billing state) from the read path (queries that retrieve billing data), allowing each to be optimized independently. In billing, write operations — ingesting events, generating invoices, processing payments — require strong consistency and ACID guarantees. Read operations — customer billing portals, revenue dashboards, invoice history — require high throughput and low latency but can tolerate eventual consistency. CQRS allows the write side to use a normalized, write-optimized database schema, while the read side uses denormalized, query-optimized projections tailored to specific UI views [4].

A CQRS billing architecture maintains separate read models for different consumers. The customer billing portal read model contains pre-computed invoice totals, payment history, and current balance — all the data a customer needs to understand their account, assembled in a format that serves the UI directly without complex joins. The finance team's revenue dashboard read model contains MRR trends, cohort metrics, and payment success rates. The support team's lookup model contains customer contact information alongside billing status. Each read model is updated asynchronously from the event stream, so changes to billing state propagate to all read models within seconds.

Stream Processing Frameworks for Billing

Apache Flink and Apache Kafka Streams are the two dominant stream processing frameworks for billing-scale workloads. Flink provides stateful stream processing with exactly-once semantics, allowing billing aggregations to survive worker failures without double-counting events or losing counts. Flink's windowed aggregations enable per-customer, per-billing-period usage summaries that update continuously as events arrive. Kafka Streams is simpler to operate (it's a library rather than a separate cluster) and suitable for billing workloads that don't require the full power of distributed stateful processing [5].

Exactly-once event processing is a strict requirement for billing-critical stream operations. At-least-once processing — where events may be processed multiple times during failure recovery — is acceptable for analytics but not for billing, where double-counting an API call would result in overcharging a customer. Exactly-once semantics in Kafka require idempotent producers, transactional consumers, and a database that supports idempotent writes keyed on event ID. The overhead of exactly-once processing (roughly 10–20% latency increase compared to at-least-once) is the appropriate trade-off for billing workloads where correctness is non-negotiable.

Real-Time Usage Metering Architecture

Real-time usage metering is the front end of a real-time billing system: it receives raw usage events from the application, normalizes them into a canonical billing event format, deduplicates based on event ID, and routes them to the appropriate billing meter. Each meter maintains an aggregated count or sum for a specific billing dimension — API calls per customer, compute seconds per organization, messages per subscription. Meters are updated synchronously on event ingestion so that balance checks and limit enforcement can query current meter values in real time.

Deduplication in real-time billing metering requires a deduplication window — typically 24–48 hours — within which duplicate event IDs are rejected. Events with the same ID submitted more than 48 hours apart are treated as new events. This window represents a trade-off between deduplication coverage and the storage cost of maintaining the deduplication index. For billing systems where clients may retry failed event submissions with the same ID, the deduplication window must be at least as long as the client's maximum retry window. Redis sorted sets provide an efficient data structure for time-windowed deduplication: events are stored with their timestamp as the score, and a periodic job removes entries older than the deduplication window.

Event Ordering and Late Arrival Handling

Out-of-order event arrival is a practical challenge in distributed billing systems. A usage event generated at 23:59:59 on the last day of a billing period may not arrive at the billing system until 00:01:00 of the next period due to network delays. If the invoice was already generated, the late event must be handled — either by amending the closed invoice, recording it as a credit on the next invoice, or holding invoice generation briefly to allow a late arrival window. Each approach has trade-offs: amending closed invoices affects revenue recognition accuracy; the late arrival window delays invoice delivery [6].

Watermarking is the standard technique for managing late arrivals in stream processing. A watermark is a timestamp threshold below which the system assumes all events have arrived — events with timestamps earlier than the watermark are considered late. Billing stream processors advance the watermark progressively as events arrive, closing billing windows only when the watermark passes the window boundary. A late arrival tolerance of 5–10 minutes is appropriate for most billing systems, accommodating network delays without significantly delaying invoice generation.

Integrating Real-Time Billing with Limit Enforcement

Real-time billing state enables limit enforcement that prevents customers from exceeding their allocated credits or subscription plan limits. When a customer sends a usage event, the billing system checks the current meter value against the customer's plan limit before acknowledging the event. If the meter is at or above the limit, the system returns an error that the calling application interprets as a quota exceeded response. This hard-limit enforcement requires the billing state to be current — a batch billing system updated hourly cannot enforce per-minute rate limits without a separate, faster-updating quota system.

Open-source billing platforms like Lago support real-time event ingestion and prepaid credit wallets with automatic top-up — when a customer's balance drops below a configured threshold, the wallet automatically charges the payment method and restores the balance, providing uninterrupted service without manual intervention. This capability requires tight integration between the real-time billing state, the payment processor, and the limit enforcement layer. For teams designing event ingestion pipelines that feed real-time billing systems, the guide on event ingestion architecture and metering pipelines provides detailed design guidance on building the event layer that makes real-time billing possible.

Operational Considerations for Real-Time Billing Systems

Real-time billing systems have stricter operational requirements than batch billing systems because billing state must be continuously available — a core challenge of scaling event-driven billing systems. Kafka cluster availability directly affects billing availability — if the event broker is unavailable, usage events cannot be ingested and billing meters become stale. High availability configuration (minimum 3 broker replicas, replication factor of 3, minimum in-sync replicas of 2) ensures that the loss of any single broker does not interrupt event ingestion. Monitoring for consumer lag — the difference between the latest event in the topic and the latest event processed by the billing consumer — provides an early warning of processing delays before they affect real-time billing accuracy.

Schema evolution requires careful management in event-sourced billing systems. Because events are stored permanently (see event storage and querying at scale for infrastructure approaches), changing the schema of an existing event type breaks the ability to replay historical events. Backward-compatible schema evolution — adding new optional fields rather than changing existing fields — is mandatory for event types that are already in production. Apache Avro and Protocol Buffers provide schema registries that enforce compatibility rules and prevent incompatible schema changes from being published. Billing systems that use JSON for event serialization must implement equivalent schema validation at the application layer, as JSON has no built-in schema enforcement mechanism.

Citations

OpenView Partners. 2024 SaaS Benchmarks Report: Monetization and Billing Trends. OpenView Venture Partners, 2024.
Confluent. Stream Processing for Financial Services. Confluent, Inc., 2024.
Greg Young. CQRS and Event Sourcing. Event Store Ltd., 2023.
Martin Fowler. CQRS Pattern. martinfowler.com, 2011.
Apache Flink Documentation. Stateful Stream Processing. Apache Software Foundation, 2025.
Tyler Akidau, et al. The Dataflow Model. Proceedings of the VLDB Endowment, 2015.

Usage Metering

Billing & Invoicing

Entitlements

Cash Collection

Revenue Analytics

Lago Embedded

Lago AI ✨

Integrations

AI

Enterprise

Fintechs & Banks

IoT & Telco

Engineering

Finance

Operations

Product

Hybrid Plans

Usage-based

Enterprise Plans

Multi-products

Self-hosted

API Reference

Changelog

Documentation

GitHub

About us

Hiring

Blog

Knowledge base

Learn

Security

Real-Time Billing: Stream Processing, Event Sourcing, CQRS

What Is Stream Processing in Billing Systems?

How Does Event Sourcing Apply to Billing Systems?

What Is CQRS and Why Does It Matter for Billing?

Stream Processing Frameworks for Billing

Real-Time Usage Metering Architecture

Event Ordering and Late Arrival Handling

Integrating Real-Time Billing with Limit Enforcement

Operational Considerations for Real-Time Billing Systems

Citations

More from the blog

Lago vs Orb: Open-Source Billing vs a Billing Specialist Now Inside Adyen

Lago vs Metronome: Open-Source Billing vs an Enterprise Usage-Billing Specialist Now Inside Stripe

Lago vs Stripe Billing: Feature Comparison, Pricing, and Open-Source Advantages

Enterprise SaaS Billing: What You Actually Need (2026)

Lago solves complex billing.