Pipeline Pattern
A pipeline chains stages where each stage transforms data and passes it to the next. Each stage runs concurrently; bounded queues between stages give backpressure for free. Dominant pattern for stream processing, ETL, and any 'producer → transform → transform → consumer' shape.
Diagram: source → stage 1 → stage 2 → stage 3 → sink, with a bounded queue between each adjacent pair of stages.
What it is
A pipeline is a chain of stages where each stage takes input, transforms it, and hands the result to the next stage (see diagram above). Each stage runs concurrently. Bounded queues between stages provide automatic backpressure: if Stage 3 slows down, its incoming queue fills, Stage 2 blocks on push, that fills its incoming queue, Stage 1 blocks, and the source throttles. The whole pipeline naturally adapts to the slowest stage.
This is the dominant shape for stream processing: ETL pipelines, HTTP request handlers that hit multiple downstreams, image processing chains, video encoding. Anything where data flows through a series of transformations.
A stage can also fan out internally: if "parse" is slow, run 4 parser workers reading from the same input queue and pushing to the same output queue. Each stage's parallelism is independent.
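A minimal sketch of stage-internal fan-out in Java (the worker count, the squaring transform, and all names are illustrative). The subtlety is shutdown: with several workers on one input queue, a worker that sees the sentinel must re-enqueue it so its siblings also exit, and only the last worker forwards one sentinel downstream.

```java
import java.util.concurrent.*;
import java.util.concurrent.atomic.AtomicInteger;

public class FanOutStage {
    public static void main(String[] args) throws InterruptedException {
        final int WORKERS = 4;                      // illustrative worker count
        final Integer DONE = Integer.MIN_VALUE;     // sentinel: no real item uses this value
        BlockingQueue<Integer> in = new ArrayBlockingQueue<>(100);
        BlockingQueue<Integer> out = new ArrayBlockingQueue<>(100);

        AtomicInteger live = new AtomicInteger(WORKERS);
        for (int w = 0; w < WORKERS; w++) {
            new Thread(() -> {
                try {
                    while (true) {
                        int n = in.take();
                        if (n == DONE) {
                            in.put(DONE);           // re-enqueue so the next worker also stops
                            if (live.decrementAndGet() == 0) out.put(DONE); // last one out signals downstream
                            return;
                        }
                        out.put(n * n);             // the stage's transform
                    }
                } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
            }).start();
        }

        for (int i = 1; i <= 8; i++) in.put(i);
        in.put(DONE);

        int sum = 0;                                // results arrive in arbitrary order across workers,
        while (true) {                              // so aggregate with an order-independent operation
            int n = out.take();
            if (n == DONE) break;
            sum += n;
        }
        System.out.println(sum);                    // 1+4+9+...+64 = 204
    }
}
```

Note that ordering is lost across workers; if downstream needs ordered output, the fan-out stage must re-sequence.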
Why bounded queues
The instinct is to use unbounded queues so producers never block. This is wrong. With unbounded queues, a slow consumer causes the queue to grow without bound, eating memory until OOM. Bounded queues turn that failure mode into backpressure: producers block when full, the system slows down gracefully under load, and the bottleneck stage becomes visible when the queue between it and its predecessor stays full.
The right default is "small bounded queues, sized to absorb burstiness". Capacities like 100 or 1000 are typical.
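The blocking behavior is easy to see with Java's ArrayBlockingQueue (the capacity of 4 is deliberately tiny to make it visible): offer returns false on a full queue, and put would block until a consumer frees a slot, which is exactly the backpressure described above.

```java
import java.util.concurrent.*;

public class Backpressure {
    public static void main(String[] args) throws InterruptedException {
        BlockingQueue<Integer> q = new ArrayBlockingQueue<>(4);

        for (int i = 0; i < 4; i++) q.put(i);  // fills the queue to capacity
        System.out.println(q.offer(99));       // false: queue full, the producer cannot proceed
        // q.put(99) here would block until a consumer takes an item -- that is backpressure

        q.take();                              // a consumer frees one slot
        System.out.println(q.offer(99));       // true: the producer proceeds
    }
}
```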
Throughput
Pipeline throughput equals the throughput of the slowest stage. Doubling all the other stages does nothing if the slow stage is unchanged. The fix is to identify the bottleneck (full queue upstream of it, empty queue downstream) and scale that stage: run multiple workers reading from the same input.
This is Little's Law (L = λW) at work: in steady state every stage processes the same λ items per second, so the stage with the largest per-item time W holds the most concurrent items L.
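A worked example with made-up numbers: at a steady 1000 items/s, a stage that takes 5 ms per item holds 5 items in flight, while 1 ms and 2 ms stages hold 1 and 2.

```java
public class LittlesLaw {
    public static void main(String[] args) {
        // Hypothetical numbers: the bottleneck fixes the whole pipeline at 1000 items/s.
        long lambda = 1000;               // items/second through every stage in steady state
        long[] serviceMillis = {1, 5, 2}; // per-item service time W for each stage, in ms
        for (long w : serviceMillis) {
            // Little's Law: L = lambda * W (convert ms to seconds)
            System.out.println(lambda * w / 1000);
        }
    }
}
```

The 5 ms stage is where the concurrent items pile up, which is why its input queue is the one that stays full.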
Shutdown
The clean shutdown protocol: close the first stage's input. The first stage drains, closes its output. The next stage sees the closed channel, drains, closes its output. Cascade through to the last stage, which signals "done".
In Go, a range loop over a channel drains the remaining values and exits once the channel is closed, so closing the output is the right idiom. In other languages, send a sentinel value (None, a poison pill) and have each stage propagate it downstream when it sees it.
Errors
The hard part. Each item can fail in the middle of a stage. The options:
Cancel everything. Pass a context (or shared cancellation) through; first error cancels the whole pipeline. Right when a single bad item invalidates the run (transactional ETL).
Skip and log. The stage logs the error and continues with the next item. Right when items are independent and partial completion is fine (web crawler, batch image processor).
Dead-letter queue. The stage sends bad items to a separate error channel for later inspection or retry. Right when bad items are interesting and need auditing.
Pick one and apply it consistently. Mixing strategies across stages confuses everyone.
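One way to sketch the dead-letter option in Java (the queue names and the parse-to-int transform are made up for illustration): the stage catches the per-item failure, routes the bad item to a separate queue, and keeps processing.

```java
import java.util.concurrent.*;

public class DeadLetter {
    public static void main(String[] args) throws InterruptedException {
        final String DONE = "__done__";  // sentinel: assumes no real item equals this
        BlockingQueue<String> in = new ArrayBlockingQueue<>(100);
        BlockingQueue<Integer> out = new ArrayBlockingQueue<>(100);
        BlockingQueue<String> deadLetter = new ArrayBlockingQueue<>(100);

        Thread parse = new Thread(() -> {
            try {
                while (true) {
                    String s = in.take();
                    if (s.equals(DONE)) return;
                    try {
                        out.put(Integer.parseInt(s));   // the transform that can fail per item
                    } catch (NumberFormatException e) {
                        deadLetter.put(s);              // bad item goes to the DLQ; pipeline keeps going
                    }
                }
            } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
        });
        parse.start();

        for (String s : new String[]{"1", "2", "oops", "3"}) in.put(s);
        in.put(DONE);
        parse.join();

        System.out.println(out);         // [1, 2, 3]
        System.out.println(deadLetter);  // [oops]
    }
}
```

In a real system the dead-letter queue would be drained by a separate auditor or retry process rather than printed.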
Implementations
Java: three threads with two BlockingQueues between them. A sentinel (poison pill) propagates shutdown. Bounded queues give backpressure. Same shape as channels in Go, just more verbose.
import java.util.concurrent.*;

public class Pipeline {
    public static void main(String[] args) throws InterruptedException {
        BlockingQueue<Integer> a = new ArrayBlockingQueue<>(100);
        BlockingQueue<Integer> b = new ArrayBlockingQueue<>(100);
        final Integer DONE = Integer.MIN_VALUE;  // sentinel: no real item uses this value

        // Stage 1: generate 0..99, then the sentinel.
        Thread gen = new Thread(() -> {
            try {
                for (int i = 0; i < 100; i++) a.put(i);
                a.put(DONE);
            } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
        });

        // Stage 2: square each item; forward the sentinel downstream.
        Thread sq = new Thread(() -> {
            try {
                while (true) {
                    int n = a.take();
                    if (n == DONE) { b.put(DONE); return; }
                    b.put(n * n);
                }
            } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
        });

        // Stage 3: consume and print.
        Thread out = new Thread(() -> {
            try {
                while (true) {
                    int n = b.take();
                    if (n == DONE) return;
                    System.out.println(n);
                }
            } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
        });

        gen.start(); sq.start(); out.start();
        gen.join(); sq.join(); out.join();
    }
}

Key points
- Each stage is a concurrent worker (or pool) reading from one queue, writing to the next.
- Bounded queues between stages give automatic backpressure: a slow stage throttles upstream.
- The slowest stage is the bottleneck. Pipeline throughput equals the throughput of the slowest stage.
- Shutdown propagates downstream: close the input, the stage drains and closes its output.
- Errors are tricky: cancel the whole pipeline, or skip-and-log, or dead-letter the bad item. Pick a policy.
Follow-up questions
- What if one stage is much slower than the others?
- How are errors propagated through a pipeline?
- When should you use a pipeline vs fan-out?
Gotchas
- !Forgetting to close the output channel leaves downstream goroutines blocked forever
- !Unbounded queues between stages defeat backpressure; bound everything
- !Errors in the middle of the pipeline can leave stages running with no input/output
- !Slow consumer at the end backs up everything; monitor queue depths