Sweep Line (Event Sort)

The Intuition

A lot of interval problems look complicated until each interval is broken into two events: one for the start, one for the end. Once that translation happens, every interval-overlap question collapses into a single sweep over a sorted list of events with a running counter.

Take the canonical example: minimum number of meeting rooms. Each meeting becomes two events. The start emits +1 because a new meeting needs a room. The end emits -1 because a room frees up. Sort all events by time. Walk them in order. The running sum is the number of concurrent meetings at that point in time. The maximum value of that running sum is the answer.

The tie-break at equal timestamps is the only subtle part. If a meeting ends at 5 and another meeting starts at 5, do they share a room? No: the first room is freed before the second meeting begins. To enforce that, the -1 event must sort before the +1 event at the same timestamp. Tuple sorting (time first, then delta) does this automatically because -1 < +1. If the problem's semantics are different (intervals with shared endpoints DO overlap), reverse the tie-break.

The code below works through three LeetCode problems in this family.

Meeting Rooms II (LC 253). Given a list of meetings as [start, end] intervals, return the minimum number of rooms needed to schedule all of them without conflict. Build start/end events, sort, walk, track the maximum running count.

Car Pooling (LC 1094). Given a list of trips where each trip is [numPassengers, from, to] and a vehicle capacity, return whether the driver can complete every trip without exceeding capacity at any moment. Build pick-up/drop-off events with weights equal to the passenger count, sort, walk, return false the first time the running total exceeds capacity.

Number of Flowers in Full Bloom (LC 2251). Given a set of flower bloom intervals (each is [start, end] inclusive on both ends) and a list of query days, return how many flowers are blooming on each query day. The first two problems are pure single-pass sweeps. This one is a per-query variant: precompute snapshots of the running count at every event time, then binary-search each query day to find the latest snapshot at or before that day.

The same machinery handles a lot of variants:

Car pooling: events are passenger pick-ups (+num) and drop-offs (-num). Capacity check at each event.
Maximum population year: each person contributes (birth, +1) and (death, -1) events. Track running population.
Number of flowers in full bloom for each query day: precompute prefix counts at every event time, then binary-search each query.
Skyline problem: each building emits a "tallest height changed here" event. The state is a multiset of active heights, peeked with a heap.

The technique is O(n log n) because of the sort. The sweep itself is linear. For very large inputs with bounded coordinate ranges, a difference array can replace the sweep for true O(n + range) time, but the sweep is the universal solution.

When to use

Intervals with overlap, capacity, or concurrency questions
Each interval has a clean start and end and the question concerns "what is true at every point in time"
Need to answer per-query questions about a fixed set of intervals (combine with binary search)
Multiple criteria at the same timestamp (tie-break order encodes semantics)

When NOT to use

Coordinates are very small and dense; a difference array is faster (no sort needed)
Need to actually merge intervals into a smaller list (use the merge intervals template instead)
Question asks about a single interval at a time (sweep is overkill)
Need to track per-interval identity (sweep loses that; carry an id field in the event struct if needed)

Pattern Recognition

"minimum number of rooms / cars / arrows / runways"
"at any moment, what is the maximum / number of active X"
"given queries about specific times, count active intervals"
"skyline" or "horizon" of overlapping rectangles

Variations

Difference array (no sort needed): When timestamps are integers in a small range, allocate delta[0..max_time], increment at starts and decrement at ends, then prefix-sum. O(n + range) time, O(range) space.
With heap for state: When the state is more than a counter (e.g. set of heights for skyline), use a max-heap to track active items. Pop expired entries lazily as the sweep moves forward.
Two-dimensional sweep: Sort by x-coordinate, sweep along y. Used in computational geometry.
Offline answering of queries: Sort queries together with events by time. Each query is processed when its time is reached during the sweep.

Edge Cases

Empty input (return 0 or true depending on the problem)
Single interval (max = 1, answer is trivially derivable)
All intervals share the same start or end point (tie-break correctness is tested)
Intervals where start equals end (zero-length; usually contributes neither, but check the spec)
Negative coordinates (the sort still works; difference array does not without offset)

Practice Problems

The Intuition

The code below works through three LeetCode problems in this family.

The same machinery handles a lot of variants:

Car pooling: events are passenger pick-ups (+num) and drop-offs (-num). Capacity check at each event.
Maximum population year: each person contributes (birth, +1) and (death, -1) events. Track running population.
Number of flowers in full bloom for each query day: precompute prefix counts at every event time, then binary-search each query.
Skyline problem: each building emits a "tallest height changed here" event. The state is a multiset of active heights, peeked with a heap.

When to use

Intervals with overlap, capacity, or concurrency questions
Each interval has a clean start and end and the question concerns "what is true at every point in time"
Need to answer per-query questions about a fixed set of intervals (combine with binary search)
Multiple criteria at the same timestamp (tie-break order encodes semantics)

When NOT to use

Coordinates are very small and dense; a difference array is faster (no sort needed)
Need to actually merge intervals into a smaller list (use the merge intervals template instead)
Question asks about a single interval at a time (sweep is overkill)
Need to track per-interval identity (sweep loses that; carry an id field in the event struct if needed)

Pattern Recognition

"minimum number of rooms / cars / arrows / runways"
"at any moment, what is the maximum / number of active X"
"given queries about specific times, count active intervals"
"skyline" or "horizon" of overlapping rectangles

Variations

Difference array (no sort needed): When timestamps are integers in a small range, allocate delta[0..max_time], increment at starts and decrement at ends, then prefix-sum. O(n + range) time, O(range) space.
With heap for state: When the state is more than a counter (e.g. set of heights for skyline), use a max-heap to track active items. Pop expired entries lazily as the sweep moves forward.
Two-dimensional sweep: Sort by x-coordinate, sweep along y. Used in computational geometry.
Offline answering of queries: Sort queries together with events by time. Each query is processed when its time is reached during the sweep.

Edge Cases

Empty input (return 0 or true depending on the problem)
Single interval (max = 1, answer is trivially derivable)
All intervals share the same start or end point (tie-break correctness is tested)
Intervals where start equals end (zero-length; usually contributes neither, but check the spec)
Negative coordinates (the sort still works; difference array does not without offset)

The Intuition

When to use

When NOT to use

Pattern Recognition

Variations

Edge Cases

Practice Problems

Key Points

Code Template

Common Mistakes

Related Patterns

Sweep Line (Event Sort)

The Intuition

When to use

When NOT to use

Pattern Recognition

Variations

Edge Cases

Practice Problems

Key Points

Code Template

Common Mistakes

Related Patterns