On-Call Rotation Design
Rotation Models
Single-timezone rotation is the simplest model. One engineer holds the pager for a week (or 3-4 days), then hands it off. This works when your team is in one region, but it means overnight pages are a reality. To make this sustainable, keep the rotation pool at 6-8 people minimum so each person is on call once every 6-8 weeks.
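The pool-size math above can be sketched as a simple round-robin schedule. This is a minimal illustration (names and dates are hypothetical), not a scheduling tool:

```python
from datetime import date, timedelta

def weekly_rotation(engineers, start, weeks):
    """Round-robin weekly rotation: each engineer holds the pager for
    one week, then hands off to the next person in the pool."""
    schedule = []
    for week in range(weeks):
        shift_start = start + timedelta(weeks=week)
        schedule.append((shift_start, engineers[week % len(engineers)]))
    return schedule

# With a 6-person pool, each engineer is on call once every 6 weeks.
pool = ["ana", "ben", "chen", "dina", "eli", "fay"]
for shift_start, engineer in weekly_rotation(pool, date(2024, 1, 1), 8):
    print(shift_start, engineer)
```

Shrink the pool to 4 and the same loop shows each person paged every 4 weeks, which is where burnout risk starts.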
Follow-the-sun distributes on-call across time zones so nobody gets paged overnight. You need teams in at least two regions with a 4-6 hour overlap for handoffs. This is the gold standard for quality of life but requires enough staffing in each region to maintain a local rotation pool. Companies like PagerDuty and Datadog use this model for their own operations.
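One way to picture follow-the-sun coverage is as paging windows in UTC, with overlap hours reserved for handoffs. The three regions and their hours below are purely illustrative assumptions, not a recommended split:

```python
# Hypothetical three-region split of the 24-hour day (UTC). Adjacent
# windows overlap by one hour, which is when handoffs happen; the 4-6
# hour working-hours overlap mentioned above gives slack around that.
REGIONS = {
    "apac": range(0, 9),    # 00:00-08:59 UTC
    "emea": range(8, 17),   # 08:00-16:59 UTC
    "amer": range(16, 24),  # 16:00-23:59 UTC
}

def on_call_regions(hour_utc):
    """Regions holding the pager at a given UTC hour; two regions
    during a handoff window, one otherwise."""
    return [name for name, hours in REGIONS.items() if hour_utc in hours]
```

A quick check that every hour returns at least one region catches coverage gaps before they catch you.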
Split rotation divides on-call between business hours and after hours. Some teams have a "primary" on-call during the day and a separate "night and weekend" rotation with different (usually lighter) expectations. This works for services with low after-hours traffic.
Compensation That Works
Ignoring on-call compensation is a guaranteed way to lose engineers. The most common models:
- Flat weekly stipend: $500-1,500 per week of on-call duty, paid regardless of whether pages happen. Simple and predictable.
- Per-incident payout: $50-200 per page, with multipliers for overnight or weekend pages. Aligns compensation with actual disruption.
- Extra PTO: A day of PTO for each week of on-call. Works well where adding cash compensation is impractical (e.g., early-stage startups with tight budgets).
Many companies combine approaches: a base stipend plus per-incident bonuses for after-hours pages.
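The combined model can be expressed as a one-line payout formula. The dollar amounts are illustrative picks from the ranges above, not recommendations:

```python
# Sketch of a combined model: base weekly stipend plus per-page
# bonuses for after-hours pages, with a weekend multiplier.
# All amounts are hypothetical examples, not benchmarks.
WEEKLY_STIPEND = 750        # flat, paid regardless of pages
AFTER_HOURS_BONUS = 100     # per page outside business hours
WEEKEND_MULTIPLIER = 1.5    # weekend pages pay more

def on_call_pay(after_hours_pages, weekend_pages):
    """Total pay for one week of on-call under the combined model."""
    return (WEEKLY_STIPEND
            + after_hours_pages * AFTER_HOURS_BONUS
            + weekend_pages * AFTER_HOURS_BONUS * WEEKEND_MULTIPLIER)
```

A quiet week pays the stipend alone; a week with two overnight pages and one weekend page pays the stipend plus $350 in bonuses.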
Burnout Prevention
Track on-call health metrics: pages per week, mean time to acknowledge, mean time to resolve, and the ratio of actionable to non-actionable alerts. If a team is averaging more than 2 pages per on-call shift, the alert noise is too high and needs tuning.
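The health check described above is easy to automate. A minimal sketch, assuming each shift is recorded as a list of pages tagged with whether they were actionable (the field names and thresholds mirror the text but are otherwise hypothetical):

```python
def rotation_health(shifts, max_pages_per_shift=2, min_actionable_ratio=0.5):
    """Flag a rotation for alert tuning when the average pages per
    shift exceed the threshold or too many alerts are non-actionable.

    Each shift is a list of pages; a page is a (seconds_to_ack,
    actionable) tuple.
    """
    pages = [p for shift in shifts for p in shift]
    pages_per_shift = len(pages) / len(shifts)
    actionable_ratio = sum(1 for _, actionable in pages if actionable) / len(pages)
    return {
        "pages_per_shift": pages_per_shift,
        "actionable_ratio": actionable_ratio,
        "needs_tuning": (pages_per_shift > max_pages_per_shift
                         or actionable_ratio < min_actionable_ratio),
    }
```

Mean time to acknowledge and resolve drop in the same way: average the timing fields across `pages` and alert when the trend moves the wrong direction.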
After a particularly rough on-call week (multiple SEV1s, extended outages), give the engineer a recovery day. Don't make them jump straight back into sprint work the morning after a 3 AM incident.
Tooling
PagerDuty is the market leader with deep integrations and analytics. Opsgenie (Atlassian) is a solid alternative, especially if you're already in the Atlassian ecosystem. Rootly focuses on incident management and pairs well with either paging tool. Grafana OnCall is the open-source option. Whichever tool you pick, make sure it supports automatic escalation, schedule overrides, and on-call analytics out of the box.
Key Points
- Sustainable on-call rotations need a minimum of 6-8 people. Fewer than that and individuals end up on call too frequently, which leads to burnout and attrition
- Follow-the-sun rotations (handing off the pager across time zones) eliminate overnight pages but require at least two geographically distributed teams with sufficient overlap for clean handoffs
- On-call compensation is not optional. Whether it's a flat weekly stipend ($500-1,500/week is common in US tech), extra PTO, or per-incident payouts, uncompensated on-call tells engineers their time outside work hours has no value
- Shadow on-call pairs a new team member with an experienced on-caller for 1-2 rotations before they carry the pager solo. This builds confidence and catches knowledge gaps before they result in a botched incident response
- Escalation policies should have clear timeouts. If a primary on-call doesn't acknowledge an alert within 5 minutes, it auto-escalates to secondary. If secondary doesn't respond in 10 minutes, it hits the engineering manager
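The escalation timeouts in the last point can be sketched as a walk down an ordered chain. This is an illustrative model of the policy, not any vendor's API:

```python
# Hypothetical escalation chain: primary has 5 minutes to acknowledge,
# secondary another 10, then the engineering manager (end of chain).
ESCALATION_POLICY = [
    ("primary", 5),                 # minutes before escalating past this level
    ("secondary", 10),
    ("engineering-manager", None),  # None = final level, no further escalation
]

def who_holds_alert(minutes_unacknowledged):
    """Return the level an unacknowledged alert has escalated to."""
    remaining = minutes_unacknowledged
    for level, timeout in ESCALATION_POLICY:
        if timeout is None or remaining < timeout:
            return level
        remaining -= timeout
    return ESCALATION_POLICY[-1][0]
```

In PagerDuty or Opsgenie terms, each tuple corresponds to one escalation rule with its delay; the point is that the timeouts are explicit and ordered, not tribal knowledge.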
Common Mistakes
- Putting on-call solely on the SRE or ops team. The engineers who build the service should share on-call responsibility. Shared pain creates shared ownership of reliability
- Alerting on metrics that aren't actionable. Every page should have a corresponding runbook with clear steps. If the on-call engineer can't do anything about an alert, it shouldn't be a page
- Ignoring on-call load distribution. Some weeks are quiet, others are brutal. Track pages per rotation and rebalance if certain shifts consistently get hit harder
- Skipping the on-call handoff. A 15-minute sync between outgoing and incoming on-call (open incidents, known risks, upcoming deployments) prevents context loss and repeated triage