API Design at Staff Level

Boring APIs Win

The best APIs are boring. They do exactly what the name says, they handle errors consistently, and they have not had a breaking change in three years. Nobody writes a blog post about an API that just works. That is the point.

At Staff level, the interviewer is not testing whether you know what REST is. They want to see if you can design an API that 200 teams can integrate with and that your grandchildren will not have to deprecate. The bar is durability, not cleverness.

Resource Modeling That Scales

Start with nouns. A payment system has payments, refunds, customers, and payment methods. Each is a resource with its own lifecycle. GET /payments/{id} retrieves a payment. POST /payments/{id}/refunds creates a refund scoped to that payment. This nesting communicates ownership without requiring documentation.

Where candidates go wrong is modeling around internal implementation. If your payment service internally splits a transaction into an authorization and a capture, that does not mean your public API needs /authorizations and /captures as top-level resources. Expose the abstraction your consumers care about, not the plumbing they do not.

Keep URLs shallow. Two levels of nesting is the practical limit. /customers/{customer_id}/payment-methods/{payment_method_id}/transactions is a signal that your resource model needs rethinking. Promote transactions to a top-level resource with a customer_id filter instead.
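As a rough illustration of promoting transactions to a top-level resource, the sketch below filters a flat collection instead of walking a nested path. The sample data, the payment_method_id filter, and the function names are all hypothetical and stand in for whatever storage and routing layer a real service would use.

```python
from urllib.parse import urlencode

# Hypothetical in-memory data standing in for a transactions store.
TRANSACTIONS = [
    {"id": "txn_1", "customer_id": "cus_1", "payment_method_id": "pm_1", "amount": 1200},
    {"id": "txn_2", "customer_id": "cus_1", "payment_method_id": "pm_2", "amount": 800},
    {"id": "txn_3", "customer_id": "cus_2", "payment_method_id": "pm_3", "amount": 500},
]


def transactions_url(**filters):
    """Build a flat /transactions URL; filters whose value is None are skipped."""
    query = urlencode({k: v for k, v in sorted(filters.items()) if v is not None})
    return "/transactions" + ("?" + query if query else "")


def list_transactions(customer_id=None, payment_method_id=None):
    """Return the transactions that match every filter that was supplied."""
    results = []
    for txn in TRANSACTIONS:
        if customer_id is not None and txn["customer_id"] != customer_id:
            continue
        if payment_method_id is not None and txn["payment_method_id"] != payment_method_id:
            continue
        results.append(txn)
    return results
```

The same collection answers "all transactions for a customer" and "all transactions for one payment method" without adding a third level of nesting to the URL.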

The 45-Minute API Design Interview

Here is a time allocation that reliably produces strong answers.

Minutes 0-10: Clarify requirements. Who are the consumers? External developers, internal services, or both? What are the SLA expectations? Is this a synchronous request-response API or does it need webhooks for async operations? For a payment API, you need to establish whether you are designing a Stripe-like platform or an internal service.

Minutes 10-25: Design the resource model. Sketch the core resources, their relationships, and their state machines. A payment has states: created, processing, succeeded, failed. Define what transitions are valid. Then write out 5-7 key endpoints with request and response shapes. Do not try to be exhaustive. Focus on the endpoints that reveal interesting design decisions.
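One way to make the payment state machine concrete on a whiteboard is a transition table. The states below come from the text; the specific edges between them are an assumption (the text names the states but not the transitions), and the function and exception names are illustrative only.

```python
# Assumed transition table: the states are from the text, the edges are a guess.
VALID_TRANSITIONS = {
    "created": {"processing"},
    "processing": {"succeeded", "failed"},
    "succeeded": set(),  # terminal state
    "failed": set(),     # terminal state
}


class InvalidTransition(Exception):
    """Raised when a payment is asked to move along an edge that is not allowed."""


def transition(current, target):
    """Return the new state if current -> target is allowed, otherwise raise."""
    if current not in VALID_TRANSITIONS:
        raise ValueError(f"unknown state: {current!r}")
    if target not in VALID_TRANSITIONS[current]:
        raise InvalidTransition(f"cannot move from {current!r} to {target!r}")
    return target
```

Writing the table down forces the interesting questions early, such as whether a failed payment can be retried in place or needs a new payment object.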

Minutes 25-35: Define the hard parts. Error responses, pagination, authentication, and idempotency. Pick Stripe's error format or build something similar. Explain your pagination strategy (cursor-based with a default page size of 25, max of 100). Specify that all mutating endpoints accept an Idempotency-Key header.
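The pagination and idempotency rules above can be sketched in a few lines. This is a minimal in-memory illustration, assuming items are sorted by an ascending integer id and that the cursor is an opaque base64 token; the response field names (data, has_more, next_cursor) and the IdempotencyStore class are hypothetical. A production version would also scope keys per caller, compare request bodies, expire old keys, and handle concurrent requests.

```python
import base64
import json

DEFAULT_PAGE_SIZE = 25
MAX_PAGE_SIZE = 100


def clamp_limit(limit=None):
    """Apply the default page size and cap the requested size at the maximum."""
    if limit is None:
        return DEFAULT_PAGE_SIZE
    return max(1, min(int(limit), MAX_PAGE_SIZE))


def encode_cursor(last_id):
    """Wrap the last seen id in an opaque, URL-safe token."""
    return base64.urlsafe_b64encode(json.dumps({"after": last_id}).encode()).decode()


def decode_cursor(cursor):
    """Recover the last seen id from a token produced by encode_cursor."""
    return json.loads(base64.urlsafe_b64decode(cursor.encode()))["after"]


def paginate(items, limit=None, cursor=None):
    """Return one page of items (sorted by ascending 'id') plus a next cursor."""
    size = clamp_limit(limit)
    after = decode_cursor(cursor) if cursor else None
    remaining = [item for item in items if after is None or item["id"] > after]
    page = remaining[:size]
    has_more = len(remaining) > size
    next_cursor = encode_cursor(page[-1]["id"]) if has_more else None
    return {"data": page, "has_more": has_more, "next_cursor": next_cursor}


class IdempotencyStore:
    """Replay the stored response when the same Idempotency-Key is seen again."""

    def __init__(self):
        self._responses = {}

    def execute(self, key, handler):
        if key in self._responses:
            return self._responses[key]  # retry: do not run the handler again
        response = handler()
        self._responses[key] = response
        return response
```

The cursor carries a position rather than an offset, so pages stay stable when new rows are inserted ahead of the reader.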

Minutes 35-45: Trade-offs and evolution. How will this API change over time? Discuss your versioning strategy. Date-based versions (like Stripe's 2023-10-16 format) let you evolve continuously without forcing consumers onto a new "v2." Explain how you would deprecate a field: mark it deprecated in the schema, stop documenting it, log usage, notify consumers, then remove it after the deprecation window (typically 12-18 months for a public API).

Backwards Compatibility as a Discipline

Additive changes are almost always safe: new fields in a response, new optional query parameters, new endpoints. None of these break existing consumers, provided clients ignore fields they do not recognize.

Everything else requires a deprecation process. Stripe solves this by maintaining compatibility layers for every API version they have ever shipped. When you pass a version header, the backend transforms the current internal response into the shape your version expects. Expensive to maintain, but no consumer ever breaks unexpectedly. For most companies, a simpler approach works: support two versions simultaneously, give consumers a 6-month migration window, and provide automated migration tooling.
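The compatibility-layer idea can be sketched as a chain of downgrade functions applied newest first. Everything specific here is invented for illustration: the version dates, the two breaking changes, and the function names are assumptions, not a description of Stripe's actual implementation or change history.

```python
# Hypothetical breaking changes, each paired with a function that converts a
# response back to the shape it had before that change shipped.

def _undo_amount_object(resp):
    # Newer shape: {"amount": {"value": 1200, "currency": "usd"}}
    # Older shape: {"amount": 1200, "currency": "usd"}
    out = dict(resp)
    amount = out.pop("amount")
    out["amount"] = amount["value"]
    out["currency"] = amount["currency"]
    return out


def _undo_status_rename(resp):
    # Newer field name: "status". Older field name: "state".
    out = dict(resp)
    out["state"] = out.pop("status")
    return out


# (version that introduced the change, downgrade function), newest first.
VERSION_CHANGES = [
    ("2024-06-01", _undo_amount_object),
    ("2024-01-15", _undo_status_rename),
]


def render_for_version(current_response, requested_version):
    """Downgrade the current response to the shape the requested version expects."""
    resp = current_response
    for introduced_in, downgrade in VERSION_CHANGES:
        # ISO dates (YYYY-MM-DD) compare correctly as plain strings.
        if requested_version < introduced_in:
            resp = downgrade(resp)
    return resp
```

Internal code only ever produces the newest shape. Each breaking change adds one small downgrade function, which is where the maintenance cost mentioned above accumulates.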

GraphQL vs REST vs gRPC: An Honest Framework

REST is the default for public APIs. HTTP caching works natively. Every language has an HTTP client. Tooling is mature. The downside is over-fetching (you get the whole resource even if you need two fields) and the N+1 request problem for complex data requirements.

GraphQL solves the over-fetching problem, which makes it excellent for mobile clients on unreliable networks. Shopify, GitHub, and Yelp use it for their public APIs. But the trade-offs are real: query complexity attacks require depth limiting, HTTP caching no longer works out of the box and needs extra infrastructure (persisted queries, or a caching layer such as the ones in the Apollo ecosystem), and error handling is unintuitive because GraphQL servers conventionally return 200 even on failure, with errors nested in the response body.
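To show why the error handling feels unintuitive, here is a minimal client-side sketch that inspects the body instead of trusting the status code. It assumes the response has already been parsed into a dict with the conventional data and errors keys; the function and exception names are hypothetical. GraphQL can return partial data alongside errors, and this sketch deliberately treats any error as a failure.

```python
class GraphQLError(Exception):
    """Raised when a response body carries a non-empty 'errors' list."""

    def __init__(self, errors):
        self.errors = errors
        messages = "; ".join(e.get("message", "unknown error") for e in errors)
        super().__init__(messages)


def unwrap_graphql_response(status_code, body):
    """Return body['data'], raising if the transport or the query failed."""
    if status_code != 200:
        raise RuntimeError(f"transport error: HTTP {status_code}")
    errors = body.get("errors")
    if errors:
        # The HTTP status said success, but the query itself did not succeed.
        raise GraphQLError(errors)
    return body.get("data")
```

A REST client can usually branch on the status code alone. Here a 200 is only the start of the check, which is easy to forget in monitoring and retry logic.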

gRPC is purpose-built for service-to-service communication. Binary protobuf serialization is typically several times faster than JSON parsing (5-10x is a commonly quoted figure, though the gain varies with payload shape and language). But browser support requires grpc-web, and debugging is harder because you cannot just curl an endpoint; you need a protobuf-aware tool such as grpcurl. Google, Netflix, and Square use it extensively for internal services while exposing REST or GraphQL at the edge.

The Staff-level answer is almost never "just pick one." It is usually "REST at the public edge, gRPC between internal services, and GraphQL for the mobile BFF layer where query flexibility matters most."
