Engineering Culture at Scale
The Moment Culture Starts Breaking
A startup I worked with had a "move fast and break things" culture that worked beautifully at 15 engineers. Everyone knew the codebase. People fixed each other's bugs without being asked. Risk-taking was celebrated because the blast radius was small.
At 80 engineers, that same culture became toxic. "Move fast" now meant three teams shipping conflicting changes to the same service in a single week. "Break things" meant a junior engineer pushed a migration without a rollback plan and took down payments for four hours. The founding engineers who thrived in the chaos started labeling newcomers as "not getting it." Meanwhile, the newcomers saw recklessness dressed up as a value. Half the engineers hired in Q3 left by Q1.
The lesson: culture that works at one scale actively harms at another. Scaling culture is not about preserving the original vibe. It is about identifying which parts were actually principles and which were just artifacts of being small.
What Values Actually Look Like in Practice
Written values documents are necessary but insufficient. Netflix published their culture deck. Stripe has operating principles. Google had "don't be evil." These artifacts only matter if they influence hard decisions.
Here is the test: when a team is behind on a deadline and suggests skipping integration tests to hit the date, what happens? If the answer is "we skip the tests and plan to add them next sprint" (and next sprint never comes), your real culture is "deadlines beat quality." The document on the wiki is irrelevant.
Pick three to five operating principles that you will actually enforce during painful tradeoffs. Stripe's "move with urgency" works because they pair it with "be meticulous." The tension between those two forces real conversations about what to cut. Single-axis values like "go fast" or "be excellent" are meaningless because they have no opposing force.
Then reference those principles constantly. In sprint planning: "We are choosing to delay this feature by a week because our principle is that we ship with tests." In promotion discussions: "This person consistently made pragmatic tradeoffs under pressure, which is exactly what our operating principle asks for." Repetition in context is how values stick.
Rituals That Scale (With Specific Formats)
Weekly engineering digest email. This is the single most impactful ritual for orgs above 50 engineers. One person (rotating monthly) compiles: three notable ships, one interesting technical decision with context, one ongoing challenge. Keep it under 500 words. Send it every Friday. Shopify, Stripe, and Linear all run variations of this. It costs almost nothing and transfers enormous amounts of context across teams.
Demo days, restructured for scale. At 20 people, demos are informal. At 200, they need structure or they become a chore. Cap presentations at four minutes with a hard cutoff. Limit to six demos per session. Have a rotating selection committee so the same teams do not dominate. Record everything and make recordings searchable. The audience should be optional; the recordings should not be.
Architecture review office hours. Once a week, two senior engineers hold open office hours where anyone can bring a design question. No formal RFC required. This catches bad architectural decisions early and spreads knowledge laterally. It also surfaces what teams are working on, which is valuable context that planning documents rarely capture.
Quarterly retros at the org level. Not a retrospective on a single sprint. A structured discussion: what worked this quarter, what did not, what should we change? Limit to 90 minutes. Publish the output. Track whether the "what should we change" items actually change. If they do not, stop running the retro. Performative reflection is worse than no reflection.
The Decisions That Actually Define Culture
Culture is shaped less by rituals and more by a handful of high-stakes decisions that everyone watches.
Who gets promoted. If you promote the engineer who shipped a critical feature solo by working 80-hour weeks, you have told every other engineer that heroics matter more than sustainable practices. If you promote the engineer who built a platform that made five other teams faster, you have said leverage matters. Your promotion criteria are your real values document.
Who gets fired (or managed out). A brilliant engineer who belittles others in code reviews, blocks PRs for stylistic reasons, and refuses to do on-call teaches the entire org that being smart enough earns you different rules. Basecamp, Stripe, and Netflix have all written publicly about the cost of keeping toxic high performers. The math is consistent: every month they stay, you lose one to two good people who leave quietly.
How incidents are handled. Blameless postmortems are easy to promise and hard to practice. The first time a junior engineer causes an outage and their manager publicly asks "how did we let this happen?" in the incident channel, every engineer in the org learns that blameless is aspirational, not real. Etsy and Google built strong incident cultures by relentlessly focusing on systemic causes, and leaders modeling that behavior themselves during high-stress moments.
Measuring Whether It Is Working
Surveys are useful if you follow three rules: ask the same questions every quarter so you can trend, keep them short (ten questions maximum), and close the loop publicly.
The questions that matter most: "Do you feel safe raising concerns about technical decisions?" (psychological safety), "Do you understand how your work connects to the company's goals?" (alignment), "Would you recommend this team to a friend?" (overall satisfaction). Free-text responses are more valuable than scores. Read every single one.
Beyond surveys, watch three signals. Voluntary attrition by tenure: if people consistently leave at 18 months, your growth opportunities are broken. Internal transfers: healthy orgs see engineers moving between teams, not just leaving the company. Time-to-backfill: if open roles take six months to fill, your reputation in the market is a problem that internal surveys will not catch.
Key Points
- •Culture is a set of decisions people make when no one is looking. It shows up in code review tone, incident response urgency, and whether someone speaks up when a deadline is unrealistic
- •The strongest scaling ritual is a written weekly engineering digest. It transfers context across teams that never overlap in meetings
- •Promotion criteria are your real values document. If you promote the hero who ships alone at 2 AM, you have declared that collaboration is optional
- •Subcultures across teams are inevitable and healthy. The failure mode is when subcultures conflict with each other on basics like code ownership or on-call expectations
- •Survey data is only useful if you close the loop publicly. Share results, name the top three problems, and commit to fixing one per quarter
Common Mistakes
- ✗Adopting the Spotify model without understanding that Spotify itself moved away from it. Squad autonomy without strong alignment mechanisms produces duplication and drift
- ✗Hiring for 'culture fit' when you mean 'people like us.' This kills diversity and creates blind spots that compound as you scale
- ✗Treating all-hands meetings as culture-building when they are actually status broadcasts. Real culture forms in the 5-person conversations after the all-hands
- ✗Letting a toxic high performer stay because they ship fast. Every month they remain, two good engineers quietly start interviewing elsewhere