Redis
The in-memory store you'll reach for first when latency matters
Most engineers working on backend systems that need to be fast have already used Redis. It started as a caching layer, but these days it sits at the center of session management, rate limiting, real-time analytics, and a dozen other use cases. The reason it stuck around while other tools came and went is simple: it delivers sub-millisecond latency with data structures that actually match real problems, not just key-value pairs.
That said, Redis is not a database replacement. Treating it as one ends badly, usually at 3 AM during a traffic spike.
How It Works Internally
Redis runs a single-threaded event loop. It uses epoll on Linux and kqueue on macOS for I/O multiplexing. No locks, no context switches. A single instance on modern hardware pushes 100,000 to 200,000 ops/sec without breaking a sweat. Since Redis 6.0, I/O threading offloads socket reads and writes to background threads while command execution stays single-threaded, roughly doubling throughput on multi-core machines.
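A rough way to get a feel for latency from the client side is to time plain round trips with redis-py. This is only a sketch, assuming a local instance on the default port; it measures network round trips, not the server's raw command throughput, which is why the numbers come out far below the 100K+ ops/sec figure (redis-benchmark with many connections and pipelining is the real tool for that).

```python
import time
import redis  # pip install redis

# Assumes a local Redis on the default port; adjust host/port for your setup.
r = redis.Redis(host="localhost", port=6379)

N = 10_000
start = time.perf_counter()
for i in range(N):
    r.set(f"bench:{i}", "x")  # one full client -> server -> client round trip each
elapsed = time.perf_counter() - start

# Round-trip time dominates here, not command execution inside the event loop.
print(f"{N / elapsed:,.0f} ops/sec, {elapsed / N * 1e6:.0f} us per round trip")
```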
The data structures are where Redis gets interesting. Sorted sets use a skip list for O(log N) range queries alongside a hash table for O(1) point lookups. Small hashes and lists use a compact contiguous encoding (ziplist, replaced by listpack in Redis 7.0) that packs entries into one allocation and cuts pointer overhead. Strings of 44 bytes or less use the embedded embstr SDS (Simple Dynamic Strings) encoding to skip a separate allocation. The practical takeaway: memory efficiency varies a lot based on how the data is shaped. A hash with 100 small fields uses far less memory than 100 individual string keys. Consider the data model before writing keys.
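You can see the effect directly with MEMORY USAGE. A minimal sketch using redis-py, with made-up key names; exact byte counts depend on your Redis version and the hash encoding thresholds:

```python
import redis

r = redis.Redis(decode_responses=True)

fields = {f"field:{i}": "v" * 8 for i in range(100)}

# One hash with 100 small fields (small enough to stay in the compact encoding).
r.delete("user:1000")
r.hset("user:1000", mapping=fields)

# The same data as 100 separate string keys.
for name, value in fields.items():
    r.set(f"user:1000:{name}", value)

hash_bytes = r.memory_usage("user:1000")
string_bytes = sum(r.memory_usage(f"user:1000:{name}") for name in fields)
print(f"hash: {hash_bytes} bytes, 100 strings: {string_bytes} bytes")
```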
Persistence works in two flavors. RDB snapshots fork the process and serialize the dataset using copy-on-write, producing compact point-in-time backups. AOF (Append-Only File) logs every write command. Fsync can be configured per command, per second, or never. Most production setups run both: AOF with appendfsync everysec for durability, plus periodic RDB snapshots for fast restores and offsite backups. Neither is perfect. RDB can lose up to a few minutes of data. AOF files grow large and need periodic rewrites. Pick the tradeoff.
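The usual production combination can be applied at runtime with CONFIG SET, though it should also live in redis.conf so it survives restarts. A sketch with redis-py; the snapshot schedule is just an example value:

```python
import redis

r = redis.Redis()

# AOF with a once-per-second fsync: at most roughly one second of writes at risk.
r.config_set("appendonly", "yes")
r.config_set("appendfsync", "everysec")

# RDB snapshot schedule: here, save if 1000+ keys changed within 300 seconds.
# CONFIG SET alone does not persist across restarts unless you also run
# CONFIG REWRITE or mirror the setting in redis.conf.
r.config_set("save", "300 1000")

print(r.config_get("appendonly"), r.config_get("save"))
```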
Production Architecture
Redis Cluster partitions the keyspace into 16,384 hash slots spread across primary nodes. Each primary replicates asynchronously to one or more replicas. When resharding is needed, the MIGRATE command atomically moves keys between nodes while clients follow MOVED and ASK redirections. It works, but it adds operational weight. Redis Sentinel handles automatic failover for non-clustered setups by monitoring primaries and promoting replicas when health checks fail. Failover typically completes within 30 seconds.
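From the application side, the cluster-aware client hides most of this. A hedged sketch using redis-py's cluster support (the redis.cluster import path assumes redis-py 4.x or later; the node address is illustrative). Hash tags are the one detail worth internalizing, since they decide which keys can participate in multi-key commands:

```python
from redis.cluster import RedisCluster  # redis-py 4.x+

# Any reachable node works as a seed; the client discovers the topology and
# follows MOVED/ASK redirections transparently.
rc = RedisCluster(host="10.0.0.11", port=6379)

# Keys map to one of 16,384 slots via CRC16(key) % 16384. Only the part inside
# {braces} is hashed, so a shared hash tag pins related keys to the same slot
# and keeps multi-key operations legal.
rc.set("{user:42}:profile", "...")
rc.set("{user:42}:settings", "...")
print(rc.get("{user:42}:profile"))
```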
Here is practical advice from running this in production. Deploy at least three Sentinel instances across separate failure domains. Set replica-read-only yes and send read traffic to replicas if the workload skews read-heavy. Watch replication lag through INFO replication. And set min-replicas-to-write to stop a partitioned primary from accepting writes that will vanish once the partition heals. This one setting has saved me from data loss more than once.
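Clients should discover the current primary through Sentinel rather than hard-coding it, so a failover does not strand them on a demoted node. A minimal sketch with redis-py's Sentinel helper; the Sentinel addresses and the "mymaster" service name are assumptions for illustration:

```python
from redis.sentinel import Sentinel

# Three Sentinels in separate failure domains; addresses are illustrative.
sentinel = Sentinel(
    [("sentinel-a", 26379), ("sentinel-b", 26379), ("sentinel-c", 26379)],
    socket_timeout=0.5,
)

# Writes go to whatever Sentinel currently considers the primary for "mymaster";
# reads can be spread across replicas on read-heavy workloads.
primary = sentinel.master_for("mymaster", socket_timeout=0.5)
replica = sentinel.slave_for("mymaster", socket_timeout=0.5)

primary.set("session:abc", "payload")
print(replica.get("session:abc"))  # may briefly lag behind the primary
```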
Capacity Planning
A single Redis instance on an r6g.xlarge (32GB RAM) handles about 200K ops/sec for simple GET/SET. Budget 2x to 3x the dataset size in RAM. That headroom is needed for fragmentation, replication buffers, and copy-on-write overhead during RDB saves. People consistently underestimate this. Instagram, as a reference point, stored over 300 million media-ID-to-user-ID mappings in Redis and kept the footprint to about 5GB through aggressive hash encoding optimizations.
Keep an eye on used_memory_rss versus used_memory. If the fragmentation ratio crosses 1.5, significant memory is being wasted. Track evicted_keys, keyspace_misses, and connected_clients as the primary health signals. For pure cache workloads, set maxmemory-policy to allkeys-lru. For mixed workloads where some keys need to stick around, a volatile-* policy (volatile-lru or volatile-ttl) is the better choice, since it only evicts keys that carry a TTL.
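All of these signals come out of INFO, so a small poller is enough to start with. A sketch using redis-py, with the thresholds from above; field names follow the standard INFO sections:

```python
import redis

r = redis.Redis()

mem = r.info("memory")
stats = r.info("stats")
clients = r.info("clients")

frag = mem["mem_fragmentation_ratio"]   # used_memory_rss / used_memory
used = mem["used_memory"]
maxmemory = mem.get("maxmemory", 0)

if frag > 1.5:
    print(f"fragmentation ratio {frag:.2f}: consider activedefrag / MEMORY PURGE")
if maxmemory and used > 0.8 * maxmemory:
    print("used_memory above 80% of maxmemory")

print("evicted_keys:", stats["evicted_keys"])
print("keyspace_misses:", stats["keyspace_misses"])
print("connected_clients:", clients["connected_clients"])
```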
Failure Scenarios
Scenario 1: Split-brain during a network partition. The primary gets cut off from Sentinel and its replicas but stays reachable by some app servers. It keeps accepting writes while Sentinel promotes a replica on the other side. When the partition heals, the old primary demotes itself and resynchronizes. Every write it accepted during the split is gone, permanently. Limit the damage by setting min-replicas-to-write 1 with min-replicas-max-lag 10 so the isolated primary rejects writes once it loses contact with its replicas. Monitor master_link_down_since_seconds on replicas to catch this early.
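Both the guard and the early-warning signal can be wired up from a client. A sketch with redis-py, assuming illustrative hostnames; the thresholds match the scenario above:

```python
import redis

primary = redis.Redis(host="redis-primary")     # hostnames are illustrative
replica = redis.Redis(host="redis-replica-1")

# Refuse writes on the primary unless at least one replica is connected and
# no more than 10 seconds behind. An isolated primary then fails writes fast
# instead of silently accepting data that will be discarded after failover.
primary.config_set("min-replicas-to-write", "1")
primary.config_set("min-replicas-max-lag", "10")

repl = replica.info("replication")
if repl.get("master_link_status") != "up":
    down_for = repl.get("master_link_down_since_seconds", "unknown")
    print(f"replica lost its primary {down_for}s ago -- possible partition")
```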
Scenario 2: Memory exhaustion from missing TTLs. Someone ships a feature that writes session data without TTLs. Memory climbs slowly. Nobody notices until maxmemory fills up and the eviction policy starts deleting cache keys. The result is a stampede of cache misses hammering the database. I have seen this take down a production database backend during a traffic peak. The fix: alert when used_memory passes 80% of maxmemory, audit key namespaces regularly with MEMORY USAGE and OBJECT IDLETIME, and enforce TTL policies at the application layer for every cache key. No exceptions.
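One way to enforce the rule at the application layer is to route every cache write through a helper that makes the TTL mandatory, and to audit namespaces for keys that slipped through. A sketch with redis-py; the helper names, key pattern, and TTL value are made up for illustration:

```python
import redis

r = redis.Redis(decode_responses=True)

DEFAULT_TTL = 3600  # seconds; pick per namespace

def cache_set(key: str, value: str, ttl: int = DEFAULT_TTL) -> None:
    """Every cache write goes through here, so a TTL is never optional."""
    r.setex(key, ttl, value)

def audit_namespace(pattern: str = "session:*", limit: int = 1000) -> None:
    """Flag keys with no TTL, plus their size and idle time (SCAN, not O(N) KEYS)."""
    for i, key in enumerate(r.scan_iter(match=pattern, count=500)):
        if i >= limit:
            break
        if r.ttl(key) == -1:  # key exists but has no expiry
            size = r.memory_usage(key)
            idle = r.object("idletime", key)
            print(f"{key}: no TTL, {size} bytes, idle {idle}s")

cache_set("session:abc", "serialized-session-blob")
audit_namespace()
```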
Pros
- • Sub-millisecond latency for reads and writes
- • Rich data structures: lists, sets, sorted sets, hashes, streams
- • Built-in replication and Lua scripting
- • Persistence options (RDB snapshots, AOF)
- • Cluster mode for horizontal scaling
Cons
- • RAM-bound, so your entire dataset must fit in memory
- • Single-threaded command execution
- • Cluster mode adds real operational complexity
- • No query language for complex lookups
When to use
- • You need sub-millisecond reads and writes
- • Caching hot data in front of a slower database
- • Real-time counters, leaderboards, or rate limiters
- • Session management across multiple app servers
When NOT to use
- • Your dataset is much larger than available RAM
- • You need full ACID transactions with joins
- • Primary long-term storage for data you cannot lose
- • Complex relational queries
Key Points
- • The single-threaded event loop hits 100K+ ops/sec by dodging context switches and lock contention entirely
- • RDB takes point-in-time snapshots via fork() and copy-on-write. AOF logs every write for durability, but at the cost of disk I/O and periodic rewrite overhead
- • Cluster mode splits the keyspace into 16,384 hash slots using CRC16. Slot migration enables live resharding without downtime
- • If the memory fragmentation ratio (used_memory_rss / used_memory) climbs above 1.5, jemalloc is struggling. Run MEMORY PURGE or enable activedefrag
- • Pub/Sub is fire-and-forget with zero persistence. For consumer groups, acknowledgment, and replay, use Redis Streams instead
- • Sorted sets use skip lists internally for O(log N) inserts and range queries. They are the backbone of leaderboards at places like Riot Games and Discord (see the sketch after this list)
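The leaderboard point in practice: a minimal sketch with redis-py, using a made-up key and player names. ZADD and ZINCRBY keep the set ordered as scores change, and rank or top-N reads never scan the whole set:

```python
import redis

r = redis.Redis(decode_responses=True)

# Record scores: each write lands in the right position in O(log N).
r.zadd("leaderboard:global", {"alice": 3120, "bob": 2890, "carol": 3305})
r.zincrby("leaderboard:global", 150, "bob")  # bob just won a match

# Top 3, highest score first, in O(log N + M).
print(r.zrevrange("leaderboard:global", 0, 2, withscores=True))

# A single player's rank (0-based) and score without scanning anything.
print(r.zrevrank("leaderboard:global", "bob"), r.zscore("leaderboard:global", "bob"))
```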
Common Mistakes
- ✗ Skipping maxmemory and eviction policy config. Redis will eat all available RAM, the OOM killer fires, and the process (maybe the host) goes down
- ✗ Running KEYS in production. It scans the entire keyspace in O(N), blocking everything else. Use SCAN with cursor-based iteration (see the sketch after this list)
- ✗ Ignoring memory fragmentation. High fragmentation wastes 30-50% of allocated memory. Monitor INFO memory and turn on activedefrag in Redis 4.0+
- ✗ Not using pipelining for batch operations. Round-trip latency kills throughput. Pipelining delivers 5-10x improvement on bulk workloads (see the sketch after this list)
- ✗ Storing values larger than 100KB. Big values block the event loop during serialization and transfer. Break them into smaller keys or use hashes
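The SCAN and pipelining points above, combined in one hedged sketch with redis-py. The key pattern and TTL are illustrative; the idea is that iteration stays incremental and the batch of fixes goes out in a handful of round trips instead of one per key:

```python
import redis

r = redis.Redis(decode_responses=True)

# Instead of KEYS "session:*" (O(N), blocks the event loop), iterate with SCAN:
# each call inspects only a bounded slice of the keyspace.
stale = [key for key in r.scan_iter(match="session:*", count=500) if r.ttl(key) == -1]

# Instead of one round trip per command, pipeline the batch. transaction=False
# sends plain pipelined commands rather than wrapping them in MULTI/EXEC.
pipe = r.pipeline(transaction=False)
for key in stale:
    pipe.expire(key, 3600)
results = pipe.execute()
print(f"backfilled TTLs on {sum(results)} keys")
```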