Database Internals

"The database is the heart of any application."

Understanding database internals helps you write efficient queries, choose the right data model, and optimize performance under load.

Overview

This section covers the two most critical databases for backend engineers:

MySQL

MySQL Overview - Architecture, storage engines, and fundamentals
Architecture & Storage Engines - InnoDB vs MyISAM, query execution flow
Indexes - B+ Tree, leftmost prefix rule, optimization
Transactions - ACID, isolation levels, MVCC
Locking - Lock types, deadlock detection
Logging & Replication - Binlog, redo log, undo log, master-slave
SQL Optimization - EXPLAIN analysis, query tuning

Redis

Redis Overview - Fundamentals and use cases
Data Structures - String, Hash, List, Set, ZSet
Persistence - RDB vs AOF, durability
Cluster & Sentinel - High availability and scaling
Caching Patterns - Cache strategies and invalidation

Advanced Topics

Deadlock Prevention - Detect and prevent deadlocks
Connection Pooling - Connection pool configuration
NoSQL Comparison - MySQL vs Redis vs MongoDB

Quick Reference

MySQL Indexing

Type	Description	Use Case
B+ Tree	Balanced tree, range queries	Primary keys, most columns
Primary	Clustered index, data stored with index	Primary key
Secondary	Non-clustered, stores primary key value	Frequent queries
Composite	Multi-column	Multi-condition WHERE

Transaction Isolation Levels

Level	Dirty Read	Non-Repeatable	Phantom	Performance
Read Uncommitted	✅	✅	✅	Fastest
Read Committed (RC)	❌	✅	✅	Fast
Repeatable Read (RR)	❌	❌	⚠️*	Medium
Serializable	❌	❌	❌	Slowest

*MySQL InnoDB prevents phantom reads with Gap Locks

Redis Data Structures

Structure	Use Case	Commands
String	Simple cache, counters	`GET`, `SET`, `INCR`
Hash	Object storage	`HGET`, `HSET`, `HGETALL`
List	Message queues, feeds	`LPUSH`, `RPOP`, `LRANGE`
Set	Unique items, tags	`SADD`, `SMEMBERS`, `SINTER`
Sorted Set	Leaderboards, rankings	`ZADD`, `ZRANGE`, `ZRANK`

Common Pitfalls

MySQL

❌ Missing index on WHERE column → Full table scan
❌ Function on column (WHERE YEAR(date) = 2024) → Index invalidation
❌ Deep pagination (LIMIT 1000000, 10) → Scans millions of rows
✅ Use EXPLAIN to analyze queries
✅ Add indexes based on query patterns
✅ Monitor slow query logs

Redis

❌ No TTL set → Memory leak
❌ Cache stampede → Database overload
❌ Cache penetration → Repeated queries for non-existent data
✅ Always set TTL for cache data
✅ Use appropriate data structures (Hash for objects)
✅ Monitor memory usage and hit rate

Learning Path

For interviews:

MySQL Architecture → Indexes → Transactions → Locking
Redis Data Structures → Caching Patterns → Persistence

For production:

MySQL Optimization → Indexes → Transactions → Replication
Redis Caching Patterns → Persistence → Cluster

Production Tips

Always use connection pooling (HikariCP for Java)
Add indexes based on query patterns, not guesses
Monitor slow query logs regularly
Set appropriate TTLs for Redis cache entries
Use read replicas for read-heavy workloads

Overview​

MySQL​

Redis​

Advanced Topics​

Quick Reference​

MySQL Indexing​

Transaction Isolation Levels​

Redis Data Structures​

Common Pitfalls​

MySQL​

Redis​

Learning Path​

Overview

MySQL

Redis

Advanced Topics

Quick Reference

MySQL Indexing

Transaction Isolation Levels

Redis Data Structures

Common Pitfalls

MySQL

Redis

Learning Path