Basic idea
Scaling means running more copies of the same application so load can be handled better and failure becomes less risky.
This page explains why running more than one container instance becomes important when traffic, resilience, or uptime expectations increase.
Scaling means running more copies of the same application so load can be handled better and failure becomes less risky.
If one container becomes overloaded or stops, other running replicas can continue serving traffic.
Once multiple copies exist, traffic routing, session behavior, and shared state become much more important.
Users -> load balancer / proxy -> multiple app replicas
Scaling is rarely just “start more containers.” It also requires a way to send traffic to them properly.
Scaling a stateful app without first thinking about sessions, uploads, local file writes, or database coordination.
Stateless services are usually easier to scale because each replica can behave almost the same. Stateful behavior is harder because each instance may depend on shared sessions, files, or data coordination.
Next page: Lesson 28 explains the bigger orchestration concepts that support scaled container systems.