Docker Optimization: Multi-Stage Builds and Security

Introduction

Modern software delivery relies heavily on containers. Every commit pushed to a repository can trigger CI/CD pipelines that build, scan, test, and deploy Docker images across multiple environments. While Docker simplifies application packaging, poorly optimized images create significant operational and financial cost at scale.

For organizations running hundreds or thousands of services, image size directly impacts storage consumption, network bandwidth, deployment speed, and infrastructure costs. A difference of a few hundred megabytes per image may appear insignificant on a developer's laptop, but when multiplied across thousands of builds and deployments, it can result in terabytes of additional storage and substantial cloud expenditure.

In this guide I have mentioned the Docker optimization and security practices commonly used in large-scale environments, covering base image selection, deterministic builds, multi-stage builds, cache optimization, and container hardening techniques after reading different case studies as well as my experience.

Why Docker Optimization Matters

Container images are built once but consumed repeatedly throughout their lifecycle.

A single image may be:

Built in CI/CD pipelines
Stored in container registries
Pulled by development environments
Downloaded by Kubernetes worker nodes
Replicated across regions
Deployed multiple times per day

Every additional megabyte increases:

Registry storage requirements
Network transfer costs
Deployment latency
CI/CD execution time
Cluster startup time

As organizations scale, image optimization becomes a business concern rather than merely a technical preference.

Choosing the Right Base Image

The base image forms the foundation of every Docker image. Selecting an inappropriate base image can significantly increase image size and expand the attack surface.

Many developers begin with:

FROM ubuntu:latest

While functional, this approach often includes packages, utilities, and dependencies that are never used by the application.

A better strategy is selecting the smallest image capable of running the workload.

Never Use Mutable Tags

One of the most common mistakes is relying on mutable tags such as:

FROM node:latest

The latest tag changes over time.

An image built today may differ from an image built next month despite using the same Dockerfile. This introduces non-deterministic builds and makes debugging significantly harder, because reproducing the same bug becomes difficult as the latest version keeps changing.

Instead, use versioned tags:

FROM node:22.4-alpine

For maximum reproducibility, pin the image digest, the best option:

FROM node@sha256:<digest>

Benefits:

Reproducible builds
Predictable deployments
Easier troubleshooting
Reduced supply-chain risk

Alpine vs Distroless

Choosing the correct runtime image has a significant impact on both image size and security.

Alpine Linux

Alpine Linux is widely used because of its small footprint.

Typical image size:

Image	Approximate Size
Ubuntu	70–80 MB
Debian Slim	20–30 MB
Alpine	~5 MB

Example:

FROM node:22-alpine

Advantages:

Small image size
Fast downloads
Includes package manager
Easy debugging

Disadvantages:

Uses musl libc instead of glibc - I know some on you do not understand this, let me explain you with simple analogy. Think of glibc and musl as two different dialects of the same language.

Most software on Linux is written expecting the "glibc dialect." Alpine speaks the "musl dialect." They are very similar, but occasionally a program may use words or phrases that musl doesn't understand, causing compatibility issues.
Some packages may require additional configuration

Distroless Images

Distroless images, originally developed by Google, remove everything unnecessary for application execution.

Distroless images contain:

Runtime dependencies
Required libraries

Distroless images do not contain:

Bash
Shell utilities
Package managers
Curl
Wget

Example:

FROM gcr.io/distroless/nodejs22

Advantages:

Smaller attack surface
Fewer vulnerabilities
Smaller runtime footprint
Better compliance posture

Disadvantages:

No shell access
More difficult debugging
Requires stronger observability practices

For production workloads, Distroless images are increasingly becoming the preferred choice.

To use an analogy, if configuring a standard Ubuntu image is like packing a massive survival kit, a tent, and a cooking stove for a brief 30-minute hike, selecting a Distroless image is like packing nothing but a water bottle and a map. Distroless images strip away everything that is not explicitly required by the application runtime. They contain no package managers (apk, apt), no interactive shells (bash, sh), and no common networking utilities (curl, wget).

Multi-Stage Builds: The Most Important Docker Optimization

Multi-stage builds separate build dependencies from runtime dependencies.

Without multi-stage builds, production images often include:

Source code
Compilers
Build tools
Package managers
Test dependencies

Most of these components are unnecessary after the application has been compiled.

In order to include multiple build stages just use FROM at the starting of each stage.

Multi-Stage Build for React Applications

A common React image might exceed 1 GB if all build dependencies are shipped to production.

Using multi-stage builds:

# Build Stage
FROM node:22-alpine AS builder

WORKDIR /app

COPY package*.json ./

RUN npm ci

COPY . .

RUN npm run build

# Runtime Stage
FROM nginx:alpine

COPY --from=builder /app/build /usr/share/nginx/html

EXPOSE 80

CMD ["nginx","-g","daemon off;"]

Only the generated static files are copied into the final image.

The Node.js toolchain never reaches production.

Multi-Stage Build for Golang

Go applications benefit even more because they can be compiled into standalone binaries.

# Build Stage
FROM golang:1.23-alpine AS builder

WORKDIR /app

COPY go.mod go.sum ./

RUN go mod download

COPY . .

RUN CGO_ENABLED=0 GOOS=linux go build -o app .

# Runtime Stage
FROM alpine:3.20

RUN apk add --no-cache ca-certificates

COPY --from=builder /app/app /app

CMD ["/app"]

The final image contains only:

Binary
Certificates

Everything else remains in the build stage.

Real-World Multi-Stage Build Results

Several organizations have publicly documented the benefits of image optimization.

Organization	Optimization	Result
AWS CodeBuild	Persistent Docker caching	Build times reduced from 24 minutes to 16 seconds
Sealos	Layer consolidation and image optimization	800 GB → 2.05 GB
Wayfair	Layer cleanup and package cache optimization	324 MB → 23 MB
Node.js Application Example	Multi-stage build	1.35 GB → 135 MB

These numbers demonstrate why image optimization is a critical engineering practice rather than an optional enhancement.

Layer Optimization and Image Hygiene

Docker images are composed of layers.

Every instruction creates a new layer:

RUN apt update
RUN apt install curl
RUN apt install git

This creates multiple layers.

A better approach:

RUN apt-get update && \
    apt-get install -y curl git && \
    apt-get clean && \
    rm -rf /var/lib/apt/lists/*

This reduces unnecessary image growth.

The Financial Impact of Layer Optimization

Container optimization directly affects cloud costs.

A notable example comes from Sealos, where excessive image growth caused serious operational challenges.

After optimizing image layers:

Metric	Before	After
Image Size	800 GB	2.05 GB
Pull Time	75 sec	26 sec
Disk I/O	120 MB/s	26 MB/s
Monthly Storage Cost	~$520	~$70

Results:

390× image size reduction
65% faster pulls
78% lower disk utilization
87% lower storage costs

These improvements compound significantly across large Kubernetes clusters.

Mastering Docker Build Cache

Build cache optimization dramatically reduces CI/CD execution time.

Docker caches layers sequentially. Docker has top-down approach for caching.

If an early layer changes, every subsequent layer must be rebuilt.

Consider:

FROM node:22-alpine

WORKDIR /app

COPY . .

RUN npm install

Every source code change invalidates the dependency layer.

Docker must reinstall all packages.

The Dependency-First Pattern

A better approach:

FROM node:22-alpine

WORKDIR /app

COPY package*.json ./

RUN npm ci

COPY . .

Now dependencies are rebuilt only when package manifests change. Earlier without it the dependency layer was rebuilt again repeatedly anytime a change in source code occured rather than in the dependencies itself.

This pattern should be applied across all ecosystems.

Language	Dependency Files
Node.js	package.json, package-lock.json
Python	requirements.txt, pyproject.toml
Go	go.mod, go.sum
Rust	Cargo.toml, Cargo.lock

.dockerignore: The Most Overlooked Optimization

Many developers forget to create a .dockerignore file. There are a lot of files and directories that we do not need in our final image so it helps us to ignore all of them. It is similar to .gitignore file in git folder.

Example:

.git
node_modules
coverage
dist
.env
*.log

Benefits:

Smaller build context
Faster builds
Reduced cache invalidation
Lower network transfer

BuildKit Cache Mounts

Docker BuildKit introduces persistent caching.

Example:

RUN --mount=type=cache,target=/root/.cache/pip \
    pip install -r requirements.txt

Benefits:

Faster dependency downloads
Reduced network traffic
Improved CI/CD performance

This becomes particularly valuable for Python and Node.js applications with large dependency trees.

Container Security Best Practices

Optimization is only half the story.

Security must be built directly into container images.

Never Run Containers as Root

Bad:

CMD ["node","server.js"]

Better:

RUN groupadd -r appgroup && \
    useradd -r -g appgroup appuser

USER appuser

CMD ["node","server.js"]

Running as a non-root user reduces the impact of container compromise. Even if some hacker gets access to docker container he cannot access docker host unless the container itself is running using root user.

Drop Linux Capabilities

Most applications do not require elevated kernel capabilities. Linux capabilities as special powers that a process inside a container can have.

Traditionally, Linux had only two privilege levels:

Root → can do almost anything
Non-root → limited permissions

Linux capabilities break root privileges into smaller pieces.

For example:

Capability	Allows
`NET_BIND_SERVICE`	Bind to ports below 1024 (80, 443)
`SYS_ADMIN`	Mount filesystems, change kernel settings
`NET_ADMIN`	Modify network interfaces and routing
`SYS_TIME`	Change system time

By default, Docker containers get several capabilities.

Imagine your web application gets hacked.

If the container still has unnecessary capabilities, the attacker now has extra powers they can abuse.

Recommended:

docker run \
  --cap-drop=ALL \
  --cap-add=NET_BIND_SERVICE

Let's understand this piece of code:

--cap-drop=ALL zero capabilities, now the container has no special powers.

--cap-add=NET_BIND_SERVICE add this capability, Linux reserves ports below 1024.

80 -> HTTP
443 -> HTTPS
22 -> SSH

A normal process cannot bind to these ports.

If your container hosts a web server:

nginx

it needs to listen on port:

So you grant only:

--cap-add=NET_BIND_SERVICE

Now the application can use port 80 but cannot perform other privileged operations.

This minimizes attack vectors.

Use Read-Only Filesystems

Normally, a container can write files inside its filesystem.

For example:

touch malware.sh
echo "hacked" > secret.txt
mkdir backdoor

The container can modify its own filesystem.

When you enable:

securityContext:
  readOnlyRootFilesystem: true

the container's root filesystem becomes:

READ ONLY

The application can read files but cannot modify them.

For production workloads:

securityContext:
  readOnlyRootFilesystem: true

Benefits:

Prevents tampering
Blocks malware persistence
Reduces lateral movement opportunities

Never Store Secrets in Images

Avoid:

ENV DATABASE_PASSWORD=supersecret

Secrets become visible through:

docker history
docker inspect

Use:

Kubernetes Secrets
AWS Secrets Manager
HashiCorp Vault
External secret operators

Secrets should be injected at runtime, not hardcoded into images.

Continuously Scan Images

Every build should include vulnerability scanning.

Popular tools:

Trivy
Grype
Docker Scout
Snyk

Scanning should be integrated into CI/CD pipelines and block deployments when critical vulnerabilities are detected.

Final Thoughts

Docker optimization is not simply about reducing image size. It directly impacts deployment speed, infrastructure costs, CI/CD efficiency, cluster scalability, and security posture.

Organizations operating at scale invest heavily in:

Minimal base images
Deterministic builds
Multi-stage Dockerfiles
Layer optimization
Build cache strategies
Runtime hardening
Continuous vulnerability scanning

The combined impact is substantial. Faster builds improve developer productivity, smaller images reduce cloud costs, and hardened containers significantly reduce security risk. Whether you're running a single application or managing hundreds of microservices, these practices form the foundation of production-grade container engineering.

The Ultimate Guide to Docker Optimization and Security

Introduction

Why Docker Optimization Matters

Choosing the Right Base Image

Never Use Mutable Tags

Alpine vs Distroless

Alpine Linux

Distroless Images

Multi-Stage Builds: The Most Important Docker Optimization

Multi-Stage Build for React Applications

Multi-Stage Build for Golang

Real-World Multi-Stage Build Results

Layer Optimization and Image Hygiene

The Financial Impact of Layer Optimization

Mastering Docker Build Cache

The Dependency-First Pattern

.dockerignore: The Most Overlooked Optimization

BuildKit Cache Mounts

Container Security Best Practices

Never Run Containers as Root

Drop Linux Capabilities

Use Read-Only Filesystems

Never Store Secrets in Images

Continuously Scan Images

Final Thoughts

Comments

More from this blog

Serving Up Infrastructure: A 2-Tier AWS Architecture Explained

Build your first - Production grade EKS cluster with Terraform

Building a Serverless Image Processing Pipeline on AWS with Terraform

An Introduction to Prometheus for Effective DevOps Monitoring

Command Palette

Introduction

Why Docker Optimization Matters

Choosing the Right Base Image

Never Use Mutable Tags

Alpine vs Distroless

Alpine Linux

Distroless Images

Multi-Stage Builds: The Most Important Docker Optimization

Multi-Stage Build for React Applications

Multi-Stage Build for Golang

Real-World Multi-Stage Build Results

Layer Optimization and Image Hygiene

The Financial Impact of Layer Optimization

Mastering Docker Build Cache

The Dependency-First Pattern

.dockerignore: The Most Overlooked Optimization

BuildKit Cache Mounts

Container Security Best Practices

Never Run Containers as Root

Drop Linux Capabilities

Use Read-Only Filesystems

Never Store Secrets in Images

Continuously Scan Images

Final Thoughts

Comments

More from this blog