Kubernetes Course Labs

Updates with Staged Rollouts

Pod controllers manage Pods for you - when you update the Pod spec the controller rolls out the change by removing old Pods and creating new ones. You'll do this all the time - every OS patch, library update and new feature will be an update. Depending on your app, config changes might need a rollout too.

You can configure the controller to tweak how the rollout happens - you might choose a slow but safe update for a critical component. Deployment objects are the typical Pod controller, but all the controllers have rollout options.
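For Deployments, the rollout behaviour is set in the strategy section of the spec. This is just a sketch with illustrative values - the real specs used in this lab are in the labs/rollouts/specs folders:

# the strategy sits alongside replicas and the Pod template in the Deployment spec:
spec:
  replicas: 3
  strategy:
    type: RollingUpdate      # the default - old Pods are replaced gradually
    rollingUpdate:
      maxSurge: 1            # how many extra Pods can be created during the rollout
      maxUnavailable: 1      # how many Pods can be offline during the rollout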


Fast staged rollouts

We'll start with a simple web application:

Open a new terminal - split your screen if you can - and run a watch command to see Pods come online:

# there won't be any Pods to start with:
kubectl get po -l app=vweb --watch

📋 In your main terminal, deploy the app from the labs/rollouts/specs/vweb folder, and make an HTTP request to the /v.txt URL.

Create the resources:

kubectl apply -f labs/rollouts/specs/vweb

In the watch window you'll see the Pods start - the init container runs first, then the Pod initializes and enters the Running state.

Check the Services and you'll see there's a NodePort listening on port 30018:

kubectl get svc 

curl localhost:30018/v.txt

The output from the app is just the text v1.

This update to the Deployment will cause a fast rollout to v2:

Deploy the update in your first terminal session, so you can see the changes in the watch session:

kubectl apply -f labs/rollouts/specs/vweb/update-fast

📋 What do you see in the terminal with the Pod watch? How does the change get made?

3 new Pods are created straight away - a new ReplicaSet is created with the v2 spec and desired count of 3.

The three existing Pods remain until new Pods are ready, then they're terminated - the v1 ReplicaSet is gradually scaled down to 0.
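That behaviour comes from the strategy settings in the updated Deployment. The actual values are in the labs/rollouts/specs/vweb/update-fast spec, but a fast rollout like this will be along these lines:

  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 3            # all 3 replacement Pods can be created up front
      maxUnavailable: 0      # old Pods are only removed once new Pods are ready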

You can see the update happening in the ReplicaSets:

kubectl get rs -l app=vweb

You can try the app while the rollout is happening:

curl localhost:30018/v.txt

All the v1 and v2 Pods match the Service selector so you'll get load-balanced responses from both versions.

Staged rollouts require the app to support multiple versions running concurrently, and a fast rollout like this needs spare capacity in the cluster.

Slow staged rollouts

Rollouts aren't a separate Kubernetes object, but you can manage a Pod controller's rollouts with kubectl.

Check the rollouts for the Deployment:

kubectl rollout history deploy/vweb

📋 Rollback the update, so the app is at v1 again - without applying any YAML.

The rollout command has several subcommands:

kubectl rollout --help

Use undo to roll back to the previous Pod spec:

kubectl rollout undo deploy/vweb

The rollback uses the new custom rollout strategy - 3 new v1 Pods come online straight away, and the v2 Pods are replaced when the v1 Pods are running.

Describe the Deployment and you'll see that the rolling update strategy hasn't changed. A rollback reverts to the previous Pod spec, not to the previous spec of the Deployment.
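You can check that with describe, or query the strategy field directly:

kubectl describe deploy vweb

kubectl get deploy vweb -o jsonpath='{.spec.strategy}'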

Now that we're back at v1, we can see what happens with a slower rollout strategy:

Apply the new update:

kubectl apply -f labs/rollouts/specs/vweb/update-slow

📋 How is this rollout different? Are both versions running concurrently for a longer or shorter period?

This rollout updates 1 Pod at a time - a v2 Pod is created, and a v1 Pod is removed when the v2 Pod comes online.

This is a much slower rollout, because Pods are replaced consecutively. Both app versions are running while the rollout happens, but for a much longer period.
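The strategy for a one-at-a-time rollout will look roughly like this - the actual spec is in labs/rollouts/specs/vweb/update-slow:

  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 1            # only 1 extra Pod is created at a time
      maxUnavailable: 0      # an old Pod is only removed when its replacement is ready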

Big-bang rollouts

Not all apps support running different versions during a rollout. In that case you can configure a big-bang update, where all Pods are replaced immediately instead of using a staged rollout.

With this strategy the existing ReplicaSet will be scaled down to 0 and then a new ReplicaSet will be created with a desired scale of 3. This is not good if there's a problem with the new release - which there is with this app.
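The Recreate strategy has no extra settings - the spec just switches the strategy type, roughly like this:

  strategy:
    type: Recreate           # all old Pods are removed before any new Pods are created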

Deploy the update and check on the Pod status in your watch window:

kubectl apply -f labs/rollouts/specs/vweb/update-broken

📋 What happens with the new Pods? Is the app still available?

All the existing Pods are terminated and then new ones are created.

There's a problem with those Pods - the image is broken because it has a bad startup command (you'll see that in the Pod logs).
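You can check the logs for all the Pods matching the app label:

kubectl logs -l app=vweb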

The new Pods will never enter the Running state - they'll go into CrashLoopBackOff after a while.

With 0 Pods ready, there are no endpoints in the Service and the app is unavailable.

Be careful using the Recreate strategy - a bad update will take your application offline:

curl localhost:30018/v.txt

There is no automatic rollback in Kubernetes. Updates need to be monitored and failed releases manually rolled back.
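The rollout status command is a simple way to monitor an update - it blocks until the rollout completes, and exits with an error if it hasn't finished within the timeout:

kubectl rollout status deploy/vweb --timeout=30s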

📋 Roll back to the previous release. Watch the Pod changes - how quickly does the app come back online?

Check the history and roll back to the previous version:

kubectl rollout history deploy/vweb

kubectl rollout undo deploy/vweb

All the failing Pods are terminated, and then the new Pods are started. They use the previous Pod spec so the app doesn't come online until the init containers have run.

The rollback doesn't change the update strategy, so the Deployment is still set to use Recreate.

Lab

An alternative update strategy is a blue-green deployment, where you have two versions of your app running, but only one receives traffic. It's simple to do that with two Deployments and one Service - you change the label selector on the Service to switch between the blue and green releases.
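The switch is just a change to the Service's label selector. A sketch of that Service might look like this - the slot label name here is an assumption, the real labels are in the lab's Helm chart:

apiVersion: v1
kind: Service
metadata:
  name: vweb
spec:
  ports:
    - port: 80
  selector:
    app: vweb
    slot: blue               # change to green to send traffic to the other Deployment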

This lab uses Helm for a blue-green update. Start by deploying the Helm chart for the simple web app:

helm install vweb labs/rollouts/helm/vweb

Browse to the app and refresh - you'll see it flickers between the blue and green releases. Hmm. The goal is to fix that so you can switch releases with a simple update like this:

helm upgrade --set activeSlot=green # etc. 

To make that work you'll need to fix the chart templates.

If you get that far, then you can experiment with automatic rollbacks - which Helm does support.

When the green release is live, try updating the blue release to use the bad kiamol/ch09-vweb:v3 image - using one Helm command which will rollback if the update isn't successful within 30 seconds.

Stuck? Try hints or check the solution.


Cleanup

Remove the Helm chart from the lab:

helm uninstall vweb

And all the other resources from the exercises:

kubectl delete ds,sts,deploy,svc -l kubernetes.courselabs.co=rollouts

EXTRA Rollouts for other Pod controllers

DaemonSets and StatefulSets also use staged rollouts, but they have different configuration options.

We'll use a new app for this. In a split terminal, run a watch to see the Nginx Pods as they come online:

# there won't be any Pods to start with:
kubectl get po -l app=nginx --watch

DaemonSets are upgraded one node at a time, so by default Pods are taken down and replaced individually:

Create the DaemonSet with default update settings:

kubectl apply -f labs/rollouts/specs/nginx-daemonset

The v1.20 update bumps the image version and switches the update strategy to OnDelete:

kubectl apply -f labs/rollouts/specs/nginx-daemonset/update-ondelete

📋 What happens to the Nginx Pod? How can you trigger the update to start?

Nothing happens - the original Pod is not replaced.

The update strategy means Pods won't be replaced until they're explicitly deleted:

kubectl delete po -l app=nginx

When the old Pod has terminated, the new one is created.

The OnDelete strategy lets you control when Pods are replaced, but the replacement Pods still get the updated spec automatically.
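In the YAML that's just a change to the strategy type - the real spec is in labs/rollouts/specs/nginx-daemonset/update-ondelete, but the relevant section is roughly:

  updateStrategy:
    type: OnDelete           # the new Pod spec is only applied when you delete existing Pods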

StatefulSets have another variation on update strategies. By default the Pods are replaced consecutively, starting from the last Pod in the set and working backwards to the first.

Remove the DaemonSet and create the StatefulSet. If your watch is still running you'll see the old Pod removed and three new Pods created:

kubectl delete ds nginx

kubectl apply -f labs/rollouts/specs/nginx-statefulset

This is a StatefulSet, so the Pods have predictable names: nginx-0, nginx-1 and nginx-2.

The v1.20 update uses a partitioned rollout.
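A partition sets the Pod index where the rollout stops - Pods with an index at or above the partition are updated, the rest are left alone. The strategy section will be along these lines (the actual value is in the lab's YAML):

  updateStrategy:
    type: RollingUpdate
    rollingUpdate:
      partition: 2           # only Pods with index 2 and above get the new spec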

Deploy the update:

kubectl apply -f labs/rollouts/specs/nginx-statefulset/update-partition

📋 Which Pods get updated? How would you continue with a full rollout?

The partitioned update stops the rollout at the specified Pod index - only Pod 2 gets replaced.

To continue the rollout you would need to update the partition in the YAML spec and deploy the change, or update the object directly with a patch:

# on macOS/Linux:
kubectl patch statefulset nginx -p '{"spec":{"updateStrategy":{"type":"RollingUpdate","rollingUpdate":{"partition":1}}}}'

# OR on Windows - you need to escape the quotes:
kubectl patch sts nginx -p '{""spec"":{""updateStrategy"":{""type"":""RollingUpdate"",""rollingUpdate"":{""partition"":1}}}}'