Exploring Upgrade Strategies for Stateful Sets in Kubernetes

Ajay Nemade

Cloud & DevOps

Tags:

google cloud platform

devops

kubernetes

google container engine

continuous integration

Introduction

In the age of continuous delivery and agility where the software is being deployed 10s of times per day and sometimes per hour as well using container orchestration platforms, a seamless upgrade mechanism becomes a critical aspect of any technology adoption, Kubernetes being no exception.

Kubernetes provides a variety of controllers that define how pods are set up and deployed within the Kubernetes cluster. These controllers can group pods together according to their runtime needs and can be used to define pod replication and pod startup ordering. Kubernetes controllers are nothing but an application pattern. The controller controls the pods(smallest unit in Kubernetes), so, you don’t need to create, manage and delete the pods. There are few types of controllers in Kubernetes like,

Each controller represents an application pattern. For example, Deployment represents the stateless application pattern in which you don’t store the state of your application. Statefulset represents the statefulset application pattern where you store the data, for example, databases, message queues. We will be focusing on Statefulset controller and its update feature in this blog.

Statefulset

The StatefulSet acts as a controller in Kubernetes to deploy applications according to a specified rule set and is aimed towards the use of persistent and stateful applications. It is an ordered and graceful deployment. Statefulset is generally used with a distributed applications that require each node to have a persistent state and the ability to configure an arbitrary number of nodes. StatefulSet pods have a unique identity that is comprised of an ordinal, a stable network identity, and stable storage. The identity sticks to the pod, regardless of which node it’s scheduled on. For more details check here.

Update Strategies FOR STATEFULSETS

There are a couple of different strategies available for upgrades - Blue/Green and Rolling updates. Let's review them in detail:

Blue-Green Deployment : Blue-green deployment is one of the commonly used update strategies. There are 2 identical environments of your application in this strategy. One is the Blue environment which is running the current deployment and the Green environment is the new deployment to which we want to upgrade. The approach is simple:

Switch the load balancer to route traffic to the Green environment.
Delete the Blue environment once the Green environment is verified.

Disadvantages of Blue-Green deployment:

One of the disadvantages of this strategy is that all current transactions and sessions will be lost, due to the physical switch from one machine serving the traffic to another one.
Implementing blue-green deployment become complex with the database, especially if, the database schema changes across version.
In blue-green deployment, you need the extra cloud setup/hardware which increases the overall costing.

Rolling update strategy

After Blue-Green deployment, let's take a look at Rolling updates and how it works.

In short, as the name suggests this strategy replaces currently running instances of the application with new instances, one by one.
In this strategy, health checks play an important role i.e. old instances of the application are removed only if new version are healthy. Due to this, the existing deployment becomes heterogeneous while moving from the old version of the application to new version.
The benefit of this strategy is that its incremental approach to roll out the update and verification happens in parallel while increasing traffic to the application.
In rolling update strategy, you don’t need extra hardware/cloud setup and hence it’s cost-effective technique of upgrade.

Statefulset upgrade strategies

With the basic understanding of upgrade strategies, let's explore the update strategies available for Stateful sets in Kubernetes. Statefulsets are used for databases where the state of the application is the crucial part of the deployment. We will take the example of Cassandra to learn about statefulset upgrade feature. We will use the gce-pd storage to store the data. StatefulSets(since Kubernetes 1.7) uses an update strategy to configure and disable automated rolling updates for containers, labels, resource request/limits, and annotations for its pods. The update strategy is configured using the updateStrategy field.

The updateStrategy field accepts one of the following value

OnDelete
RollingUpdate

OnDelete update strategy

OnDelete prevents the controller from automatically updating its pods. One needs to delete the pod manually for the changes to take effect. It’s more of a manual update process for the Statefulset application and this is the main difference between OnDelete and RollingUpdate strategy. OnDelete update strategy plays an important role where the user needs to perform few action/verification post the update of each pod. For example, after updating a single pod of Cassandra user might need to check if the updated pod joined the Cassandra cluster correctly.

We will now create a Statefulset deployment first. Let’s take a simple example of Cassandra and deploy it using a Statefulset controller. Persistent storage is the key point in Statefulset controller. You can read more about the storage class here.

For the purpose of this blog, we will use the Google Kubernetes Engine.

First, define the storage class as follows:

CODE: https://gist.github.com/velotiotech/d505e83860d17fb22174619e90ca4513.js

Then create the Storage class using kubectl:

CODE: https://gist.github.com/velotiotech/c87730c207895f768f937f17550a9a3d.js

Here is the YAML file for the Cassandra service and the Statefulset deployment.

CODE: https://gist.github.com/velotiotech/88aa10205f96e932ffbcb40f556379c4.js

Let's create the Statefulset now.

CODE: https://gist.github.com/velotiotech/09b622b23f1f4b9e1d648571231f618c.js

After creating Cassandra Statefulset, if you check the running pods then you will find something like,

CODE: https://gist.github.com/velotiotech/c40dc8319b8c61a052409afbfdc8c286.js

Check if Cassandra cluster is formed correctly using following command:

CODE: https://gist.github.com/velotiotech/dd50d193a18cad4373698909b6964c93.js

Let’s describe the running pod first before updating. Look for the image field in the output of the following command

CODE: https://gist.github.com/velotiotech/a2ff0b6ad5f26f740f0e35e72aa6d390.js

The Image field will show gcr.io/google-samples/cassandra:v12 . Now, let’s patch the Cassandra statefulset with the latest image to which we want to update. The latest image might contain the new Cassandra version or database schema changes. Before upgrading such crucial components, it’s always safe to have the backup of the data,

CODE: https://gist.github.com/velotiotech/52556ee2653e650e3dc51d4c0031f0d3.js

You will see output as `statefulset.apps "cassandra" patched`, but controller won’t update the running pod automatically in this strategy. You need to delete the pods once and wait till pods with new configuration comes up. Let’s try deleting the cassandra-0 pod.

CODE: https://gist.github.com/velotiotech/7d4d4ee23ae3de4b25b8fc516f0c4857.js

Wait till cassandra-0 comes up in running state and then check if the cassandra-0 is running with intended/updated image i.e. gcr.io/google-samples/cassandra:v13 Now, cassandra-0 is running the new image while cassandra-1 and cassandra-2 are still running the old image. You need to delete these pods for the new image to take effect in this strategy.

Rolling update strategy

Rolling update is an automated update process. In this, the controller deletes and then recreates each of its pods. Pods get updated one at a time. While updating, the controller makes sure that an updated pod is running and is in ready state before updating its predecessor. The pods in the StatefulSet are updated in reverse ordinal order(same as pod termination order i.e from the largest ordinal to the smallest)

For the rolling update strategy, we will create the Cassandra statefulset with the .spec.updateStrategy field pointing to RollingUpdate.

CODE: https://gist.github.com/velotiotech/f26c6d44549947ed8b683c8e98acfc17.js

To try the rolling update feature, we can patch the existing statefulset with the updated image.

CODE: https://gist.github.com/velotiotech/694cee47437dac49d77715b66bf0ebb8.js

Once you execute the above command, monitor the output of the following command,

CODE: https://gist.github.com/velotiotech/185c8bdad27346f2607415d79841b3ce.js

In the case of failure in update process, controller restores any pod that fails during the update to its current version i.e. pods that have already received the update will be restored to the updated version, and pods that have not yet received the update will be restored to the previous version.

Partitioning a RollingUpdate (Staging an Update)

The updateStrategy contains one more field for partitioning the RollingUpdate. If a partition is specified, all pods with an ordinal greater than or equal to that of the provided partition will be updated and the pods with an ordinal that is less than the partition will not be updated. If the pods with an ordinal value less than the partition get deleted, then those pods will get recreated with the old definition/version. This partitioning rolling update feature plays important role in the scenario where if you want to stage an update, roll out a canary, or perform a phased rollout.

RollingUpdate supports partitioning option. You can define the partition parameter in the .spec.updateStrategy

CODE: https://gist.github.com/velotiotech/4921fe3b30033f5f3f6ff4d9e2fe8d98.js

In the above command, we are giving partition value as 2, which will patch the Cassandra statefulset in such a way that, whenever we try to update the Cassandra statefulset, it will update the cassandra-2 pod only. Let’s try to patch the updated image to existing statefulset.

CODE: https://gist.github.com/velotiotech/35062df4a7d6c4e11505b6bccc854fa5.js

After patching, watch the following command output,

CODE: https://gist.github.com/velotiotech/136e09e208fe6e42ede82acf4598391d.js

You can keep decrementing the partition value and that many pods will keep taking the effect of the applied patch. For example, if you patch the statefulset with partition=0 then all the pods of the Cassandra statefulset will get updated with provided upgrade configuration.

Verifying if the upgrade was successful

Verifying the upgrade process of your application is the important step to conclude the upgrade. This step might differ as per the application. Here, in the blog we have taken the Cassandra example, so we will verify if the cluster of the Cassandra nodes is being formed properly.

Use `nodetool status` command to verify the cluster. After upgrading all the pods, you might want to run some post-processing like migrating schema if your upgrade dictates that etc.

As per the upgrade strategy, verification of your application can be done by following ways.

In OnDelete update strategy, you can keep updating pod one by one and keep checking the application status to make sure the upgrade working fine.
In RollingUpdate strategy, you can check the application status once all the running pods of your application gets upgraded.

For Cassandra like application, OnDelete update is more preferred than RollingUpdate. In rolling update, we saw that Cassandra pod gets updated one by one, starting from high to low ordinal index. There might be the case where after updating 2 pods, Cassandra cluster might go in failed state but you can not recover it like the OnDelete strategy. You have to try to recover Cassandra once the complete upgrade is done i.e. once all the pods get upgraded to provided image. If you have to use the rolling update then try partitioning the rolling update.

Conclusion

In this blog, we went through the Kubernetes controllers and mainly through statefulsets. We learnt about the differences between blue-green deployment and rolling update strategies then we played with the Cassandra statefulset example and successfully upgraded it with update strategies like OnDelete and RollingUpdate. Do let us know if you have any questions, queries and additional thoughts in the comments section below.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Exploring Upgrade Strategies for Stateful Sets in Kubernetes

Introduction

Statefulset

Update Strategies FOR STATEFULSETS

There are a couple of different strategies available for upgrades - Blue/Green and Rolling updates. Let's review them in detail:

Switch the load balancer to route traffic to the Green environment.
Delete the Blue environment once the Green environment is verified.

Disadvantages of Blue-Green deployment:

One of the disadvantages of this strategy is that all current transactions and sessions will be lost, due to the physical switch from one machine serving the traffic to another one.
Implementing blue-green deployment become complex with the database, especially if, the database schema changes across version.
In blue-green deployment, you need the extra cloud setup/hardware which increases the overall costing.

Rolling update strategy

After Blue-Green deployment, let's take a look at Rolling updates and how it works.

In short, as the name suggests this strategy replaces currently running instances of the application with new instances, one by one.
In this strategy, health checks play an important role i.e. old instances of the application are removed only if new version are healthy. Due to this, the existing deployment becomes heterogeneous while moving from the old version of the application to new version.
The benefit of this strategy is that its incremental approach to roll out the update and verification happens in parallel while increasing traffic to the application.
In rolling update strategy, you don’t need extra hardware/cloud setup and hence it’s cost-effective technique of upgrade.

Statefulset upgrade strategies

The updateStrategy field accepts one of the following value

OnDelete
RollingUpdate

OnDelete update strategy

For the purpose of this blog, we will use the Google Kubernetes Engine.

First, define the storage class as follows:

CODE: https://gist.github.com/velotiotech/d505e83860d17fb22174619e90ca4513.js

Then create the Storage class using kubectl:

CODE: https://gist.github.com/velotiotech/c87730c207895f768f937f17550a9a3d.js

Here is the YAML file for the Cassandra service and the Statefulset deployment.

CODE: https://gist.github.com/velotiotech/88aa10205f96e932ffbcb40f556379c4.js

Let's create the Statefulset now.

CODE: https://gist.github.com/velotiotech/09b622b23f1f4b9e1d648571231f618c.js

After creating Cassandra Statefulset, if you check the running pods then you will find something like,

CODE: https://gist.github.com/velotiotech/c40dc8319b8c61a052409afbfdc8c286.js

Check if Cassandra cluster is formed correctly using following command:

CODE: https://gist.github.com/velotiotech/dd50d193a18cad4373698909b6964c93.js

Let’s describe the running pod first before updating. Look for the image field in the output of the following command

CODE: https://gist.github.com/velotiotech/a2ff0b6ad5f26f740f0e35e72aa6d390.js

The Image field will show gcr.io/google-samples/cassandra:v12 . Now, let’s patch the Cassandra statefulset with the latest image to which we want to update. The latest image might contain the new Cassandra version or database schema changes. Before upgrading such crucial components, it’s always safe to have the backup of the data,

CODE: https://gist.github.com/velotiotech/52556ee2653e650e3dc51d4c0031f0d3.js

CODE: https://gist.github.com/velotiotech/7d4d4ee23ae3de4b25b8fc516f0c4857.js

Wait till cassandra-0 comes up in running state and then check if the cassandra-0 is running with intended/updated image i.e. gcr.io/google-samples/cassandra:v13 Now, cassandra-0 is running the new image while cassandra-1 and cassandra-2 are still running the old image. You need to delete these pods for the new image to take effect in this strategy.

Rolling update strategy

For the rolling update strategy, we will create the Cassandra statefulset with the .spec.updateStrategy field pointing to RollingUpdate.

CODE: https://gist.github.com/velotiotech/f26c6d44549947ed8b683c8e98acfc17.js

To try the rolling update feature, we can patch the existing statefulset with the updated image.

CODE: https://gist.github.com/velotiotech/694cee47437dac49d77715b66bf0ebb8.js

Once you execute the above command, monitor the output of the following command,

CODE: https://gist.github.com/velotiotech/185c8bdad27346f2607415d79841b3ce.js

Partitioning a RollingUpdate (Staging an Update)

RollingUpdate supports partitioning option. You can define the partition parameter in the .spec.updateStrategy

CODE: https://gist.github.com/velotiotech/4921fe3b30033f5f3f6ff4d9e2fe8d98.js

CODE: https://gist.github.com/velotiotech/35062df4a7d6c4e11505b6bccc854fa5.js

After patching, watch the following command output,

CODE: https://gist.github.com/velotiotech/136e09e208fe6e42ede82acf4598391d.js

Verifying if the upgrade was successful

Use `nodetool status` command to verify the cluster. After upgrading all the pods, you might want to run some post-processing like migrating schema if your upgrade dictates that etc.

As per the upgrade strategy, verification of your application can be done by following ways.

In OnDelete update strategy, you can keep updating pod one by one and keep checking the application status to make sure the upgrade working fine.
In RollingUpdate strategy, you can check the application status once all the running pods of your application gets upgraded.

Conclusion

google cloud platform

devops

kubernetes

google container engine

continuous integration

About the Author

Did you like the blog? If yes, we're sure you'll also like to work with the people who write them - our best-in-class engineering team.

We're looking for talented developers who are passionate about new emerging technologies. If that's you, get in touch with us.

Explore current openings

Subscribe to get the latest technology updates

Exploring Upgrade Strategies for Stateful Sets in Kubernetes

Ajay Nemade

Introduction

Statefulset

Update Strategies FOR STATEFULSETS

Rolling update strategy

Statefulset upgrade strategies

OnDelete update strategy

Rolling update strategy

Partitioning a RollingUpdate (Staging an Update)

Verifying if the upgrade was successful

Conclusion

MORE POSTS BY THIS AUTHOR

Ajay Nemade

You may also like

Shebang Your Shell Commands with GenAI using AWS Bedrock

Sagar Barai

🐉 Taming the OpenStack Beast – A Fun & Easy Guide!

Shruti Anekar

Linux Internals of Kubernetes Networking

Shiwam Jaiswal

Exploring Upgrade Strategies for Stateful Sets in Kubernetes

Introduction

Statefulset

Update Strategies FOR STATEFULSETS

Rolling update strategy

Statefulset upgrade strategies

OnDelete update strategy

Rolling update strategy

Partitioning a RollingUpdate (Staging an Update)

Verifying if the upgrade was successful

Conclusion

About the Author

Did you like the blog? If yes, we're sure you'll also like to work with the people who write them - our best-in-class engineering team.

We're looking for talented developers who are passionate about new emerging technologies. If that's you, get in touch with us.

About Velotio

Subscribe to get the latest technology updates

Related Posts

Shebang Your Shell Commands with GenAI using AWS Bedrock

🐉 Taming the OpenStack Beast – A Fun & Easy Guide!

Linux Internals of Kubernetes Networking

Strategies for Cost Optimization Across Amazon EKS Clusters

Mastering Prow: A Guide to Developing Your Own Plugin for Kubernetes CI/CD Workflow

Simplifying MySQL Sharding with ProxySQL: A Step-by-Step Guide

Streamline Kubernetes Storage Upgrades

Unlocking Key Insights in NATS Development: My Journey from Novice to Expert - Part 1

Unveiling the Magic of Kubernetes: Exploring Pod Priority, Priority Classes, and Pod Preemption

How to deploy GitHub Actions Self-Hosted Runners on Kubernetes

How to Setup HashiCorp Vault HA Cluster with Integrated Storage (Raft)

How To Get Started With Logging On Kubernetes?

Create CI/CD Pipeline in GitLab in under 10 mins

Acquiring Temporary AWS Credentials with Browser Navigated Authentication

How to Avoid Screwing Up CI/CD: Best Practices for DevOps Team

How to Make Your Terminal More Productive with Z-Shell (ZSH)

Setting Up A Robust Authentication Environment For OpenSSH Using QR Code PAM

Hacking Your Way Around AWS IAM Roles

Monitoring a Docker Container with Elasticsearch, Kibana, and Metricbeat

Autoscaling in Kubernetes using HPA and VPA

Managing a TLS Certificate for Kubernetes Admission Webhook

Prow + Kubernetes - A Perfect Combination To Execute CI/CD At Scale

Building A Containerized Microservice in Golang: A Step-by-step Guide

Kubernetes Migration: How To Move Data Freely Across Clusters

OPA On Kubernetes: An Introduction For Beginners

To Go Serverless Or Not Is The Question

Ensure Continuous Delivery On Kubernetes With GitOps’ Argo CD

How To Implement Chaos Engineering For Microservices Using Istio

Helm 3: A More Secured and Simpler Kubernetes Package Manager

An Introduction To Cloudflare Workers And Cloudflare KV store

Getting Started With Kubernetes Operators (Golang Based) - Part 3

Getting Started With Kubernetes Operators (Ansible Based) - Part 2

Getting Started With Kubernetes Operators (Helm Based) - Part 1

How to Write Jenkinsfile for Angular and .Net Based Applications

Kubernetes CSI in Action: Explained with Features and Use Cases

A Comprehensive Tutorial to Implementing OpenTracing With Jaeger

The Ultimate Guide to Disaster Recovery for Your Kubernetes Clusters

Know Everything About Spinnaker & How to Deploy Using Kubernetes Engine

Mesosphere DC/OS Masterclass : Tips and Tricks to Make Life Easier

Managing Secrets Using AWS Systems Manager Parameter Store and IAM Roles

Taking Amazon's Elastic Kubernetes Service for a Spin