This article discusses etcd backups for Red Hat OpenShift 4.x clusters in hybrid scenarios. Backing up etcd is a crucial activity for disaster recovery and node failure: etcd is the primary datastore of Kubernetes, so its backups are what allow you to recover the state of the master nodes and of the cluster as a whole. It is recommended to store the backup externally, so that it remains accessible for node restoration even if the nodes themselves, or access to them, become unavailable.
When to back up
Ideally, you should back up the cluster's etcd data regularly and store it in a secure location outside the OpenShift cluster. After a new OpenShift cluster is created, the first certificate rotation happens 24 hours after installation; do not take an etcd backup before this rotation completes, as the backup would contain expired certificates. It is also recommended to take etcd backups during non-peak hours, because an etcd snapshot has a high I/O cost. Finally, be sure to take an etcd backup before and after any cluster upgrade.
How to back up
In an OpenShift cluster, an automated backup script for the etcd database is already provided on the master nodes at /usr/local/bin/cluster-backup.sh. To access it, you need to start a debug session with the OpenShift CLI:
oc debug node/<master_node_name>
This logs you in to the master node, where running the script creates a backup in the specified folder. In the following sections, we explain how to automate this process using a CronJob. The CronJob runs on the OpenShift cluster itself and backs up this data from the master nodes on a schedule. Make sure the backups created on the master nodes are cleaned up daily so they do not fill the disk.
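For reference, a one-off manual backup might look like the following minimal sketch; the node name is a placeholder, and the /home/core/backup target directory simply follows the convention used later in this article:
# Open a debug shell on one master node (replace the placeholder with a real node name)
oc debug node/<master_node_name>
# Inside the debug shell, run the backup script against the host filesystem
chroot /host /usr/local/bin/cluster-backup.sh /home/core/backup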
Where to store the backup?
The backup can be stored in any storage outside the cluster, as long as it is reachable from the cluster. In this article we explore storing the etcd backup in Cloud Object Storage, such as S3. It can similarly be stored in the object stores of other clouds, or in NFS and other file shares available on those clouds.
Execution
The next section details the steps required to store the etcd backup on IBM Cloud Object Storage.
Prerequisites
- You have access to the cluster as a user with the cluster-admin role.
- You have created an S3 bucket that is accessible from the cluster.
We will create the following in the OpenShift cluster:
- Namespace.
- Service account.
- Cluster role.
- Cluster role binding.
- AWS S3 secret.
- CronJob.
You can create the namespace from the console or from the OpenShift CLI.
To schedule the etcd backup as a daily CronJob, it is important to create a dedicated namespace. Also, make sure that only cluster-admins have access to this namespace; other team members do not need access to it. See below:
oc new-project etcd-bkp --description "Openshift ETCD Backup" --display-name "ETCD Backup to S3"
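If you want to confirm that no extra users have been granted access to the new namespace, a quick (non-exhaustive) check is to list its role bindings:
oc get rolebindings -n etcd-bkp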
Service account
We will create a service account to run the etcd backup CronJob:
kind: ServiceAccount
apiVersion: v1
metadata:
  name: cronjob-etcd-bkp-sa
  namespace: etcd-bkp
  labels:
    app: cronjob-etcd-backup
oc apply -f service_account.yaml
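Optionally, verify that the service account was created before moving on:
oc get sa cronjob-etcd-bkp-sa -n etcd-bkp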
Cluster role
A cluster role is required to run the pod with the proper privileges. The following YAML creates it:
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: cronjob-etcd-bkp-cr
rules:
  - apiGroups: [""]
    resources:
      - "nodes"
    verbs: ["get","list"]
  - apiGroups: [""]
    resources:
      - "pods"
      - "pods/log"
    verbs: ["get","list","create","delete","watch"]
oc apply -f cluster_role.yaml
Cluster role binding
After creating the role, we need to bind it to the service account we just created. Here is the YAML file to create the cluster role binding:
kind: ClusterRoleBinding
apiVersion: rbac.authorization.k8s.io/v1
metadata:
  name: cronjob-etcd-bkp-crb
  labels:
    app: cronjob-etcd-backup
subjects:
  - kind: ServiceAccount
    name: cronjob-etcd-bkp-sa
    namespace: etcd-bkp
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: cronjob-etcd-bkp-cr
oc apply -f cluster_role_binding.yaml
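To verify that the binding grants the expected permissions, you can impersonate the service account with oc auth can-i (a quick sanity check, not a required step):
oc auth can-i list nodes --as=system:serviceaccount:etcd-bkp:cronjob-etcd-bkp-sa
oc auth can-i create pods --as=system:serviceaccount:etcd-bkp:cronjob-etcd-bkp-sa -n etcd-bkp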
AWS S3 secret
We need to store the AWS access key ID and AWS secret access key for the S3 bucket in a secret. This secret is referenced in the CronJob to access the S3 bucket. Below is a sample manifest for creating it:
apiVersion: v1
kind: Secret
metadata:
  name: aws-s3-etcd-key
  namespace: etcd-bkp
type: Opaque
data:
  aws_access_key_id: <key_id | base64>
  aws_secret_access_key: <access_key | base64>
  region: <bucket_region | base64>
oc apply -f s3_secret.yaml
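If you prefer not to base64-encode the values by hand, an equivalent approach is to let the CLI create the secret from literals; the placeholder values below are yours to fill in:
oc create secret generic aws-s3-etcd-key -n etcd-bkp \
  --from-literal=aws_access_key_id=<key_id> \
  --from-literal=aws_secret_access_key=<access_key> \
  --from-literal=region=<bucket_region>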
CronJob
We can run the CronJob in two ways.
The first way is to schedule the CronJob so that the job runs on a master node, takes the backup on the node itself, and then pushes it to the S3 bucket. Let's explore the CronJob provided below.
In this CronJob, the task runs on a master node because of the node selector:
spec:
  nodeSelector:
    node-role.kubernetes.io/master: ''
It uses the CLI image to initiate the backup:
image: registry.redhat.io/openshift4/ose-cli
It then invokes the backup script:
chroot /host /usr/local/bin/cluster-backup.sh
It creates the backup at /home/core/backup with the date appended to the name:
chroot /host /usr/local/bin/cluster-backup.sh /home/core/backup/$(date "+%F_%H%M%S")
It cleans up older backups:
chroot /host find /home/core/backup/ -mindepth 1 -type d -mmin +2 -exec rm -rf {} \;
Finally, it pushes the backup to the AWS S3 bucket using the AWS CLI image:
aws s3 cp /host/home/core/backup/ s3://ocp-etcd-sync --recursive
kind: CronJob
apiVersion: batch/v1
metadata:
  name: cronjob-etcd-backup
  namespace: etcd-bkp
  labels:
    app.kubernetes.io/name: cronjob-etcd-backup
spec:
  schedule: "* * * * *" # runs every minute; adjust to your desired backup window
  concurrencyPolicy: Forbid
  suspend: false
  jobTemplate:
    metadata:
      labels:
        app.kubernetes.io/name: cronjob-etcd-backup
    spec:
      backoffLimit: 0
      template:
        metadata:
          labels:
            app.kubernetes.io/name: cronjob-etcd-backup
        spec:
          nodeSelector:
            node-role.kubernetes.io/master: ''
          restartPolicy: Never
          activeDeadlineSeconds: 500
          serviceAccountName: cronjob-etcd-bkp-sa
          hostPID: true
          hostNetwork: true
          enableServiceLinks: true
          schedulerName: default-scheduler
          terminationGracePeriodSeconds: 30
          securityContext: {}
          containers:
            - name: cronjob-etcd-backup
              image: registry.redhat.io/openshift4/ose-cli
              terminationMessagePath: /dev/termination-log
              command:
                - /bin/bash
                - '-c'
                - >-
                  echo -e '\n\n---\nCreate etcd backup local to master\n' &&
                  chroot /host /usr/local/bin/cluster-backup.sh /home/core/backup/$(date "+%F_%H%M%S") &&
                  echo -e '\n\n---\nCleanup old local etcd backups\n' &&
                  chroot /host find /home/core/backup/ -mindepth 1 -type d -mmin +2 -exec rm -rf {} \;
              securityContext:
                privileged: true
                runAsUser: 0
                capabilities:
                  add:
                    - SYS_CHROOT
              imagePullPolicy: Always
              volumeMounts:
                - name: host
                  mountPath: /host
              terminationMessagePolicy: File
            - name: aws-cli
              image: amazon/aws-cli:latest
              command:
                - /bin/bash
                - '-c'
                - >-
                  while true; do if [[ -n $(find /host/home/core/backup/ -type d -cmin -1) ]]; then aws s3 cp /host/home/core/backup/ s3://ocp-etcd-sync --recursive; break; fi; done
              env:
                - name: AWS_ACCESS_KEY_ID
                  valueFrom:
                    secretKeyRef:
                      name: aws-s3-etcd-key
                      key: aws_access_key_id
                - name: AWS_SECRET_ACCESS_KEY
                  valueFrom:
                    secretKeyRef:
                      name: aws-s3-etcd-key
                      key: aws_secret_access_key
                - name: AWS_DEFAULT_REGION
                  valueFrom:
                    secretKeyRef:
                      name: aws-s3-etcd-key
                      key: region
              volumeMounts:
                - name: host
                  mountPath: /host
          volumes:
            - name: host
              hostPath:
                path: /
                type: Directory
          dnsPolicy: ClusterFirst
          tolerations:
            - key: node-role.kubernetes.io/master
  successfulJobsHistoryLimit: 5
  failedJobsHistoryLimit: 5
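Save the manifest and apply it the same way as the earlier resources (the filename below is arbitrary), then confirm that the CronJob and its jobs are being created:
oc apply -f etcd_backup_cronjob.yaml
oc get cronjob,jobs -n etcd-bkp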
Another way to initiate the backup is to schedule the CronJob on a worker node; the job then reaches out to each master node to take the backup there. Below is an example that goes to every master node and writes the backup on that node. The backup can subsequently be moved to S3 or to other file storage such as NFS, or to volumes that are themselves backed up.
This backup is scheduled to run every 12 hours. During the backup process, the CronJob also deletes older backups that are no longer required, to avoid filling up storage. It uses the image registry.redhat.io/openshift4/ose-cli; alternatively, you can build your own image using the Red Hat Universal Base Image (UBI) as the base and installing the oc CLI on top of it.
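If you take the custom-image route, a minimal Containerfile could look like the sketch below. The UBI tag, the client mirror URL, and the package choices are assumptions to pin and review for your environment:
# Hypothetical Containerfile: UBI minimal base with the oc CLI installed
FROM registry.access.redhat.com/ubi9/ubi-minimal:latest
# tar and gzip are needed to unpack the client archive; curl ships with the base image
RUN microdnf install -y tar gzip && \
    curl -L https://mirror.openshift.com/pub/openshift-v4/clients/ocp/stable/openshift-client-linux.tar.gz \
      | tar -xzf - -C /usr/local/bin oc kubectl && \
    microdnf clean all
ENTRYPOINT ["/bin/bash"]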
The job below begins the backup at /home/core/backup/ on the master nodes:
---
kind: CronJob
apiVersion: batch/v1
metadata:
  name: cronjob-etcd-backup
  namespace: etcd-bkp
  labels:
    app: ocp-etcd-bkp
spec:
  concurrencyPolicy: Forbid
  schedule: "0 */12 * * *"
  failedJobsHistoryLimit: 5
  successfulJobsHistoryLimit: 5
  jobTemplate:
    metadata:
      labels:
        app: ocp-etcd-bkp
    spec:
      backoffLimit: 0
      template:
        metadata:
          labels:
            app: ocp-etcd-bkp
        spec:
          containers:
            - name: etcd-backup
              image: "registry.redhat.io/openshift4/ose-cli"
              command:
                - "/bin/bash"
                - "-c"
                - oc get no -l node-role.kubernetes.io/master --no-headers -o name | xargs -I {} -- oc debug {} --to-namespace=etcd-bkp -- bash -c 'chroot /host sudo -E /usr/local/bin/cluster-backup.sh /home/core/backup/ && chroot /host sudo -E find /home/core/backup/ -type f -mmin +"1" -delete'
          restartPolicy: Never # required for Job pods; the default "Always" is rejected by the API
          serviceAccountName: "cronjob-etcd-bkp-sa"
          serviceAccount: "cronjob-etcd-bkp-sa"
This job differs from the previous one: it runs on a worker node, lists all the master nodes, and uses oc debug to log in to each of them and start the backup one by one, using the following command:
oc get no -l node-role.kubernetes.io/master --no-headers -o name | xargs -I {} -- oc debug {}
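After either CronJob is applied, you do not have to wait for the schedule to confirm that it works. You can trigger a run manually and follow its pods and logs; the job name below is arbitrary:
oc create job --from=cronjob/cronjob-etcd-backup etcd-backup-manual -n etcd-bkp
oc get pods -n etcd-bkp -w
oc logs job/etcd-backup-manual -n etcd-bkp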