Automating the F5 devices backup and restore tasks
Scenario: F5 device backup
Benefits
Automating the F5 device backup and restore tasks provides the following benefits:
- Minimises downtime
- Reduces the scope of errors and requires minimal or no manual intervention
- Streamlines recovery
The F5 device automated backup process performs the following functions:
- Proactively identifies issues via automated health check report.
- Creates backup files of each F5 device configuration.
- Provides user-based pipelines to restore configuration and minimize the occurrence of outages.
Automation workflow
The process automation team creates an automation workflow, a set of automated tasks that streamline the backup process.
The following graphic shows the automated backup process. A Jenkins pipeline runs every day and performs the backup.
The automated backup process performs the following actions:
- Discovers all F5 devices within the cloud-based SaaS environment.
- Performs daily health checks for all F5 devices across every data center.
- Generates a priority 1 ticket for the network team on detecting any unhealthy F5 device.
- Creates backups of the UCS files for all F5 devices.
- If an issue is observed with the automation process, it creates a high incident and assigns it to the network team.
- Downloads the backup files and stores them in Go Git Service (GOGS), a self-hosted Git service.
- Deletes backup files that are older than the retention period.
The retention period is set to 15 days.
To recover the configuration of an F5 device after a failure:
If an F5 device failure occurs, an IT team member submits a request after providing the F5 device details. The configuration is restored by a user-based Jenkins pipeline.
The following graphic shows the automated recovery process:
The automated process performs the following steps:
- Creates a work order.
- Downloads the UCS file for the F5 device from GOGS.
- Uploads the file to the F5 device where the failure occurred.
- If the restore operation is successful, updates the status of the work order status is updated, and notifies the user and the network team.
- If the restore operation fails, the workflow performs the following steps:
- Creates a high-priority incident.
- Assigns the incident to the network team.
- Updates the work order with the incident details.
Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*