Troubleshooting upgrade issues
Consult this topic for information about troubleshooting issues related to the product upgrade.
Issue with the ingress controller
If you face an issue with the ingress controller during the upgrade, run the following command:
NAME WEBHOOKS AGE
ingress-nginx-admission 1 18h
$ kubectl delete -A ValidatingWebhookConfiguration ingress-nginx-admission
The Patroni PostgreSQL Server does not work
Scope
After deploying the Patroni PostgreSQL Server the status of the primary Patroni PG pod might not change to Ready and the pod continues to look for the master pod. You can see these details in the pod logs.
This issue occurs because the previous data cleanup was not successful.
Workaround
Delete Patroni endpoints. Perform the following steps to do this:
Kuberenetes
Check whether Patroni endpoints exist. To get a list of endpoints, run the following command:
kubectl get ep -n <NAMESPACE> | grep bmcExample output:
postgres-bmc-pg-ha 10.42.21.182:5432 30h
postgres-bmc-pg-ha-config <none> 30h
postgres-bmc-pg-ha-pool 10.42.17.71:9999,10.42.20.108:9999 30h
postgres-bmc-pg-ha-repl 10.42.11.118:5432,10.42.41.13:5432 30hIf the endpoints exist, delete the Patroni endpoints. Run the following command:
kubectl delete ep -n <NAMESPACE> postgres-bmc-pg-ha postgres-bmc-pg-ha-config postgres-bmc-pg-ha-pool postgres-bmc-pg-ha-repl
OpenShift
Get the configmap. In OpenShift, the Patroni endpoints are set to false, so the configmap is used by Patroni. Run the following command to get the configmap:
oc get cm | grep bmcExample output:
oc get cm | grep bmc
postgres-bmc-pg-ha-config 0 17h
postgres-bmc-pg-ha-leader 0 17h
postgres-bmc-pg-ha-pgconfig 1 17h
postgres-bmc-pg-ha-pgpoolconfig 1 17hDelete the configmap. Run the following command:
oc delete cm postgres-bmc-pg-ha-config postgres-bmc-pg-ha-leader postgres-bmc-pg-ha-pgconfig postgres-bmc-pg-ha-pgpoolconfig
The PostgreSQL database migration is not successful
- If the migration fails, perform the following steps:
Delete the Patroni PostgreSQL by running the following command:
helm delete <releasename> -n <namespace>Delete the pgdata-postgres-bmc-pg-ha-* PVCs by running the following command:
kubectl delete pvc pgdata-postgres-bmc-pg-ha-0 pgdata-postgres-bmc-pg-ha-1 pgdata-postgres-bmc-pg-ha-2 -n <namespace>Get the endpoint by running the following command:
kubectl get ep -n <namespace> | grep postgresDelete the endpoint by running the following command:
kubectl delete ep postgres-xxxxx -n <name- Rerun the database migration.
For instructions, see step1 and step 2 under 'To migrate data from Bitnami PostgreSQL to Patroni PostgreSQL' section in the Upgrading BMC Helix IT Operations Management from version 22.4 to 23.1.02 topic.
Upgrade fails because of a Kafka exporter issue
Scope
The upgrade to version 22.4 fails because of a Kafka exporter issue. The following error is displayed:
{"app.kubernetes.io/component":"cluster-metrics", "app.kubernetes.io/instance":"kafka", "app.kubernetes.io/name":"kafka"}
, MatchExpressions:[]v1.LabelSelectorRequirement(nil)}: field is immutable
Workaround
Get the endpoint by running the following command:
kubectl get ep -n <namespace> | grep postgresDelete the endpoint by running the following command:
kubectl delete ep postgres-xxxxx -n <namespace>
Obtain the Kafka exporter details by running the following command:
kubectl get deployment -n <namespace> | grep Kafka-exporterDelete the Kafka exporter deployment by running the following command:
kubectl delete deployment -n <namespace> Kafka-exporter- Run the upgrade process again. For more information, see the Step 3: To upgrade the product section in Upgrading-BMC-Helix-IT-Operations-Management-from-version-22-4-to-23-1-02.
Upgrade fails because of a Kafka status check issue
Scope
The upgrade to version 22.4 fails because the timeout for a Kafka status check is too less.
Workaround:
- Open the helix-on-prem-deployment-manager/commons/infra_check_scripts/kafka/kafka_checker_job.yml file.
- Increase the value of the timeout parameter from 12 to 25.
This value is in seconds. - Save and close the file.
- Open the the helix-on-prem-deployment-manager/commons/infra_check_scripts/kafka/kafka_checker.sh file.
- Increase the value of the sleep parameter from 15 to 30.
This value is in seconds. - Save and close the file.
- Run the upgrade process again. For more information, see the Step 3: To upgrade the product section in Upgrading-BMC-Helix-IT-Operations-Management-from-version-22-4-to-23-1-02.
Upgrade fails because of a Elasticsearch logging
Scope
Deployment manager fails to upgrade Elasticsearch logging from version 22.2.01 to 22.4 and you get the following error:
"Error: UPGRADE FAILED: cannot patch "elasticsearch-logging-kibana" with kind PersistentVolumeClaim"
Workaround.
Clean up the old deployment for logging from the bmc-helix-logging namespace and deploy the new version of BMC Helix Logging. For more information , see 000407351