Troubleshooting upgrade issues


Use this topic to troubleshoot issues that occur during the product upgrade.

Issue with the ingress controller

If the ingress controller fails during the upgrade, check whether the leftover ingress-nginx-admission webhook exists, and delete it by running the following commands:

$ kubectl get -A ValidatingWebhookConfiguration
NAME                      WEBHOOKS   AGE
ingress-nginx-admission   1          18h

$ kubectl delete ValidatingWebhookConfiguration ingress-nginx-admission
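The two commands above can be folded into one guard so that the delete runs only when the webhook actually exists. This is a sketch, assuming kubectl access to the cluster; the webhook name is taken from the example output above:

```shell
# Sketch: delete the leftover ingress-nginx admission webhook only if it exists.
# ValidatingWebhookConfiguration objects are cluster-scoped, so no namespace is needed.
delete_ingress_webhook() {
  if kubectl get validatingwebhookconfiguration ingress-nginx-admission >/dev/null 2>&1; then
    kubectl delete validatingwebhookconfiguration ingress-nginx-admission
  fi
}
# Usage: delete_ingress_webhook
```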


The Patroni PostgreSQL Server does not work

Scope

After you deploy the Patroni PostgreSQL Server, the status of the primary Patroni PG pod might not change to Ready, and the pod continues to look for the master pod. You can see these details in the pod logs.

This issue occurs because the previous data cleanup was not successful.


Workaround

To resolve this issue, delete the Patroni endpoints by performing the following steps:

Kubernetes

  1. Check whether Patroni endpoints exist. To get a list of endpoints, run the following command:

    kubectl get ep -n <NAMESPACE> | grep bmc

    Example output:

    postgres-bmc-pg-ha                                                10.42.21.182:5432                                                       30h
    postgres-bmc-pg-ha-config                                         <none>                                                                  30h
    postgres-bmc-pg-ha-pool                                           10.42.17.71:9999,10.42.20.108:9999                                      30h
    postgres-bmc-pg-ha-repl                                           10.42.11.118:5432,10.42.41.13:5432                                      30h
  2. If the endpoints exist, delete the Patroni endpoints. Run the following command:

    kubectl delete ep -n <NAMESPACE> postgres-bmc-pg-ha postgres-bmc-pg-ha-config postgres-bmc-pg-ha-pool postgres-bmc-pg-ha-repl
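Steps 1 and 2 can be combined into a helper that discovers whichever of the endpoints exist and deletes only those. This is a sketch, assuming kubectl access; the bmc-pg-ha name filter comes from the example output above:

```shell
# Sketch: find and delete any existing Patroni endpoints in one pass.
# "-o name" prints "endpoints/<name>", which kubectl delete accepts directly;
# "xargs -r" skips the delete entirely when no endpoints match.
delete_patroni_endpoints() {
  local ns="$1"
  kubectl get ep -n "$ns" -o name | grep bmc-pg-ha | xargs -r kubectl delete -n "$ns"
}
# Usage: delete_patroni_endpoints <NAMESPACE>
```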

OpenShift

  1. Get the configmaps. In OpenShift, Patroni is configured not to use endpoints, so it stores its state in configmaps. Run the following command to get the configmaps:

    oc get cm | grep bmc

    Example output:

    postgres-bmc-pg-ha-config          0      17h
    postgres-bmc-pg-ha-leader          0      17h
    postgres-bmc-pg-ha-pgconfig        1      17h
    postgres-bmc-pg-ha-pgpoolconfig    1      17h


  2. Delete the configmap. Run the following command:

    oc delete cm postgres-bmc-pg-ha-config postgres-bmc-pg-ha-leader postgres-bmc-pg-ha-pgconfig postgres-bmc-pg-ha-pgpoolconfig
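The same cleanup can be done without naming each configmap, by filtering on the shared postgres-bmc-pg-ha prefix shown in the example output above. This is a sketch, assuming oc access and the current project set to the Patroni namespace:

```shell
# Sketch: delete the Patroni configmaps in one pass.
# "-o name" prints "configmap/<name>", which oc delete accepts directly.
delete_patroni_configmaps() {
  oc get cm -o name | grep postgres-bmc-pg-ha | xargs -r oc delete
}
# Usage: delete_patroni_configmaps
```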


The PostgreSQL database migration is not successful

If the migration fails, perform the following steps:

  1. Delete the Patroni PostgreSQL release by running the following command:

    helm delete <releasename> -n <namespace>

  2. Delete the pgdata-postgres-bmc-pg-ha-* PVCs by running the following command:

    kubectl delete pvc pgdata-postgres-bmc-pg-ha-0 pgdata-postgres-bmc-pg-ha-1 pgdata-postgres-bmc-pg-ha-2 -n <namespace>

  3. Get the endpoint by running the following command:

    kubectl get ep -n <namespace> | grep postgres

  4. Delete the endpoint by running the following command:

    kubectl delete ep postgres-xxxxx -n <namespace>

  5. Rerun the database migration.
    For instructions, see Step 1 and Step 2 in Upgrading-BMC-Helix-IT-Operations-Management-from-version-22-4-to-23-2-02.
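The cleanup before the rerun can be sketched as a single helper. The release name, namespace, and PVC names follow the steps above; adjust them if your deployment differs:

```shell
# Sketch: reset the Patroni deployment before rerunning the database migration.
reset_patroni_for_migration() {
  local release="$1" ns="$2"
  # Remove the Helm release and its data volumes
  helm delete "$release" -n "$ns"
  kubectl delete pvc -n "$ns" \
    pgdata-postgres-bmc-pg-ha-0 pgdata-postgres-bmc-pg-ha-1 pgdata-postgres-bmc-pg-ha-2
  # Remove any leftover postgres endpoints
  kubectl get ep -n "$ns" -o name | grep postgres | xargs -r kubectl delete -n "$ns"
}
# Usage: reset_patroni_for_migration <releasename> <namespace>
```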


Upgrade fails because of a Kafka exporter issue

Scope

The upgrade to version 22.4 fails because of a Kafka exporter issue. The following error is displayed:

UPGRADE FAILED: cannot patch "kafka-exporter" with kind Deployment: Deployment.apps "kafka-exporter" is invalid: spec.selector: Invalid value: v1.LabelSelector{MatchLabels:map[string]string{"app.kubernetes.io/component":"cluster-metrics", "app.kubernetes.io/instance":"kafka", "app.kubernetes.io/name":"kafka"}, MatchExpressions:[]v1.LabelSelectorRequirement(nil)}: field is immutable


Workaround

  1. Obtain the Kafka exporter details by running the following command:

kubectl get deployment -n <namespace> | grep kafka-exporter
  2. Delete the Kafka exporter deployment by running the following command:

kubectl delete deployment -n <namespace> kafka-exporter
  3. Run the upgrade process again. For more information, see the Step 3: To upgrade the product section in Upgrading-BMC-Helix-IT-Operations-Management-from-version-22-4-to-23-2-02.
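Because spec.selector is immutable in Kubernetes, the failing deployment must be deleted and recreated by the upgrade rather than patched in place. The steps above can be sketched as, assuming kubectl access (the deployment name comes from the error message):

```shell
# Sketch: remove the kafka-exporter deployment so that rerunning the upgrade
# recreates it with the new (immutable) label selector.
recreate_kafka_exporter() {
  local ns="$1"
  if kubectl get deployment -n "$ns" kafka-exporter >/dev/null 2>&1; then
    kubectl delete deployment -n "$ns" kafka-exporter
  fi
  # Rerun the upgrade afterwards so Helm recreates the deployment.
}
# Usage: recreate_kafka_exporter <namespace>
```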


Upgrade fails because of a Kafka status check issue

Scope

The upgrade to version 22.4 fails because the timeout for a Kafka status check is too short.

Workaround

  1. Open the helix-on-prem-deployment-manager/commons/infra_check_scripts/kafka/kafka_checker_job.yml file.
  2. Increase the value of the timeout parameter from 12 to 25.
    This value is in seconds.
  3. Save and close the file.
  4. Open the helix-on-prem-deployment-manager/commons/infra_check_scripts/kafka/kafka_checker.sh file.
  5. Increase the value of the sleep parameter from 15 to 30.
    This value is in seconds.
  6. Save and close the file.
  7. Run the upgrade process again. For more information, see the Step 3: To upgrade the product section in Upgrading-BMC-Helix-IT-Operations-Management-from-version-22-4-to-23-2-02.
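Steps 1 through 6 can also be done non-interactively. The sketch below assumes the values 12 and 15 appear literally on lines containing `timeout` and `sleep` in those two files; verify the file contents before running it:

```shell
# Sketch: raise the Kafka status-check timeouts in place.
# The sed patterns are assumptions about the file contents -- check them first.
bump_kafka_timeouts() {
  local base="$1"   # path to the helix-on-prem-deployment-manager directory
  sed -i '/timeout/s/\b12\b/25/' "$base/commons/infra_check_scripts/kafka/kafka_checker_job.yml"
  sed -i '/sleep/s/\b15\b/30/' "$base/commons/infra_check_scripts/kafka/kafka_checker.sh"
}
# Usage: bump_kafka_timeouts helix-on-prem-deployment-manager
```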


Upgrade fails because of an Elasticsearch logging issue

Scope

The deployment manager fails to upgrade Elasticsearch logging from version 22.2.01 to 22.4, and the following error is displayed:

Error: UPGRADE FAILED: cannot patch "elasticsearch-logging-kibana" with kind PersistentVolumeClaim

Workaround

Clean up the old logging deployment in the bmc-helix-logging namespace, and deploy the new version of BMC Helix Logging. For more information, see 000407351.
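The cleanup can be sketched as follows. The Helm release name passed in and the decision to drop all PVCs in the namespace are assumptions, not confirmed by this document; list the releases first and confirm the exact procedure against KB article 000407351 before deleting anything:

```shell
# Sketch: remove the old logging resources so the new BMC Helix Logging
# version can be deployed cleanly. Release name is an assumption -- verify first.
cleanup_old_logging() {
  local release="$1"
  helm uninstall "$release" -n bmc-helix-logging
  # Dropping the PVCs avoids the "cannot patch ... PersistentVolumeClaim" failure,
  # but it deletes the old logging data -- confirm this is acceptable first.
  kubectl delete pvc --all -n bmc-helix-logging
}
# Usage: cleanup_old_logging <releasename>
```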


 
