You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This bug occurs for kfp-profile-controller usecase which deploys composite controller here. This controller needs to create a PodDefaults CR. When the CRD is not created before the composite controller is deployed the metacontroller is reporting errors
If we deploy the CRD manually the metacontroller pod gets stuck without recognizing the change. The metacontroller's pod stays in ACTIVE status without reporting any new logs. The composite controller object stays in state of not recognizing poddefault CRD.
Now metacontroller no longer responses and no poddefault CRs are created in test ns
kubectl -n kubeflow logs metacontroller-operator-charm-0 --timestamps
kubectl get poddefaults -n test
To recover remove the pod and let statefulset to recreate it
kubectl delete po metacontroller-operator-charm-0
Environment
microk8s v1.24.17
juju version 2.9.44-ubuntu-amd64
metacontroller version v2.0.4
Relevant log output
Model Controller Cloud/Region Version SLA Timestamp
kubeflow uk8sx microk8s/localhost 2.9.44 unsupported 11:49:17+02:00
App Version Status Scale Charm Channel Rev Address Exposed Message
kfp-profile-controller active 1 kfp-profile-controller 0 10.152.183.55 no
kubeflow-profiles active 1 kubeflow-profiles latest/edge 321 10.152.183.205 no
metacontroller-operator active 1 metacontroller-operator 1 10.152.183.124 no
minio res:oci-image@1755999 active 1 minio latest/edge 231 10.152.183.58 no
Unit Workload Agent Address Ports Message
kfp-profile-controller/0* active idle 10.1.137.14
kubeflow-profiles/0* active idle 10.1.137.15
metacontroller-operator/0* active idle 10.1.137.9
minio/0* active idle 10.1.137.13 9000/TCP,9001/TCP
Relation provider Requirer Interface Type Message
minio:object-storage kfp-profile-controller:object-storage object-storage regular
Additional context
No response
The text was updated successfully, but these errors were encountered:
Bug Description
This bug occurs for kfp-profile-controller usecase which deploys composite controller here. This controller needs to create a
PodDefaults
CR. When the CRD is not created before the composite controller is deployed the metacontroller is reporting errorsIf we deploy the CRD manually the metacontroller pod gets stuck without recognizing the change. The metacontroller's pod stays in ACTIVE status without reporting any new logs. The composite controller object stays in state of not recognizing poddefault CRD.
Only way to recover the metacontroller is to manually remove the metacontroller pod.
This may be problematic in prod deployments where the CRD for PodDefaults will be deployed later than the metacontroller.
To Reproduce
Deploy charms
Create test namespace
Chech metacontroller for errors (it is also stuck)
Manualy deploy crd from here https://github.com/canonical/admission-webhook-operator/blob/main/src/templates/crds.yaml.j2
Now metacontroller no longer responses and no poddefault CRs are created in test ns
To recover remove the pod and let statefulset to recreate it
Environment
microk8s v1.24.17
juju version 2.9.44-ubuntu-amd64
metacontroller version v2.0.4
Relevant log output
Additional context
No response
The text was updated successfully, but these errors were encountered: