-
Notifications
You must be signed in to change notification settings - Fork 757
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bicep kubernetes extension random 'System.OutOfMemoryException' #15758
Comments
I can confirm we're seeing the same intermittent failures. It is stopping our shop almost in the tracks, to the point were we can no longer tell if our deployments are actually broken because of our changes or just randomly failing, making continuous integration/deployment of our IaC close to impossible. I've spent quite a bit of time trying to iron out what changed on our end. To complement your list @Kravca , I've also tried downloading a bunch of different bicep CLI binaries, and build-ing the ARM-template from each, diffing them to see if a recent update has changed anything there. I've not found any changes yet in our generated template from all official releases between v0.28.1 and v0.31.92 (currrent at writing moment) |
Have you also observed that it isn't necessarily the same e.g. namespace or configmap that fails? We're deploying about
And we get intermittent fails on mostly the namespaces, but sometimes the secrets too. |
We also suspected ConfigMaps at some point (because of 'config' word in the error), but removing those from Bicep didn't bring any results, that's why I tried to focus on kubeConfig. The error is not very meaningful , as in our case, the error came from the module which uses the kubernetes extension or the one where you supply kubeConfig to. Which resource exactly in that module is failing, is not known (Id think its any), the error doesn't reveal it, that why I used the simplest example with just namespace, I didn't mean that specifically namespace resources are failing. Also it is not known which exactly API produces this error, is it ARM API (as we can see the error in ARM deployment logs), is it Kubernetes API (deeper), or something else. |
@Kravca Thanks for bringing this to our attention. We’re aware of the issue and have already developed a fix. However, due to the Azure service deployment freeze, we won’t be able to roll it out until early January. In the meantime, we will be making manual adjustments to our service VMs to mitigate the OOM issue. This process may take about two weeks since it involves making changes across all Azure regions. I’ll keep you updated on our progress. |
We were able to apply for an exception to proceed with deploying the fix. The deployment succeeded in the Canary regions, but further work is needed to roll it out to the other regions. |
Bicep version
Bicep CLI version 0.31.92
Describe the bug
We are using a lot Bicep kubernetes extension to deploy apps to Azure AKS cluster. Around 28 of November our pipelines started to fail with 'System.OutOfMemoryException' exception.
Like this:
Or like this:
Both errors from same deployment, without changing anything, just repeatedly redeploying same thing over and over (example lower)
To Reproduce
I have created the smallest biceps files (it only creates Namespace), that are getting this error, as example is really simple it has ~90% success.
main.bicep:
app.bicep:
Additional context
We tried:
Nothing really helps, this error appears randomly, somewhere its 50% success, on some pipelines its 10% success.
The text was updated successfully, but these errors were encountered: