-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OPA response is nil issue #6972
Comments
I have updated the clusters to opa 0.66.0. Things were quite for a few weeks. The issue has resurfaced though. I tried asking in the community slack channel, but so far no response. The decision logger we use does not return the admission review when the response from OPA is nil. When the response is nil, should be possible to get the admissionReview? I would like to be able to provide more specifics, but I am having trouble getting any information at the time of the issue. I do know that doing a rollout restart of the opa deployment alleviates the issue for a time hours/a couple days. The issue seems more prevalent in certain clusters. |
If you could provide some details that help repro the issue that would be really helpful. |
Is that part of the input? |
If only I could reproduce the issue :-) We use OPA as and admission webhook in multiple k8s clusters. We load our policies using kube-mgmt. We have a decision logger configured to log OPA policy success/fail with some details to aid in troubleshooting. If OPA returns a true/false response, we can see the k8s admissionReview just fine. But, if OPA returns a nil response, the admissionReview is also nil. Given that the decision logger does not return any useful information when the response from OPA is nil, there is not much to go on. Does OPA expose the k8s admission review details when OPA response is nil? Or is there some other recommended way to get useful information to help determine what triggers the nil response? From our decision logger..
We are not running into any issues when we run the unit tests on our policies. We alert when OPA returns a nil response, as it impacts the users submitting to the cluster. Performing a k8s rolling restart of the OPA deployment temporarily resolves the issue. Jobs submitted that encounter the |
Even if the policy result is
Are there any errors reported in the logs? |
I have only looked at the decision-logger side-car when the issue has surfaced. I will make a point to examine OPA logs to see if there are any clues there.
Thank you for confirming that! I will update the decision logger to spew.Dump to the console and see what that gets me |
Can you please also share your OPA config? |
I am working on gathering more information to help diagnose this issue. |
Looks like the issue we are seeing is the same as open-policy-agent/kube-mgmt#189 OPA is getting restarted on our AWS cluster, possible dues to Karpenter. kube-mgmt does not reload the policies. The response we get back just contains the decision-id and nothing else. We are planning on adding a livenessProbe to the |
Originally posted by @ashutosh-narkar in #6585 (comment)
The text was updated successfully, but these errors were encountered: