NE-2138: Bump cluster-dns-operator to Kubernetes 1.33 for 4.21 #448

davidesalerno · 2025-09-12T14:10:58Z

This change bumps Kubernetes libraries to 1.33.4 and controller-runtime to 0.21.0.

Due to some breaking changes, additional modifications were required:

Fixed calls to events.NewKubeRecorder
Bumping Dockerfile images for Openshift 4.21
Updating .ci-operator.yaml
Regenerated some manifest after executing a build

openshift-ci-robot · 2025-09-12T14:11:03Z

@davidesalerno: This pull request references NE-2138 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.21.0" version, but no target version was set.

In response to this:

This change bumps Kubernetes libraries to 1.33.4 and controller-runtime to 0.21.0.

Due to some breaking changes, additional modifications were required:

Fixed calls to events.NewKubeRecorder

Bumping Dockerfile images for Openshift 4.21

Updating .ci-operator.yaml

Regenerated some manifest after executing a build

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

rikatz · 2025-09-12T14:19:16Z

manifests/0000_70_dns-operator_00.crd.yaml

+                      pattern: ^([a-z0-9]([-a-z0-9]*[a-z0-9])?(\.[a-z0-9]([-a-z0-9]*[a-z0-9])?)*/)?(([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9])$
                      type: string
                  required:
+                  - lastTransitionTime


this is a breaking change. Can we confirm that our intention was always to have these as "required" and that this won't cause any issue?

(or at least track where this change came from)

This change come from the update into the OpenShift APIs version and a similar change was applied to the CIO in this commit.

If we would like to be compliant to that API version we cannot avoid this change.

I checked that at the moment we are setting everytime there is a change this field and the Status too.

Can we confirm that our intention was always to have these as "required" and that this won't cause any issue?

(or at least track where this change came from)

The change was added in openshift/api#2037 then reverted by TRT openshift/api#2045 (for the reason of breaking some cluster operators, good catch!) and then re-added again in openshift/api#2048. Looking at the bug created by the TRT team seems like the apiserver and the authentication were impacted by the change. David Eads tried to make sure these gaps covered in his unrevert PR.

We did a similar change in CIO and it seemed to go unnoticed there (I don't see any comment about it in the PR).
OperatorCondition is used mostly by cluster operators to report Degraded, Progressing and Available statuses. Also, I know that we use this condition in some operands like IngressController whose status is also managed by a cluster operator (ingress).
Overall, I think it's unlikely the end user will face a compatibility issue for a cluster operator (or a cluster operand) statuses. But we can follow up with David Eads on 1) how we can make sure we don't break the compatibility (maybe Davide's check about setting newly required fields is enough), 2) how incompatible changes are managed in openshift/api module (taking into account that the module has no semver).

@deads2k Could you help us in verifying that these changes won't cause any issue to the impacted systems?

As discussed with David Eads, we (CDO as part of OCP payload) are consumers of the CRDs generated in openshift/api from the API types. The compatibility requirements are more relaxed for us (compared to clients). David suggested to keep in sync with latest openshift/api updates which includes code changes. I think that we can move on with this change in the Condition type taking into account the CI signal (status updates should go smooth without API server errors).

rikatz · 2025-09-12T14:20:25Z

manifests/0000_70_dns-operator_00.crd.yaml

                      type: string
                    status:
+                      description: status of the condition, one of True, False, Unknown.
+                      enum:


We need to verify if we are trying to set something other than this enum as this apparently was added recently on some API change

rikatz · 2025-09-12T14:24:12Z

go.mod

-	github.com/openshift/client-go v0.0.0-20231024221206-506d798bc61c
+	github.com/go-logr/logr v1.4.2
+	github.com/google/go-cmp v0.7.0
+	github.com/openshift/api v0.0.0-20250710004639-926605d3338b


btw the breaking changes I am pointing below may be coming from this bump.

tbh I didn't tried to re-generate the manifests on openshift/cluster-ingress-operator#1279 and seeing the breaking changes below kind of concerns me we may be missing something.

yes, these changes are coming from that bump after executing a make

Changes are due to this update into OperatorCondition into the OpenShift API https://github.com/openshift/api/blob/master/operator/v1/types.go#L190

I think that if we would like to bump to that version the API we cannot avoid it even if it could be a breaking change.

davidesalerno · 2025-09-15T07:35:39Z

/test okd-scos-e2e-aws-ovn

davidesalerno · 2025-09-15T12:26:20Z

/test e2e-aws-ovn-serial

rikatz · 2025-09-15T12:42:17Z

/approve

From my perspective, this looks fine, but I will leave final comments to Andrey and Miciah regarding this "breaking change" and if it already happened on past, we should be fine :)

/cc @alebedev87 @Miciah

openshift-ci · 2025-09-15T12:46:40Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rikatz

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [rikatz]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

alebedev87 · 2025-09-15T15:16:23Z

/assign

alebedev87

I commented on the point about the breaking change. I don't think we are going to impact end users. The cluster dns operator's e2e checks the operator conditions which means that the operator is setting the status condition during the test. This should give us a signal if any required field is not set or if any unexpected value is used.

/lgtm
/hold

@davidesalerno: LGTM, just keeping a small hold in case you would like to follow up with David Eads.

davidesalerno · 2025-09-17T13:00:10Z

/test okd-scos-e2e-aws-ovn

lihongan · 2025-09-19T01:12:57Z

/retest

lihongan · 2025-09-19T01:13:11Z

/verified by @lihongan

openshift-ci-robot · 2025-09-19T01:13:23Z

@lihongan: This PR has been marked as verified by @lihongan.

In response to this:

/verified by @lihongan

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Signed-off-by: Davide Salerno <[email protected]>

davidesalerno · 2025-09-24T18:35:35Z

/retest-required

lihongan · 2025-09-25T01:46:01Z

/retest-required

alebedev87 · 2025-09-25T18:03:53Z

As discussed with David Eads, we should check the CI signal about the status updates. No status update errors should be seen in the CDO logs.
/lgtm

@davidesalerno : I let you unhold the PR once the CDO logs are checked ^.

davidesalerno · 2025-09-26T06:24:25Z

/retest

openshift-ci · 2025-09-26T10:53:27Z

@davidesalerno: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/e2e-aws-ovn-single-node	`84d742b`	link	false	`/test e2e-aws-ovn-single-node`
ci/prow/okd-scos-e2e-aws-ovn	`84d742b`	link	false	`/test okd-scos-e2e-aws-ovn`
ci/prow/e2e-aws-ovn-techpreview	`84d742b`	link	false	`/test e2e-aws-ovn-techpreview`

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Sep 12, 2025

openshift-ci bot requested review from alebedev87 and rikatz September 12, 2025 14:12

rikatz reviewed Sep 12, 2025

View reviewed changes

openshift-ci bot requested a review from Miciah September 15, 2025 12:42

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 15, 2025

openshift-ci bot assigned alebedev87 Sep 15, 2025

alebedev87 reviewed Sep 15, 2025

View reviewed changes

openshift-ci bot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lgtm Indicates that a PR is ready to be merged. labels Sep 15, 2025

openshift-ci-robot added the verified Signifies that the PR passed pre-merge verification criteria label Sep 19, 2025

davidesalerno added 2 commits September 24, 2025 17:45

NE-2138: Bump cluster-dns-operator to Kubernetes 1.33 for 4.21 - vendor

48966e1

Signed-off-by: Davide Salerno <[email protected]>

NE-2138: Bump cluster-dns-operator to Kubernetes 1.33 for 4.21

84d742b

Signed-off-by: Davide Salerno <[email protected]>

davidesalerno force-pushed the NE-2138 branch from e71bd37 to 84d742b Compare September 24, 2025 15:47

openshift-ci-robot removed the verified Signifies that the PR passed pre-merge verification criteria label Sep 24, 2025

openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Sep 24, 2025

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Sep 25, 2025

NE-2138: Bump cluster-dns-operator to Kubernetes 1.33 for 4.21 #448

Are you sure you want to change the base?

NE-2138: Bump cluster-dns-operator to Kubernetes 1.33 for 4.21 #448

Uh oh!

Conversation

davidesalerno commented Sep 12, 2025

Uh oh!

openshift-ci-robot commented Sep 12, 2025 • edited by openshift-ci bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidesalerno commented Sep 15, 2025

Uh oh!

davidesalerno commented Sep 15, 2025

Uh oh!

rikatz commented Sep 15, 2025

Uh oh!

openshift-ci bot commented Sep 15, 2025

Uh oh!

alebedev87 commented Sep 15, 2025

Uh oh!

alebedev87 left a comment

Choose a reason for hiding this comment

Uh oh!

davidesalerno commented Sep 17, 2025

Uh oh!

lihongan commented Sep 19, 2025

Uh oh!

lihongan commented Sep 19, 2025

Uh oh!

openshift-ci-robot commented Sep 19, 2025

Uh oh!

davidesalerno commented Sep 24, 2025

Uh oh!

lihongan commented Sep 25, 2025

Uh oh!

alebedev87 commented Sep 25, 2025

Uh oh!

davidesalerno commented Sep 26, 2025

Uh oh!

openshift-ci bot commented Sep 26, 2025

Uh oh!

Uh oh!

openshift-ci-robot commented Sep 12, 2025 •

edited by openshift-ci bot

Loading