Add Kubernetes Deployer for Pipeline Deployments #4127

safoinme · 2025-11-04T15:11:55Z

Describe changes

This PR introduces a new Kubernetes Deployer that enables deploying ZenML pipelines as long-running services on any Kubernetes cluster.

Pre-requisites

Please ensure you have done the following:

I have read the CONTRIBUTING.md document.
I have added tests to cover my changes.
I have based my new branch on develop and the open PR is targeting develop. If your branch wasn't based on develop, read the Contribution guide on rebasing a branch to develop.
IMPORTANT: I made sure that my changes are reflected properly in the following resources:
- ZenML Docs
- Dashboard: Needs to be communicated to the frontend team.
- Templates: Might need adjustments (that are not reflected in the template tests) in case of non-breaking changes and deprecations.
- Projects: Depending on the version dependencies, different projects might get affected.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Other (add details above)

github-actions · 2025-11-04T15:13:28Z

Documentation Link Check Results

✅ Absolute links check passed
✅ Relative links check passed
Last checked: 2025-11-14 11:36:54 UTC

src/zenml/integrations/kubernetes/deployers/kubernetes_deployer.py

+            sanitized_key = self._sanitize_secret_key(key, secret_key_map)
+            if sanitized_key != key:
+                logger.warning(
+                    f"Secret key '{key}' sanitized to '{sanitized_key}'"


The best way to fix the problem is to avoid logging the actual value of key (from the secrets dictionary) in clear text. The log message currently exposes the original secret key and the sanitized version of it, which could leak both internal secret naming conventions and potentially sensitive user-supplied names.

To address this, the log message should be reworded so that neither the original nor sanitized secret key are included verbatim. Instead, you can log the fact that a secret key was sanitized, possibly including metadata such as an index, length, or even just a generic message without displaying the keys themselves.

Specific changes:

Edit only the log statement on line 517 in src/zenml/integrations/kubernetes/deployers/kubernetes_deployer.py.

Remove or obfuscate both key and sanitized_key from the log output, or limit the output to a generic message such as: "A secret key was sanitized.".

No import or method changes are needed as the log level, logger, etc. remain unchanged.
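A minimal sketch of the suggested change, not the PR's actual code: `sanitize_secret_keys` and its regex are hypothetical stand-ins for the deployer's `_sanitize_secret_key`, and the collision handling implied by `secret_key_map` is left out. It logs only a count, so neither the original nor the sanitized key reaches the log.

```python
import logging
import re
from typing import Dict

logger = logging.getLogger(__name__)


def sanitize_secret_keys(secrets: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of `secrets` whose keys are safe as env var names.

    Hypothetical helper for illustration only. Key collisions after
    sanitization are not handled here.
    """
    sanitized: Dict[str, str] = {}
    changed = 0
    for key, value in secrets.items():
        # Replace anything outside [A-Za-z0-9_] and upper-case the result.
        new_key = re.sub(r"[^A-Za-z0-9_]", "_", key).upper()
        if new_key != key:
            changed += 1
        sanitized[new_key] = value
    if changed:
        # Generic message: the key names themselves are never logged.
        logger.warning("%d secret key(s) were sanitized.", changed)
    return sanitized
```

Logging a count keeps the warning useful for debugging while avoiding any disclosure of secret naming conventions.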

stefannica

I only took a brief look at this PR, but I can already suggest a few major items:

- Please move the dry-run feature into a separate PR. 6k LOC is a big PR to have to review and this feature doesn't belong here. In addition to the scope creep, it doesn't seem like you put too much thought into its design.
- Parts of the code still feel like they've been written by an LLM that doesn't know how your Pydantic types are defined. getattr and hasattr should never appear in your code. Please review your own code and make sure that it's not vibe-coded before you open it for review.

src/zenml/cli/pipeline.py

src/zenml/integrations/kubernetes/kube_utils.py

schustmi · 2025-11-13T02:18:02Z

src/zenml/integrations/kubernetes/k8s_applier.py

+    """Yield single resources from possibly 'List' wrappers.
+
+    Args:
+        objs: Iterable of resource dicts (already normalized). Supports 'kind: List'.


"Supports 'kind: List'": it is not clear at all what this does.

Updated docstring to explain this.

Kubernetes allows manifests that wrap multiple resources in a single object:

kind: List
items:
  - kind: Deployment
    ...
  - kind: Service
    ...
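The unwrapping described above could be sketched roughly as follows. This is an illustration under assumptions, not the code from `k8s_applier.py`: the function name `iter_resources` is hypothetical, and only the literal `kind: List` wrapper is handled.

```python
from typing import Any, Dict, Iterable, Iterator


def iter_resources(objs: Iterable[Dict[str, Any]]) -> Iterator[Dict[str, Any]]:
    """Yield individual resources, unwrapping any 'kind: List' objects."""
    for obj in objs:
        if obj.get("kind") == "List":
            # A List wrapper carries its resources under 'items';
            # recurse so nested wrappers are flattened too.
            yield from iter_resources(obj.get("items") or [])
        else:
            yield obj


manifests = [
    {"kind": "List", "items": [{"kind": "Deployment"}, {"kind": "Service"}]},
    {"kind": "ConfigMap"},
]
kinds = [resource["kind"] for resource in iter_resources(manifests)]
```

With the sample input, `kinds` contains the Deployment, the Service, and the ConfigMap in document order, so callers never need to special-case the wrapper.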

schustmi · 2025-11-13T02:19:14Z

src/zenml/integrations/kubernetes/k8s_applier.py

+            )
+        return host
+
+    def _svc_lb_host(self, service_obj: Any) -> Optional[str]:


Because we are using the Kubernetes client, which also types the resource as Any. It could basically return:
• a raw dict coming from to_dict() or some other JSON-ish source, or
• a Kubernetes client / dynamic client object (e.g. ResourceInstance) that has a .to_dict() method but doesn’t have a nice static type in the Python client.
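One way to handle both input shapes is to normalize to a dict before reading any fields. The sketch below is an assumption about how that could look, not the PR's `_svc_lb_host`: the names `as_dict`, `lb_host`, and `SupportsToDict` are hypothetical, and it accepts both `loadBalancer` and `load_balancer` because the two sources may use different key casing.

```python
from typing import Any, Dict, Optional, Protocol, runtime_checkable


@runtime_checkable
class SupportsToDict(Protocol):
    """Structural type for client objects exposing to_dict()."""

    def to_dict(self) -> Dict[str, Any]: ...


def as_dict(obj: Any) -> Dict[str, Any]:
    """Normalize a raw dict or a client object into a plain dict."""
    if isinstance(obj, dict):
        return obj
    if isinstance(obj, SupportsToDict):
        return obj.to_dict()
    raise TypeError(f"Unsupported resource type: {type(obj)}")


def lb_host(service_obj: Any) -> Optional[str]:
    """Return the first load balancer hostname or IP, if any."""
    svc = as_dict(service_obj)
    status = svc.get("status") or {}
    # Assumption: tolerate both camelCase and snake_case keys.
    lb = status.get("loadBalancer") or status.get("load_balancer") or {}
    for entry in lb.get("ingress") or []:
        host = entry.get("hostname") or entry.get("ip")
        if host:
            return host
    return None
```

Normalizing once at the boundary keeps the rest of the code working on a single, predictable shape even though the parameter itself is typed as `Any`.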

schustmi · 2025-11-13T02:22:29Z

src/zenml/integrations/kubernetes/k8s_applier.py

+            raise ValueError(
+                f"Expected dict after serialization, got {type(d)}"
+            )
+        if "api_version" in d and "apiVersion" not in d:


Is that a bug in the Kubernetes package that they don't convert this one exact key?

Technically, Kubernetes only accepts apiVersion, but api_version is the usual attribute name in the Python client, so this is just an extra step to avoid failing on that naming. Otherwise the failure only happens because the user provided the wrong key, and we could also just fail.
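The extra normalization step discussed here might look like the sketch below. The function name `normalize_api_version` is hypothetical, and returning a copy instead of mutating the input is a choice made for this illustration.

```python
from typing import Any, Dict


def normalize_api_version(d: Any) -> Dict[str, Any]:
    """Rename a Python-style 'api_version' key to 'apiVersion'."""
    if not isinstance(d, dict):
        raise ValueError(f"Expected dict after serialization, got {type(d)}")
    if "api_version" in d and "apiVersion" not in d:
        # Copy first so the caller's dict is left untouched.
        d = dict(d)
        d["apiVersion"] = d.pop("api_version")
    return d
```

If `apiVersion` is already present it takes precedence and the input is returned unchanged. The stricter alternative raised in the thread is to skip the rename and fail on the unexpected key.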

src/zenml/integrations/kubernetes/kube_utils.py

schustmi · 2025-11-13T02:37:01Z

src/zenml/integrations/kubernetes/template_engine.py

+from typing import Any, Dict, List
+
+import yaml
+from jinja2 import Environment, StrictUndefined, TemplateError, Undefined


My question still stands: Where is jinja getting installed from?

I have just added it to the k8s integration requirements.

safoinme added 15 commits November 1, 2025 18:52

init

d6d99ed

tests

80e9943

fixes

0bdb1cc

fix jinja

aab44ce

fixes

9960044

docstring

346d727

fixes

640244c

mypy

8e2f0bd

fix tests

7bcb962

few fixes

1b228ef

fix loadbalancer url

0eaa6d8

fix

b72967b

updated cleaner

7a561d9

docs and docstring

4534f26

fix

7a9b8d5

github-actions bot added the internal and enhancement labels Nov 4, 2025

github-advanced-security bot found potential problems Nov 4, 2025

safoinme added 2 commits November 4, 2025 16:15

fix security warnning

f664717

Merge branch 'develop' into feature/kubernetes-deployer

fda2415

safoinme requested review from stefannica and strickvl November 4, 2025 15:37

safoinme added 5 commits November 4, 2025 17:21

tests

3f72d04

dry run

03da474

cleanups

209778a

fixes

b22ce86

fix tests

2e3a380

stefannica requested changes Nov 5, 2025

src/zenml/cli/pipeline.py (4 outdated, resolved review threads)

cleanup

c8606ce

safoinme added 8 commits November 11, 2025 08:29

more fixes

cbda686

fix delettion and docs

6e4607b

Merge branch 'develop' into feature/kubernetes-deployer

55a8e45

fix service and status

6cc5c0d

typing

0f99916

format

c41f2b0

Merge branch 'develop' into feature/kubernetes-deployer

9d72286

logging delete

9ab4342

stefannica requested changes Nov 12, 2025

src/zenml/integrations/kubernetes/kube_utils.py (resolved review thread)

stefannica added the run-slow-ci label Nov 12, 2025

safoinme added 3 commits November 12, 2025 11:26

updated get state

1690945

fixes

cb091b2

fixes

3086b72

schustmi requested changes Nov 13, 2025

safoinme added 6 commits November 13, 2025 10:11

michael review

0fbd1a3

Merge branch 'develop' into feature/kubernetes-deployer

e0595e0

comments

e71f040

few fixes

6c5d9f9

fix tests

1542dad

Merge branch 'develop' into feature/kubernetes-deployer

59dfbf0

safoinme requested review from schustmi, stefannica and strickvl November 13, 2025 11:18

safoinme added 2 commits November 13, 2025 14:43

typo

24739de

typo

568166c

stefannica approved these changes Nov 13, 2025

safoinme and others added 4 commits November 13, 2025 22:55

fixes

a96c068

Merge branch 'develop' into feature/kubernetes-deployer

63bd296

failing test

f7ebf14

Merge branch 'feature/kubernetes-deployer' of https://github.com/zenm…

a3fcae3

…l-io/zenml into feature/kubernetes-deployer


5 participants
