Dynamic pipelines v0 #4074

schustmi · 2025-10-21T01:27:14Z

Describe changes

This PR implements dynamic pipelines for the local and Kubernetes orchestrators.

Example

from zenml import step, pipeline

@step
def generate_int() -> int:
  return 3

@step
def do_something(index: int) -> None:
  ...

@pipeline(dynamic=True)
def dynamic_pipeline() -> None:
  count = generate_int()
  # `count` is an artifact, we now load the data
  count_data = count.load()

  for idx in range(count_data):
    # This will run sequentially, like regular python code would.
    # See the features below for an example on how to run steps
    # in parallel.
    do_something(idx)

if __name__ == "__main__":
  dynamic_pipeline()

Features

Dynamic configuration for steps:

@pipeline(dynamic=True)
def dynamic_pipeline():
  some_step.with_options(enable_cache=False)()

Specify a step runtime: The runtime configures whether the step will be run in the orchestration environment (runtime=inline) or if the orchestrator will spin up a separate step execution environment (runtime=isolated).

@step(runtime="isolated")
def some_step() -> None:
  ...

Execute multiple steps in parallel by using step.submit(...). This starts a new thread, which either executes the step directly or launches a new container for it.

@step
def some_step(arg: int) -> int:
  ...

# Hypothetical consumer step, defined here only so the snippet is complete
@step
def downstream_step(value: int) -> None:
  ...

@pipeline(dynamic=True)
def dynamic_pipeline():
  future = some_step.submit(arg=1)
  artifact = future.result()  # wait and get artifact response(s)
  data = future.load()  # wait and load artifact data
  downstream_step(future)  # pass the output to another step

  # Run multiple steps in parallel
  for idx in range(3):
    some_step.submit(arg=idx)

Specify config templates for steps using depends_on:

# config.yaml
steps:
  some_step:
    parameters:
      arg: 3

# run.py
@step
def some_step(arg: int) -> None:
  ...

@pipeline(dynamic=True, depends_on=[some_step])
def dynamic_pipeline():
  some_step()

if __name__ == "__main__":
  dynamic_pipeline.with_options(config_path="config.yaml")()

Limitations/Known issues

Our logging storage isn't thread-safe yet, which means logs of steps running in parallel get mixed up.
When running multiple steps concurrently, failure in one step does not stop the other steps. Instead, they continue executing until finished.
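The second limitation can be illustrated with plain threads. This is only an analogy built on Python's standard concurrent.futures module, not ZenML's implementation; fake_step is a hypothetical stand-in for a real step.

```python
import time
from concurrent.futures import ThreadPoolExecutor

completed = []


def fake_step(index: int) -> int:
    """Hypothetical stand-in for a step; index 0 fails immediately."""
    if index == 0:
        raise RuntimeError("step 0 failed")
    time.sleep(0.05)  # the other steps are still running when step 0 fails
    completed.append(index)
    return index


# Leaving the `with` block waits for every submitted call to finish.
with ThreadPoolExecutor(max_workers=3) as pool:
    futures = [pool.submit(fake_step, i) for i in range(3)]

# The failure is only visible on the future of the failed call;
# the remaining calls ran to completion regardless.
errors = [f.exception() for f in futures if f.exception() is not None]
```

Stopping the remaining work early would require explicit cooperation (for example a shared cancellation flag), which is not part of this sketch.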

Pre-requisites

Please ensure you have done the following:

I have read the CONTRIBUTING.md document.
I have added tests to cover my changes.
I have based my new branch on develop and the open PR is targeting develop. If your branch wasn't based on develop read Contribution guide on rebasing branch to develop.
IMPORTANT: I made sure that my changes are reflected properly in the following resources:
- ZenML Docs
- Dashboard: Needs to be communicated to the frontend team.
- Templates: Might need adjustments (that are not reflected in the template tests) in case of non-breaking changes and deprecations.
- Projects: Depending on the version dependencies, different projects might get affected.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Other (add details above)

Json-Andriopoulos · 2025-11-03T11:12:19Z

src/zenml/execution/pipeline/dynamic/outputs.py

+        """
+        return self._wrapped.result()
+
+    def load(self) -> Any:


Perhaps we should keep a cache dictionary for materialized artifacts, so if (for some weird reason) users do multiple load calls (instead of assigning and reusing) we can return the cached results.

I like that idea. I'll also add a boolean flag to the load method to bypass the cache, so users can at least manually disable it for huge artifacts.
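A minimal sketch of the caching idea discussed above, assuming a dictionary keyed by an artifact identifier. The class name ArtifactCache, the materialize callback and the disable_cache flag are all hypothetical and not taken from the PR.

```python
from typing import Any, Callable, Dict


class ArtifactCache:
    """Hypothetical cache of materialized artifact data, keyed by artifact ID."""

    def __init__(self, materialize: Callable[[str], Any]) -> None:
        # `materialize` stands in for whatever actually loads the data.
        self._materialize = materialize
        self._cache: Dict[str, Any] = {}

    def load(self, artifact_id: str, disable_cache: bool = False) -> Any:
        if disable_cache:
            # Skip the cache entirely, e.g. for huge artifacts.
            return self._materialize(artifact_id)
        if artifact_id not in self._cache:
            self._cache[artifact_id] = self._materialize(artifact_id)
        return self._cache[artifact_id]


# Usage: count how often the (fake) materialization actually runs.
calls = []


def fake_materialize(artifact_id: str) -> str:
    calls.append(artifact_id)
    return f"data-for-{artifact_id}"


cache = ArtifactCache(fake_materialize)
first = cache.load("a")
second = cache.load("a")  # served from the cache
third = cache.load("a", disable_cache=True)  # materialized again
```

The sketch is not thread-safe; if load can be called from parallel steps, the dictionary access would need a lock.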

Json-Andriopoulos · 2025-11-03T11:14:46Z

src/zenml/execution/pipeline/dynamic/outputs.py

+        else:
+            raise ValueError(f"Invalid step run output: {result}")
+
+    def __getitem__(self, key: Union[str, int]) -> ArtifactFuture:


Maybe we should provide a custom implementation of __setitem__ as better UX/guidance for users, rather than letting them hit an error like this: object does not support item assignment.
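One way this suggestion could look, as a sketch only: the class ReadOnlyOutputs and the wording of the error message are hypothetical and not part of the PR.

```python
from typing import Any, Iterable


class ReadOnlyOutputs:
    """Hypothetical wrapper that rejects item assignment with a clear message."""

    def __init__(self, values: Iterable[Any]) -> None:
        self._values = tuple(values)

    def __getitem__(self, index: int) -> Any:
        return self._values[index]

    def __setitem__(self, key: Any, value: Any) -> None:
        # Raise the same exception type Python would, but explain why.
        raise TypeError(
            "Step outputs are read-only: they are produced by the step run "
            "and cannot be reassigned. Pass a new value to the downstream "
            "step instead."
        )


outputs = ReadOnlyOutputs([1, "str"])
try:
    outputs[0] = 5
except TypeError as exc:
    message = str(exc)
```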

Json-Andriopoulos · 2025-11-03T11:25:11Z

src/zenml/execution/pipeline/dynamic/outputs.py

+        """
+        if isinstance(key, str):
+            index = self._output_keys.index(key)
+        elif isinstance(key, int):


I would recommend adopting one kind of wrapper behavior and sticking to it. If we let users access futures both dict-style and list/tuple-style, then the more magic methods we add to the class, the more confusing it becomes what actually gets executed versus what users expect to be executed.

Plus, accessing by key is in general the safer operation and would be a good pattern to enforce. The results may change in length or order, so accessing by position is a bit unsafe.

For instance, if I can access outputs dict-style (by specifying a key), I may expect __iter__ to return the keys, not the values. For better consistency, I would say we have two options:

Option 1: No magic methods. We provide public methods with clean documentation. Users manipulate futures through the available functions and their signatures. No confusion, users just follow the docstrings.

Option 2: We implement magic methods, but the implementation wraps one kind of hidden data structure. Users can manipulate it as a dict (preferable imo) or as a list/tuple.

Makes sense, I agree we should only support one way. My vote goes for tuple-like behaviour though, as the main purpose of my implementation was to support the following. When calling any Python function with multiple outputs, there are two cases:

def f():
  return 1, "str"

# Return value is a tuple that can be accessed with `int` keys
tuple_result = f()
int_result = tuple_result[0]

# Automatic unpacking
int_result, str_result = f()

I think the latter is the most common, and it is the use case I think we should support to make it feel as pythonic as possible (it also mirrors how you would call sync steps, in which case the return value will not be a future but a tuple of artifacts).

Tuples are also immutable, which holds true in our case as well: you can't add outputs to the future result of a step run.

We can then add some helper methods like get_output(key: str), as you suggested, to allow fetching specific outputs by key.

I see, yes, tuple should do it as well. As long as we are consistent I think we are ok.
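The agreed direction (tuple-like access plus a named helper) can be sketched as follows. This is an illustration under assumptions: OutputsFuture is a hypothetical name, it wraps a standard concurrent.futures.Future, and unlike the PR's __getitem__ (which returns an ArtifactFuture) it blocks and returns the value directly to keep the example short.

```python
from concurrent.futures import Future
from typing import Any, Iterator, Sequence


class OutputsFuture:
    """Hypothetical tuple-like view on a future that resolves to all outputs."""

    def __init__(self, wrapped: "Future[Any]", output_keys: Sequence[str]) -> None:
        self._wrapped = wrapped
        self._output_keys = tuple(output_keys)

    def __len__(self) -> int:
        return len(self._output_keys)

    def __getitem__(self, index: int) -> Any:
        # Positional access only; names go through get_output().
        if not isinstance(index, int):
            raise TypeError("Use get_output(key) to fetch an output by name.")
        return self._wrapped.result()[index]

    def __iter__(self) -> Iterator[Any]:
        # Iterating yields values, which is what makes unpacking work.
        return iter(self._wrapped.result())

    def get_output(self, key: str) -> Any:
        return self[self._output_keys.index(key)]


# Usage: an already-resolved future with two outputs.
future: "Future[Any]" = Future()
future.set_result((1, "str"))
outputs = OutputsFuture(future, output_keys=["count", "label"])

int_result, str_result = outputs  # automatic unpacking
by_index = outputs[0]
by_name = outputs.get_output("label")
```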

Json-Andriopoulos · 2025-11-03T11:36:05Z

src/zenml/execution/pipeline/dynamic/outputs.py

+            index=index,
+        )
+
+    def __iter__(self) -> Any:


Would __contains__ make sense to also implement here? 🤔

~~Oh yes I thought I already did, that definitely makes sense!~~
Actually, if we do implement it as a tuple-like data structure, __contains__ would check against the values and might not make much sense. What do you think?

Yes, it would make more sense in a dict-like scenario!
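The difference is easy to see with built-in containers. The output names used here are made up for the example.

```python
# The same two outputs, modelled tuple-like and dict-like.
outputs_as_tuple = (1, "str")
outputs_as_dict = {"count": 1, "label": "str"}

# `in` on a tuple compares against the values ...
value_in_tuple = 1 in outputs_as_tuple
name_in_tuple = "label" in outputs_as_tuple

# ... while `in` on a dict compares against the keys.
name_in_dict = "label" in outputs_as_dict
value_in_dict = 1 in outputs_as_dict
```

With tuple semantics, a membership test would have to compare artifact values, which is rarely what a user asking "does this step have output X" wants.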

Json-Andriopoulos · 2025-11-03T11:50:44Z

src/zenml/execution/pipeline/utils.py

+    Yields:
+        None.
+    """
+    with env_utils.temporary_environment(


Not in the context of this PR, but we can do a better job here, thread-safety wise. In general os.environ should be treated as an immutable value. Changing its values may unintentionally affect other execution paths (for instance code running in other threads).

I think masking the environment behind a custom object/class and making that accessible through context vars resolves the issue (a value set on a context var in one thread is not visible in other running threads; the wrapper object loads and freezes the state and exposes all the operations we want with relative safety).

Yes you're right! This is not a problem in this case, but when running multiple steps in parallel we also set different environment variables which is problematic. At least in this case I can switch to using a context var instead

We can work something out here, but that's a different story of course. ContextVars should work perfectly, maybe in combination with a centralized BaseSettings object. I will create the story and we can discuss the implementation.
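A rough sketch of the context-var approach discussed in this thread, assuming overrides are layered on top of os.environ instead of mutating it. The names temporary_overrides, get_env and DEMO_STEP_NAME are hypothetical and do not come from the PR.

```python
import contextvars
import os
import threading
from contextlib import contextmanager
from types import MappingProxyType
from typing import Dict, Iterator, Mapping, Optional

# Read-only, per-context overrides; os.environ itself is never modified.
_EMPTY: Mapping[str, str] = MappingProxyType({})
_overrides: "contextvars.ContextVar[Mapping[str, str]]" = contextvars.ContextVar(
    "env_overrides", default=_EMPTY
)


@contextmanager
def temporary_overrides(values: Mapping[str, str]) -> Iterator[None]:
    """Layer `values` over the current overrides for the duration of the block."""
    merged = MappingProxyType({**_overrides.get(), **values})
    token = _overrides.set(merged)
    try:
        yield
    finally:
        _overrides.reset(token)


def get_env(name: str, default: Optional[str] = None) -> Optional[str]:
    """Look up `name` in the context-local overrides first, then in os.environ."""
    overrides = _overrides.get()
    if name in overrides:
        return overrides[name]
    return os.environ.get(name, default)


# Usage: each thread sets its own value without affecting the others.
seen: Dict[str, Optional[str]] = {}


def worker(label: str, value: str) -> None:
    with temporary_overrides({"DEMO_STEP_NAME": value}):
        seen[label] = get_env("DEMO_STEP_NAME")


threads = [
    threading.Thread(target=worker, args=(f"t{i}", f"step-{i}")) for i in range(3)
]
for thread in threads:
    thread.start()
for thread in threads:
    thread.join()

main_value = get_env("DEMO_STEP_NAME", default="unset")
```

Code that reads os.environ directly (including subprocesses and third-party libraries) does not see these overrides, so this only helps for lookups that go through the wrapper.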

Json-Andriopoulos · 2025-11-03T11:54:07Z

src/zenml/execution/pipeline/utils.py

+        Whether to prevent pipeline execution.
+    """
+    return handle_bool_env_var(
+        ENV_ZENML_PREVENT_PIPELINE_EXECUTION, default=False


It is hard to track those environment variable references. Maybe a BaseSettings object that organizes them in a single place would be a good idea. We also wouldn't need this function, since pydantic would validate the boolean value for us in one go.

Yep agreed in general that would be a nice thing. We even have some classes that do this for a subset of env variables (ServerConfiguration and GlobalConfiguration), but not for all of them. This function also treats values like "yes" as True, not sure how pydantic would handle that natively, but we could implement it for sure.

I think Pydantic accepts multiple truthy values as well; I can cross-compare. I will open a new story for this, I just wanted to get your opinion :)
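To make the idea concrete without depending on pydantic, here is a standard-library sketch that uses a frozen dataclass in place of a BaseSettings class. The accepted truthy/falsy strings, the ExecutionSettings class and the DEMO_PREVENT_PIPELINE_EXECUTION variable name are assumptions for illustration; the actual behavior of handle_bool_env_var is not shown in this thread.

```python
import os
from dataclasses import dataclass
from typing import Mapping

# Assumed sets of accepted spellings; adjust to whatever the project decides.
_TRUE_VALUES = {"1", "true", "t", "yes", "y", "on"}
_FALSE_VALUES = {"0", "false", "f", "no", "n", "off"}


def parse_bool(raw: str) -> bool:
    """Interpret a lenient boolean string such as "yes" or "0"."""
    value = raw.strip().lower()
    if value in _TRUE_VALUES:
        return True
    if value in _FALSE_VALUES:
        return False
    raise ValueError(f"Cannot interpret {raw!r} as a boolean")


@dataclass(frozen=True)
class ExecutionSettings:
    """Hypothetical single place where execution-related env vars are read."""

    prevent_pipeline_execution: bool = False

    @classmethod
    def from_env(cls, environ: Mapping[str, str] = os.environ) -> "ExecutionSettings":
        raw = environ.get("DEMO_PREVENT_PIPELINE_EXECUTION")
        if raw is None:
            return cls()
        return cls(prevent_pipeline_execution=parse_bool(raw))


# Usage: pass a plain dict instead of os.environ to keep the example isolated.
enabled = ExecutionSettings.from_env({"DEMO_PREVENT_PIPELINE_EXECUTION": "yes"})
default = ExecutionSettings.from_env({})
```

Passing the mapping explicitly keeps the settings object easy to test, and the frozen dataclass matches the "load once, then treat as immutable" approach suggested for environment handling.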

src/zenml/execution/step/utils.py

src/zenml/models/v2/core/pipeline_run.py

github-actions bot added the internal and enhancement labels Oct 21, 2025

WIP

c6153aa

schustmi force-pushed the feature/dynamic-pipelines branch from 4c02511 to c6153aa October 21, 2025 02:02

schustmi added 3 commits October 21, 2025 11:37

Dynamic step config

98d115e

misc

6b51cc5

Optional entrypoint args

0906233

schustmi force-pushed the feature/dynamic-pipelines branch 3 times, most recently from 08b3069 to 250e5d3 October 23, 2025 02:42

misc

e217f58

schustmi force-pushed the feature/dynamic-pipelines branch from 250e5d3 to e217f58 October 23, 2025 08:49

schustmi added 3 commits October 27, 2025 09:27

Merge branch 'develop' into feature/dynamic-pipelines

34aeb41

Dynamic config

7956099

Maybe solution for step configs

226b65f

schustmi force-pushed the feature/dynamic-pipelines branch from 575843d to 226b65f October 27, 2025 05:23

DB migration

d745345

schustmi force-pushed the feature/dynamic-pipelines branch 5 times, most recently from 597b9ce to a83c54c October 28, 2025 02:25

Cleanup

60e9358

schustmi force-pushed the feature/dynamic-pipelines branch 2 times, most recently from e82964c to 093ee9a October 28, 2025 07:06

Misc fixes

b0b6162

schustmi force-pushed the feature/dynamic-pipelines branch from 093ee9a to b0b6162 October 28, 2025 07:43

Fix DAG

ec7bae4

schustmi force-pushed the feature/dynamic-pipelines branch from cffc807 to ec7bae4 October 28, 2025 08:24

Merge branch 'develop' into feature/dynamic-pipelines

ddc9e32

schustmi added 3 commits November 3, 2025 16:46

Validate dynamic pipeline source resolving

aa6552e

Integration test fixes

71ae283

Linting

e902eaf

schustmi force-pushed the feature/dynamic-pipelines branch from d5f90e8 to e902eaf November 3, 2025 09:11

schustmi added 3 commits November 3, 2025 17:36

Fix DB migration

9646d24

Docstring fix

23ca5eb

Merge branch 'develop' into feature/dynamic-pipelines

c082950

Json-Andriopoulos reviewed Nov 3, 2025

src/zenml/execution/step/utils.py (resolved)

Json-Andriopoulos reviewed Nov 3, 2025

src/zenml/execution/step/utils.py (resolved)

Json-Andriopoulos reviewed Nov 3, 2025

src/zenml/models/v2/core/pipeline_run.py (outdated, resolved)

schustmi added 5 commits November 3, 2025 22:06

Fix wrong test

e3653ce

Rename attribute

4e1d4e6

Improve future indexing

fe0804a

Use context var to prevent pipeline execution

f1ed44a

Artifact cache

87e2a54

schustmi requested a review from Json-Andriopoulos November 4, 2025 07:55

CI fixes

b7ddcd3

schustmi force-pushed the feature/dynamic-pipelines branch from 5303878 to b7ddcd3 November 4, 2025 08:33

schustmi added 5 commits November 6, 2025 11:27

Merge branch 'develop' into feature/dynamic-pipelines

190bd16

Fix source resolving

f12c0f3

Raise sub-exceptions

0618c05

Refactor source resolving

bf57f0d

Allow runtime configuration

1dbb402

schustmi force-pushed the feature/dynamic-pipelines branch from 9e4cb68 to 1dbb402 November 6, 2025 09:51


3 participants
