Ability to detect Sub-Pipeline status in Run Completion Events #442

Open
3 tasks
aidandunlop opened this issue Jan 21, 2025 · 0 comments

Comments

Contributor

aidandunlop commented Jan 21, 2025

Is your feature request related to a problem? Please describe

KFP Operator supports Pipeline Dependencies, which allow users to split larger machine learning pipelines into sub-pipelines that can then be reused by multiple dependent pipelines. For example, a "data creation" pipeline could ingest and transform a dataset ready for training, and multiple other pipelines could then reuse the dataset it produces. This reduces the duration of the dependent pipelines.

The KFP Operator also provides Run Completion Events, which let users react to pipeline events and include details about artefacts created by training pipelines.

At the moment, Run Completion Events do not indicate whether a run belongs to a sub-pipeline or to a larger dependent pipeline. Client components that react to events have to either know what each pipeline does, or assume that all pipelines are part of a larger dependent pipeline. Some clients might only want to react to events from larger dependent pipelines, rather than sub-pipelines. For example, a client might continually serve a model produced by a training pipeline by reacting to events when a new model is pushed. Such a client will want to ignore sub-pipelines that do not push a serving model, but at the moment it needs to know in advance which pipelines are sub-pipelines.

Describe the solution you would like

Run Completion Events should be populated with a field describing whether the run was for a sub-pipeline or a larger dependent pipeline.
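
To illustrate how a client might use such a field, here is a minimal sketch. The field name `pipelineType`, its values (`"sub-pipeline"`, `"dependent"`), and the other payload fields shown are assumptions made for this example and do not reflect the operator's actual event schema.

```python
from typing import Any


def should_react(event: dict[str, Any]) -> bool:
    """Return True only for successful runs of larger dependent pipelines.

    `pipelineType` is a hypothetical field; its name and values are
    assumptions for illustration only.
    """
    return (
        event.get("status") == "succeeded"
        and event.get("pipelineType") == "dependent"
    )


# Hypothetical, simplified event payloads.
events = [
    {"pipelineName": "data-creation", "status": "succeeded", "pipelineType": "sub-pipeline"},
    {"pipelineName": "training", "status": "succeeded", "pipelineType": "dependent"},
]

# A serving client would only act on the dependent pipeline's event.
to_serve = [e["pipelineName"] for e in events if should_react(e)]
```

With a field like this, the client no longer needs a hard-coded list of which pipelines are sub-pipelines.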

Describe alternative solutions you have considered

There might be a programmatic way to determine whether a run was part of a sub-pipeline or not, which should be investigated.

Acceptance Criteria

  • Run Completion Events contain details about the type of pipeline the run derives from

Additional context

  • Can sub-pipelines be detected automatically, e.g. by traversing the runConfigurations field from triggers?
  • Should we add the ability to add a type to artifacts? e.g. servedModel, dataset etc. This might allow us to detect a sub-pipeline
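
As a rough sketch of the first question above, automatic detection could treat any run configuration that is referenced in another run configuration's triggers as a sub-pipeline. The example assumes the trigger relationships have already been read into a plain mapping from each run configuration name to the names it lists under its triggers; the function name and the sample data are hypothetical.

```python
def find_sub_pipelines(triggers: dict[str, list[str]]) -> set[str]:
    """Return the names that at least one other run configuration depends on.

    `triggers` maps a run configuration name to the run configuration
    names listed in its triggers (an assumed, simplified representation).
    """
    return {
        upstream
        for name, upstreams in triggers.items()
        for upstream in upstreams
        if upstream != name  # ignore accidental self-references
    }


# Hypothetical setup: two training pipelines reuse one data pipeline.
triggers = {
    "data-creation": [],
    "training-a": ["data-creation"],
    "training-b": ["data-creation"],
}

sub_pipelines = find_sub_pipelines(triggers)
```

One limitation of this approach: a pipeline in the middle of a chain is both a sub-pipeline and a dependent pipeline, so a single type field may not capture every case.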