Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Epic] Release testing automation and frequent upstream releases #2485

Open
1 of 5 tasks
ktam3 opened this issue Oct 17, 2024 · 4 comments
Open
1 of 5 tasks

[Epic] Release testing automation and frequent upstream releases #2485

ktam3 opened this issue Oct 17, 2024 · 4 comments
Assignees
Labels
enhancement New feature or request stale

Comments

@ktam3
Copy link

ktam3 commented Oct 17, 2024

Goal:

  • We need integration testing on different hardware platforms as they come online.
  • We need to have a better connection between our team and the platform team.
  • We need to automate this testing
  • To help alleviate last min testing, we should have frequent upstream releases, so testing can start early.

Currently:

  • @JamesKunstle is working on a smoketest to indicate what needs to happen for a functional run.
  • He won't have the capacity to take on long term, so we need to be able to build on what James is building
    • We need to have a better handoff so it's automated and running constantly so we always have feedback on how the logic we're building is interacting with drivers and lower levels.
  • Nathan is working a lot of expanding the e2e test suite for upstream

Notes:

  • As we get configs for different HW, the difference with upstream should only be models and lora layers. Bulk should happen in upstream runners. Could be a good smoke test.
  • For 'full integration e2e' code compatibility, the root of that should be upstream first testing that will trickle into other configs.
    • Recently created a CI story that was full e2e on actual correct codepaths.
    • GPU enabled upstream runners that run either on most PRs or big ones that are every 12-24h… can expand that to use some abbreviate version of the system profiles we're working on - should cover a lot of the base functionlity for accelerated workflow.

Release testing automation will be split into two phases:
Phase 1:

Phase 2:

  • Actual Automation

Frequent upstream releases:

  • Release more ilab z streams on a regular cadence
@ktam3 ktam3 added the enhancement New feature or request label Oct 17, 2024
@ktam3 ktam3 changed the title Release testing automation and frequent upstream releases [Epic] Release testing automation and frequent upstream releases Oct 17, 2024
@ktam3
Copy link
Author

ktam3 commented Oct 17, 2024

@JamesKunstle - i know there's work that you're currently doing related to this. Please link those here when you get a chance in the Phase 1 section!

@nathan-weinberg
Copy link
Member

What is this actually meant for? The description isn't very descriptive

@ktam3
Copy link
Author

ktam3 commented Oct 18, 2024

I just updated with more info on the discussion

Copy link

This issue has been automatically marked as stale because it has not had activity within 60 days. It will be automatically closed if no further activity occurs within 30 days.

@github-actions github-actions bot added the stale label Jan 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request stale
Projects
None yet
Development

No branches or pull requests

6 participants