-
Notifications
You must be signed in to change notification settings - Fork 143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retrieve remote model id from registration response in IT to avoid flaky #3244
Merged
xinyual
merged 1 commit into
opensearch-project:main
from
zane-neo:remove-wait-register-for-remote-model-in-IT
Dec 3, 2024
Merged
Retrieve remote model id from registration response in IT to avoid flaky #3244
xinyual
merged 1 commit into
opensearch-project:main
from
zane-neo:remove-wait-register-for-remote-model-in-IT
Dec 3, 2024
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Signed-off-by: zane-neo <[email protected]>
zane-neo
requested review from
b4sjoo,
dhrubo-os,
jngz-es,
model-collapse,
rbhavna,
ylwu-amzn,
Zhangxunmt,
austintlee,
HenryL27 and
xinyual
as code owners
November 29, 2024 06:19
zane-neo
had a problem deploying
to
ml-commons-cicd-env
November 29, 2024 06:19 — with
GitHub Actions
Failure
zane-neo
temporarily deployed
to
ml-commons-cicd-env
November 29, 2024 06:19 — with
GitHub Actions
Inactive
zane-neo
temporarily deployed
to
ml-commons-cicd-env
November 29, 2024 07:16 — with
GitHub Actions
Inactive
zane-neo
had a problem deploying
to
ml-commons-cicd-env
December 2, 2024 20:48 — with
GitHub Actions
Failure
zane-neo
temporarily deployed
to
ml-commons-cicd-env
December 2, 2024 21:21 — with
GitHub Actions
Inactive
dhrubo-os
approved these changes
Dec 3, 2024
xinyual
approved these changes
Dec 3, 2024
opensearch-trigger-bot bot
pushed a commit
that referenced
this pull request
Dec 3, 2024
…aky (#3244) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671)
dhrubo-os
pushed a commit
that referenced
this pull request
Dec 3, 2024
…aky (#3244) (#3249) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671) Co-authored-by: zane-neo <[email protected]>
brianf-aws
added a commit
to brianf-aws/ml-commons
that referenced
this pull request
Dec 5, 2024
Following opensearch-project#3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]>
5 tasks
dhrubo-os
pushed a commit
that referenced
this pull request
Dec 6, 2024
…MTest (#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following #3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]>
brianf-aws
added a commit
to brianf-aws/ml-commons
that referenced
this pull request
Dec 9, 2024
…MTest (opensearch-project#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following opensearch-project#3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> (cherry picked from commit 1a659c8)
brianf-aws
added a commit
to brianf-aws/ml-commons
that referenced
this pull request
Dec 9, 2024
…MTest (opensearch-project#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following opensearch-project#3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> (cherry picked from commit 1a659c8)
tkykenmt
pushed a commit
to tkykenmt/ml-commons
that referenced
this pull request
Dec 15, 2024
…aky (opensearch-project#3244) Signed-off-by: zane-neo <[email protected]>
tkykenmt
pushed a commit
to tkykenmt/ml-commons
that referenced
this pull request
Dec 15, 2024
…MTest (opensearch-project#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following opensearch-project#3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]>
tkykenmt
pushed a commit
to tkykenmt/ml-commons
that referenced
this pull request
Dec 15, 2024
…aky (opensearch-project#3244) Signed-off-by: zane-neo <[email protected]> Signed-off-by: tkykenmt <[email protected]>
tkykenmt
pushed a commit
to tkykenmt/ml-commons
that referenced
this pull request
Dec 15, 2024
…MTest (opensearch-project#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following opensearch-project#3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> Signed-off-by: tkykenmt <[email protected]>
dhrubo-os
pushed a commit
that referenced
this pull request
Dec 31, 2024
…tegrationWithLLMTest (#3263) * Fixes Two Flaky IT classes RestMLGuardrailsIT & ToolIntegrationWithLLMTest (#3253) * fix uneeded call to get model_id for task api within RestMLGuardrailsIT Following #3244 this IT called the task api to check the model id again however this is redundant. Instead one can directly pull the model_id upon creating the model group. Manual testing was done to see that the behavior is intact, this should help reduce the calls within a IT to make it less flaky Signed-off-by: Brian Flores <[email protected]> * fix ToolIntegrationWithLLMTest model undeploy race condition Previously the test class attempted to delete a model without fully knowing if the model was undeployed in time. This change adds a waiting for 5 retries each 1 second to check the status of the model and only when undeployed will it proceed to delete the model. When the number of retries are exceeded it throws a error indicating a deeper problem. Manual testing was done to check that the model is undeployed by searching for the specific model via the checkForModelUndeployedStatus method. Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]> (cherry picked from commit 1a659c8) * add retry according to how many rest clients are in a IT cluster Signed-off-by: Brian Flores <[email protected]> * fix retry initialization The MAX_RETRIES variable had to wait for the cluster to form before it could call to get the cluster size Signed-off-by: Brian Flores <[email protected]> --------- Signed-off-by: Brian Flores <[email protected]>
opensearch-trigger-bot bot
pushed a commit
that referenced
this pull request
Jan 10, 2025
…aky (#3244) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671)
opensearch-trigger-bot bot
pushed a commit
that referenced
this pull request
Jan 10, 2025
…aky (#3244) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671)
opensearch-trigger-bot bot
pushed a commit
that referenced
this pull request
Jan 10, 2025
…aky (#3244) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671)
opensearch-trigger-bot bot
pushed a commit
that referenced
this pull request
Jan 10, 2025
…aky (#3244) Signed-off-by: zane-neo <[email protected]> (cherry picked from commit 1d30671)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Retrieve remote model id from registration response in IT to avoid flaky, an example is:
Related Issues
Resolves #[Issue number to be closed when this PR is merged]
Check List
--signoff
.By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.