generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 24
Issues: kubernetes-sigs/gateway-api-inference-extension
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
What does this project think about "disaggregated prefilling"?
#166
opened Jan 7, 2025 by
spacewander
Changes to InferenceModel Should Trigger EndpointSlice Reconciliation
#151
opened Jan 6, 2025 by
danehans
Can we configure multiple inference pools by reconciling InferencePool?
#145
opened Jan 3, 2025 by
Kuromesi
The metrics refresh time might be much larger than the refreshMetricsInterval
#99
opened Dec 14, 2024 by
spacewander
Explore if we can simplify the filter tree in pkg/ext_proc/scheduling/scheduler.go
#89
opened Dec 10, 2024 by
liu-cong
Validate model/adapter is available on the model server before sending requests to a model server
#49
opened Nov 20, 2024 by
liu-cong
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-12-07.