Skip to content

Add scorer based on active loras #43

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 90 commits into
base: dev
Choose a base branch
from

Conversation

mayabar
Copy link
Collaborator

@mayabar mayabar commented Apr 22, 2025

Given a model name (in case this is a LoRA adapter) each pod will get score based on whether the lora is currently in the running/waiting loras list.

mayabar and others added 30 commits April 10, 2025 14:50
…ill be the target for a request. Session affinity scorer added
- Rename SessionId to SessionID
- Remove datastore from scoreTargets, add datastore to SessionAffinityScorer
- Rename ScoredPod to PodScore
…f ScoreMng

- If some specific scorer failed to score pods - just log the problem, skip it and continue to the next scorer
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Bumps [golang.org/x/net](https://github.com/golang/net) from 0.37.0 to 0.38.0.
- [Commits](golang/net@v0.37.0...v0.38.0)

---
updated-dependencies:
- dependency-name: golang.org/x/net
  dependency-version: 0.38.0
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
…odules/golang.org/x/net-0.38.0

Bump golang.org/x/net from 0.37.0 to 0.38.0
Add scorers support in scheduler
…ev-deployments

First iteration of development deployments & environments
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
…e-builds

fix: basic container image builds for linux
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Etai Lev Ran <elevran@gmail.com>
…on_yaml

empty top level kustomization.yaml - make CICD happy
clubanderson and others added 24 commits April 18, 2025 21:04
Patch Istio deployment to use 1Gi of mem
Setup the Istio service to be a NodePort service and not a ClusterIP service
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
…t-env

Add openshift-infra dev env deployment
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
Signed-off-by: Shane Utt <shaneutt@linux.com>
@mayabar mayabar requested a review from shmuelk April 22, 2025 13:09
@mayabar mayabar changed the base branch from main to dev April 22, 2025 13:11
@elevran
Copy link
Collaborator

elevran commented Apr 27, 2025

please rebase/resubmit with less clutter - the PR pulled in too many files from other PRs

@@ -104,36 +105,58 @@ func (p *PodMetricsClientImpl) promToPodMetrics(

// Handle LoRA metrics (only if all LoRA MetricSpecs are present)
if p.MetricMapping.LoraRequestInfo != nil {
loraMetrics, err := p.getLatestLoraMetric(metricFamilies)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: Please try to be more consistent in use of lora, Lora, LoRA (avoiding symbol exports of course...)

errs = multierr.Append(errs, err)

if err == nil {
updated.ActiveModels = make(map[string]int)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

q: is the int value useful or used to implement a Set using a hashtable?
If Set, consider using map[string]struct{} (see here for a complete example)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would it be possible to put all LoRA related stuff in its own file/directory?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Prefer a directory per filter/scorer or other organization that keeps new code from the GIE original directories whenever possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants