Skip to content

Add LLMInferenceService support#5

Open
wseaton wants to merge 2 commits into
mainfrom
feat/llminferenceservice-support
Open

Add LLMInferenceService support#5
wseaton wants to merge 2 commits into
mainfrom
feat/llminferenceservice-support

Conversation

@wseaton
Copy link
Copy Markdown
Owner

@wseaton wseaton commented Apr 1, 2026

Adds serving.kserve.io/v1alpha1 LLMInferenceService as a scalable resource kind (l flag).

  • New resources::llminferenceservice CRD definition
  • Owner ref chain walking, scale-to-zero via minReplicas: 0
  • Full test coverage (equality, hashing, Meta trait, event generation, resource flags)

wseaton added 2 commits March 31, 2026 21:52
- minimal hand-written CRD type with spec.replicas and serde flatten
- label fast-path: app.kubernetes.io/part-of=llminferenceservice
- owner-ref chain: Pod -> RS -> Deployment -> LLMInferenceService
- scale patches both spec.replicas and spec.prefill.replicas to zero
- new 'l' flag in enabled-resources (default "drsinl")
- 10 unit tests covering bitflags, conversion, equality, hashing, meta
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant