Skip to content

add pluggable loadFormat support for model loaders #318

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 6 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
69 changes: 68 additions & 1 deletion .github/workflows/test.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -108,13 +108,80 @@ jobs:
shell: bash
run: ./chart-dependencies/ci-deps.sh

- name: Debug cluster state before chart testing
if: steps.list-changed.outputs.changed == 'true'
run: |
echo "=== DEBUG: Cluster state before chart-testing ==="
kubectl cluster-info
kubectl get nodes -o wide
kubectl get namespaces
kubectl get pods --all-namespaces
kubectl get services --all-namespaces
kubectl get crd | grep -E "(gateway|inference)" || echo "No gateway/inference CRDs found"
echo ""

echo "=== DEBUG: Checking Istio installation ==="
kubectl get pods -n istio-system || echo "No istio-system namespace"
kubectl get svc -n istio-system || echo "No services in istio-system"
echo ""

echo "=== DEBUG: Checking ingress controller ==="
kubectl get pods -A | grep ingress || echo "No ingress pods found"
kubectl get svc -A | grep ingress || echo "No ingress services found"
echo ""

- name: Run chart-testing (install)
if: steps.list-changed.outputs.changed == 'true'
env:
TARGET_BRANCH: ${{ github.event.pull_request.base.ref || 'main^' }}
run: |
echo "=== DEBUG: Starting chart-testing install ==="
echo "Target branch: $TARGET_BRANCH"
echo "Chart-testing config:"
cat ct-install.yaml
echo ""

echo "=== DEBUG: Running ct install with maximum verbosity ==="
set -x
ct install \
--debug \
--config ct-install.yaml \
--upgrade \
--target-branch "$TARGET_BRANCH"
--target-branch "$TARGET_BRANCH" || {
echo ""
echo "=== DEBUG: Chart-testing failed, checking helm releases ==="
helm list --all-namespaces || true
echo ""
echo "=== DEBUG: Checking for any test namespaces ==="
kubectl get namespaces | grep -E "(test|chart)" || echo "No test namespaces found"
echo ""
echo "=== DEBUG: Checking for any failed pods ==="
kubectl get pods --all-namespaces --field-selector=status.phase=Failed || echo "No failed pods found"
echo ""
echo "=== DEBUG: Recent events ==="
kubectl get events --all-namespaces --sort-by='.lastTimestamp' | tail -30
exit 1
}

- name: Debug cluster state after chart testing failure
if: failure() && steps.list-changed.outputs.changed == 'true'
run: |
echo "=== DEBUG: Cluster state after failure ==="
kubectl get pods --all-namespaces -o wide
kubectl get services --all-namespaces
kubectl get events --all-namespaces --sort-by='.lastTimestamp' | tail -20
echo ""

echo "=== DEBUG: Chart-related pods and services ==="
kubectl get pods -A | grep -E "(llm-d|test)" || echo "No chart-related pods found"
kubectl get svc -A | grep -E "(llm-d|test)" || echo "No chart-related services found"
echo ""

echo "=== DEBUG: Pod logs for failed pods ==="
for pod in $(kubectl get pods -A --field-selector=status.phase=Failed -o jsonpath='{range .items[*]}{.metadata.namespace}{" "}{.metadata.name}{"\n"}{end}'); do
namespace=$(echo $pod | cut -d' ' -f1)
name=$(echo $pod | cut -d' ' -f2)
echo "--- Logs for $namespace/$name ---"
kubectl logs -n $namespace $name --previous || kubectl logs -n $namespace $name || echo "No logs available"
echo ""
done
6 changes: 6 additions & 0 deletions _typos.toml
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# Configuration for typos spell checker
# Allow specific sequences that appear in base64 encoded SVG icons
[default.extend-words]
# These are valid base64 sequences, not typos
"OT" = "OT"
"Ba" = "Ba"
105 changes: 102 additions & 3 deletions charts/llm-d/Chart.yaml
Original file line number Diff line number Diff line change
@@ -1,10 +1,109 @@
---
apiVersion: v2
name: llm-d
type: application
version: 1.0.18
version: 1.0.19
appVersion: "0.1"
icon: 
description: llm-d is a Kubernetes-native high-performance distributed LLM inference framework
# typos:disable
icon: >-

VVRGLTgiIHN0YW5kYWxvbmU9Im5vIj8+CjwhLS0gQ3JlYXRlZCB3aXRoIElua3Nj
YXBlIChodHRwOi8vd3d3Lmlua3NjYXBlLm9yZy8pIC0tPgoKPHN2ZwogICB3aWR0
aD0iODBtbSIKICAgaGVpZ2h0PSI4MG1tIgogICB2aWV3Qm94PSIwIDAgODAuMDAw
MDA0IDgwLjAwMDAwMSIKICAgdmVyc2lvbj0iMS4xIgogICBpZD0ic3ZnMSIKICAg
eG1sOnNwYWNlPSJwcmVzZXJ2ZSIKICAgeG1sbnM9Imh0dHA6Ly93d3cudzMub3Jn
LzIwMDAvc3ZnIgogICB4bWxuczpzdmc9Imh0dHA6Ly93d3cudzMub3JnLzIwMDAv
c3ZnIj48ZGVmcwogICAgIGlkPSJkZWZzMSIgLz48cGF0aAogICAgIHN0eWxlPSJm
aWxsOiM0ZDRkNGQ7ZmlsbC1vcGFjaXR5OjE7c3Ryb2tlOiM0ZDRkNGQ7c3Ryb2tl
LXdpZHRoOjIuMzQyOTk7c3Ryb2tlLW1pdGVybGltaXQ6MTA7c3Ryb2tlLWRhc2hh
cnJheTpub25lIgogICAgIGQ9Im0gNTEuNjI5Nyw0My4wNzY3IGMgLTAuODI1NCww
IC0xLjY1MDgsMC4yMTI4IC0yLjM4ODEsMC42Mzg0IGwgLTEwLjcyNjksNi4xOTI2
IGMgLTEuNDc2MywwLjg1MjIgLTIuMzg3MywyLjQzANDUgLTIuMzg3Myw0LjEzNTQg
diAxMi4zODQ3IGMgMCwxLjcwANDEgMC45MTI4LDMuMjg1NCAyLjM4ODUsNC4xMzU4
IGwgMTAuNzI1Nyw2LjE5MTggYyAxLjQ3NDcsMC44NTEzIDMuMzAxNSwwLjg1MTMg
NC43NzYyLDAgTCA2NC43NDQ3LDcwLjU2MzIgQyA2Ni4yMjEsNjkuNzExIDY3LjEz
Miw2OC4xMjg4IDY3LjEzMiw2Ni40Mjc4IFYgNTQuMDQzMSBjIDAsLTEuNzAzNiAt
MC45MTIzLC0zLjI4NDggLTIuMzg3MywtNC4xMzU0IGwgLThlLTQsLTRlLTQgLTEw
LjcyNjEsLTYuMTkyMiBjIC0wLjczNzQsLTAuNDI1NiAtMS41NjI3LC0wLjYzODQg
LTIuMzg4MSwtMC42Mzg0IHogbSAwLDMuNzM5NyBjIDAuMTc3NCwwIDAuMzU0Niww
LjA0NyAwLjUxNjcsMC4xNDA2IGwgMTAuNzI3Niw2LjE5MjUgNGUtNCw0ZS00IGMg
MC4zMTkzLDAuMTg0IDAuNTE0MywwLjUyMDMgMC41MTQzLDAuODkzMiB2IDEyLjM4
NDcgYyAwLDAuMzcyMSAtMC4xOTI3LDAuNzA3MyAtMC41MTU1LDAuODkzNiBsIC0x
MC43MjY4LDYuMTkyMiBjIC0wLjMyANDMsMC4xODcyIC0wLjcwOTEsMC4xODcyIC0x
LjAzMzQsMCBsIC0xMC43MjcyLC02LjE5MjYgLThlLTQsLTRlLTQgQyA0MC4wNjU3
LDY3LjEzNjcgMzkuODcwNyw2Ni44MDA3IDM5Ljg3MDcsNjYuNDI3OCBWIDU0LjA0
MzEgYyAwLC0wLjM3MiAwLjE5MjcsLTAuNzA3NyAwLjUxNTUsLTAuODk0IEwgNTEu
MTEzLDQ2Ljk1NyBjIDAuMTYyMSwtMC4wOTQgMC4zMzkzLC0wLjE0MDYgMC41MTY3
LC0wLjE0MDYgeiIKICAgICBpZD0icGF0aDEyMiIgLz48cGF0aAogICAgIGlkPSJw
YXRoMTI0IgogICAgIHN0eWxlPSJmaWxsOiM0ZDRkNGQ7ZmlsbC1vcGFjaXR5OjE7
c3Ryb2tlOiM0ZDRkNGQ7c3Ryb2tlLXdpZHRoOjIuMzQyOTk7c3Ryb2tlLWxpbmVj
YXA6cm91bmQ7c3Ryb2tlLW1pdGVybGltaXQ6MTA7c3Ryb2tlLWRhc2hhcnJheTpu
b25lIgogICAgIGQ9Im0gNjMuMzg5MDE4LDM0LjgxOTk1OCB2IDItLjM0NDE3NSBh
IDEuODcxNTQzLDEuODcxNTQzIDAgMCAwIDEuODcxNTQxLDEuODcxNTQxIDEuODcx
NTQzLDEuODcxNTQzIDAgMCAwIDEuODcxNTQxLC0xLjg3MTU0MSBWIDMyLjY1BODY0
NyBaIiAvPjxwYXRoCiAgICAgc3R5bGU9ImZpbGw6IzdmMzE3ZjtmaWxsLW9wYWNp
dHk6MTtzdHJva2U6IzdmMzE3ZjtzdHJva2Utd2lkdGg6Mi4yNDM7c3Ryb2tlLW1p
dGVybGltaXQ6MTA7c3Ryb2tlLWRhc2hhcnJheTpub25lO3N0cm9rZS1vcGFjaXR5
OjEiCiAgICAgZD0ibSAzNi43MzQyLDI4LjIzNDggYyAwLjQwOTcsMC43MTY1IDEu
MDA0MiwxLjMyNzMgMS43Mzk4LDEuNzU2MSBsIDEwLjcwMSw2LjIzNzIgYyAxLjQ3
MjcsMC44NTg0IDMuMjk4NCwwLjg2MzcgNC43NzUsMC4wMTkgbCAxMC43NTA2LC02
LjE0ODUgYyAxLjQ3OTMsLTAuODQ2IDIuMzk4NywtMi40MjM0IDIuNDA0NCwtNC4x
MjY3IGwgMC4wNSwtMTIuMzg0NCBjIDAuMDEsLTEuNzAyOSAtMC45LC0zLjI4ODYg
LTIuMzcxMiwtNC4xANDYxIEwgNTQuMDgzMiwzLjIwNCBDIDUyLjYxMDUsMi4zNDU1
IDUwLjc4NDcsMi4zANDAyIDQ5LjMwODIsMy4xODUgTCAzOC41NTc1LDkuMzMzNSBj
IC0xLjQ3ODksMC44NDU4IC0yLjM5ODQsMi40MjI3IC0yLjQwANDYsNC4xMjU0IGwg
MTBlLTUsOGUtNCAtMC4wNSwxMi4zODUgYyAwLDAuODUxNSAwLjItMTYsMS42NzM1
IDAuNjMxNCwyLjM5IHogbSAzLjI0NjMsLTEuODU2NiBjIC0wLjA4OCwtMC4xNTQg
LTAuMTM1MywtMC4zMzExIC0wLjEzANDUsLTAuNTE4MyBsIDAuMDUsLTEyLjM4NjYg
MmUtNCwtNmUtNCBjIDAsLTAuMzY4NCAwLjE5NjMsLTAuNzA0NyAwLjUyLC0wLjg4
OTkgTCA1MS4xNjY5LDYuNDM0MyBjIDAuMzItOSwtMC4xODQ3IDAuNzA5NywtMC4x
ODM4IDEuMDMxNiwwIGwgMTAuNzAwNiw2LjIzNzQgYyAwLjMyMzUsMC4xODg1IDAu
NTE0NSwwLjUyMjYgMC41MTMsMC44OTcgbCAtMC4wNSwxMi4zODYyIHYgOWUtNCBj
IDAsMS4zNjg0IC0wLjE5NiwwLjcwANDUgLTAuNTE5NywwLjg4OTYgbCAtMTAuNzUw
Niw2LjE0ODUgYyAtMC4zMjMsMC4xODQ3IC0wLjcxMDEsMC4xODQgLTEuMDMyLDAg
TCA0MC4zNTkyLDI2Ljc1NjcgYyAtMC4xNjE3LC0wLjA5NCAtMC4yOTA1LC0wLjIt
NDggLTAuMzc4NSwtMC4zNzg4IHoiCiAgICAgaWQ9InBhdGgxMjYiIC8+PHBhdGgK
ICAgICBpZD0icGF0aDEyOSIKICAgICBzdHlsZT0iZmlsbDojN2YzMTdmO2ZpbGwt
b3BhY2l0eToxO3N0cm9rZTojN2YzMTdmO3N0cm9rZS13aWR0aDoyLjI0MztzdHJv
a2UtbGluZWNhcDpyb3VuZDtzdHJva2UtbWl0ZXJsaW1pdDoxMDtzdHJva2UtZGFz
aGFycmF5Om5vbmU7c3Ryb2tlLW9wYWNpdHk6MSIKICAgICBkPSJNIDIzLjcyODgz
NSwyMi4xMjYxODUgANDMuMTI0OTI0LDExLjAzMzItIEEgMS44NzE1ANDMsMS44NzE1
ANDMgMCAwIDAgANDMuODIwMzkxLDguNDc5NDY2NiAxLjg3MTU0MywxLjg3MTU0MyAw
IDAgMCA0MS4yNjY2MzcsNy43ODM5OTk4IEwgMTkuOTk0ANDAxLDE5Ljk0OTk2NyBa
IiAvPjxwYXRoCiAgICAgc3R5bGU9ImZpbGw6IzdmMzE3ZjtmaWxsLW9wYWNpdHk6
MTtzdHJva2U6IzdmMzE3ZjtzdHJva2Utd2lkdGg6Mi4yNDM7c3Ryb2tlLW1pdGVy
bGltaXQ6MTA7c3Ryb2tlLWRhc2hhcnJheTpub25lO3N0cm9rZS1vcGFjaXR5OjEi
CiAgICAgZD0ibSAzMS40NzY2LDQ4LjQ1MDQgYyAwLjQxANDUsLTAuNzEzOCAwLjY0
NSwtMS41MzQ0IDAuNjQ3MiwtMi4zODU4IGwgMC4wMzIsLTEyLjM4NiBjIDAsLTEu
NzA0NiAtMC45MDY0LC0zLjI4NyAtMi4zNzczLC00LjE0MTIgTCAxOS4wNjg4LDIz
LjMxOCBjIC0xLjQ3MzcsLTAuODU1OCAtMy4yOTk1LC0wLjg2MDUgLTQuNzc2LC0w
LjAxMSBMIDMuNTUyMSwyOS40NzI3IGMgLTEuNDc2OCwwLjg0NzggLTIuMzk0Miwy
LjQyNzUgLTIuMzk4Niw0LjEzMDQgbCAtMC4wMzIsMTIuMzg1NyBjIDAsMS43MDQ3
IDAuOTA2MywzLjI4NzEgMi4zNzcyLDQuMTQxMiBsIDEwLjcwOTgsNi4yMTk1IGMg
MS40NzMyLDAuODU1NSAzLjI5ODcsMC44NjA2IDQuNzc1LDAuMDEyIGwgNmUtNCwt
NGUtNCAxMC43ANDEyLC02LjE2NTggYyAwLjczODUsLTAuANDIzOSAxLjMzNjksLTEu
MDMwOCAxLjc1MTUsLTEuNzQ0NSB6IG0gLTMuMjM0LC0xLjg3ODEgYyAtMC4wODks
MC4xNTM0IC0wLjIxODYsMC4yODMxIC0wLjM4MSwwLjM3NjMgbCAtMTAuNzQyMyw2
LjE2NyAtNmUtNCwyZS00IGMgLTAuMzE5NCwwLjE4MzYgLTAuNzA4MiwwLjE4MzQg
LTEuMDMwNywwIEwgNS4zNzgyLDQ2Ljg5NjQgQyA1LjA1NjUsANDYuNzA5NiA0Ljg2
MzMsANDYuMzc0NSA0Ljg2ANDMsANDYuMDAxOSBsIDAuMDMyLC0xMi4zODU4IGMgMCwt
MC4zNzQ0IDAuMTk0MiwtMC43MDcyIDAuNTE4OSwtMC44OTM2IGwgMTAuNzQyMiwt
Ni4xNjY3IDZlLTQsLTRlLTQgYyAwLjMxOTQsLTAuMTgzNyAwLjcwNzgsLTAuMTgz
NyAxLjAzMDMsMCBsIDEwLjcwOTgsNi4yMTk0IGMgMC4zMjE3LDAuMTg2OSAwLjUx
NTIsMC41MjIxIDAuNTE0MiwwLjg5NDggbCAtMC4wMzIsMTIuMzg1NiBjIC00ZS00
LDAuMTg3MiAtMC4wNDksMC4zNjQxIC0wLjEzNzksMC41MTc0IHoiCiAgICAgaWQ9
InBhdGgxMzkiIC8+PHBhdGgKICAgICBpZD0icGF0aDE0MSIKICAgICBzdHlsZT0i
ZmlsbDojN2YzMTdmO2ZpbGwtb3BhY2l0eToxO3N0cm9rZTojN2YzMTdmO3N0cm9r
ZS13aWR0aDoyLjI0MztzdHJva2UtbGluZWNhcDpyb3VuZDtzdHJva2UtbWl0ZXJs
aW1pdDoxMDtzdHJva2UtZGFzaGFycmF5Om5vbmU7c3Ryb2tlLW9wYWNpdHk6MSIK
ICAgICBkPSJNIDMyLjcxMTI5OSw2Mi43NjU3ANDYgMTMuMzg4OTY5LDUxLjU0NDc5
OCBhIDEuODcxNTQzLDEuODcxNTQzIDAgMCAwIC0yLjU1ODI5NSwwLjY3ODU2OCAx
Ljg3MTU0MywxLjg3MTU0MyAwIDAgMCAwLjY3ODU2OSwyLjU1ODI5NiBsIDIxLjE5
MTM0NCwxMi4zMDYzMyBaIiAvPjwvc3ZnPgo=
# typos:enable
description: >-
llm-d is a Kubernetes-native high-performance distributed LLM inference
framework
keywords:
- vllm
- llm-d
Expand Down
Loading
Loading