Add a proposal for integrating volume limits into cluster autoscaler #5031

gnufied · 2025-01-09T16:03:31Z

One-line PR description:

Integrate CSI Volume attach limits with cluster autoscaler #5030

Other comments:

sftim · 2025-01-12T16:46:14Z

This looks like a draft to me @gnufied

/retitle [WIP] Add a proposal for integrating volume limits into cluster autoscaler

Feel free to amend the PR title if that's not correct.

k8s-triage-robot · 2025-04-12T17:16:34Z

The Kubernetes project currently lacks enough contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle stale
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2025-05-12T18:06:23Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle rotten
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

gnufied · 2025-06-10T20:35:46Z

/remove-lifecycle-rotten

drewhagen

Hello @gnufied @dobsonj 👋, 1.34 Enhancements team here.

A friendly heads up that these fields should be ready to be updated to satisfy the requirement for the upcoming Enhancements Freeze. Please see more details in my comment on the KEP issue.

Thanks!

keps/sig-storage/5030-attach-limit-autoscaler/kep.yaml

Co-authored-by: Drew Hagen <[email protected]>

gnufied · 2025-06-11T19:08:36Z

/assign @kannon92 @jsafrane

kannon92 · 2025-06-11T19:43:36Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+
+- Since a node does not come up with a CSI driver typically, usually too many pods get scheduled on a node, which may not be supportable by the node in the first place. This leads to bunch of pods, just stuck.
+
+Once cluster-autoscaler is aware of CSI volume attach limits, we can fix kubernete's builtin scheduler to not schedule pods to nodes that don't have CSI driver installed.


This motivation is confusing to me.

You want cluster-autoscaler to be aware of volumes that can be attached to a node.

Why are you mentioning CSI driver on a node not being present? That seems to be unrelated to the number of volumes?

Okay, you are calling out two different problems and targeting them together.

kannon92 · 2025-06-11T19:50:09Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+Consider including folks who also work outside the SIG or subproject.
+-->
+
+## Design Details


Reading this design detail, I am thinking of #5328.

hmm, it does have some overlap but that KEP is IMO different and I don't see how it integrates with autoscaler yet (may be they need to integrate those node capabilities with autoscaler at some point).

kannon92 · 2025-06-11T19:51:38Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+
+#### Alpha
+
+- All of the planned code changes for alpha will be done in cluster-autoscaler and not in k/k repository.


"not in k/k repository"?

So this first alpha stage is only cluster-autoscaler and your k/k PR will not be in scope for alpha?

So, the change in main k/k repository is pretty small and can only be enabled when autoscaler changes have already gone GA. In fact, to ensure some backward compatibility concerns, we have to be careful not to enable scheduler changes sooner.

I have covered this in Notes and Caveats section.

keps/sig-storage/5030-attach-limit-autoscaler/README.md

kannon92 · 2025-06-11T19:53:00Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+  CRI or CNI may require updating that component before the kubelet.
+-->
+
+## Production Readiness Review Questionnaire


Given the new requirements on KEPs (promoting to beta should require feature to be functionally complete), I ask that you fill up the PRR as much as you can.

kannon92 · 2025-06-11T19:54:02Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+
+We also propose that, if given node is not reporting any installed CSI drivers, we do not schedule pods that need CSI volumes to that node.
+
+The proposed change is small and a draft PR is available here - https://github.com/kubernetes/kubernetes/pull/130702


This PR does not reference the feature gate you mention here?

Will this work be gated by the same feature gate or are these two separate features?

Yes, that is just POC. First we want to fix this in cluster-autoscaler before we can target kube-scheduler, crux of the code will not change. It will be pretty small change in main k/k repository.

kannon92 · 2025-06-11T19:55:01Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+Even if applying deprecation policies, they may still surprise some users.
+-->
+
+### Monitoring Requirements


try to answer these.

kannon92 · 2025-06-11T19:55:27Z

keps/sig-storage/5030-attach-limit-autoscaler/README.md

+### Scalability
+
+<!--
+For alpha, this section is encouraged: reviewers should consider these questions


Please fill this section out.

kannon92

Please try to fill out the PRR as much as you can.

Co-authored-by: Kevin Hannon <[email protected]>

keps/sig-storage/5030-attach-limit-autoscaler/kep.yaml

keps/sig-storage/5030-attach-limit-autoscaler/README.md

kannon92 · 2025-06-12T13:36:21Z

/hold

Since this is a storage PR, I think a hold is sufficient to get ACK/Approval from autoscaling and scheduling.

Please unhold once you have a review from autoscaling and scheduling.

k8s-ci-robot · 2025-06-16T16:36:07Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: gnufied
Once this PR has been reviewed and has the lgtm label, please assign deads2k, gjtempleton for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/prod-readiness/OWNERS
keps/sig-autoscaling/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

deads2k

An order dependent design makes disablement and re-enablement hard. What can we do to avoid an order dependent design?

deads2k · 2025-06-18T15:37:45Z