Skip to content

PoolingCuVSResourceManager with memory availability #133242

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 4 commits into
base: es-gpu
Choose a base branch
from

Conversation

ldematte
Copy link
Contributor

@ldematte ldematte commented Aug 20, 2025

This PR expands #132670 to account for GPU memory availability: a requesting thread can obtain a resource only if there are enough GPU physical resources (in this first iteration, memory) available. Otherwise the requesting thread will be blocked and signalled again to re-check conditions are satisfied when memory is freed (when another thread release a resource).

Depends on rapidsai/cuvs#1267 which is currently under review (hence the draft state)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant