[Model][CI] Let more pooling models support v1 #21747

noooop · 2025-07-28T11:30:18Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

#21270 Let v1 support encoder-only models, theoretically all pooling models should support v1.

Let's check which models are not supported.

fixed

fix GteNewModel
fix JinaRobertaModel
fix ModernBertModel
fix NomicBertModel
float32 is supported by V1
Alibaba-NLP/gte-Qwen2-1.5B-instruct is supported by V1

address #21470 (comment)

cc @maxdebayser @DarkLight1337

Test Plan

Test Result

(Optional) Documentation Update

Signed-off-by: wang.yuqi <[email protected]>

gemini-code-assist

Code Review

This pull request aims to expand V1 support to more pooling models by removing V1-specific skips and workarounds in the tests and updating model implementations. The changes look good and are consistent with the goal.

I've found one potential issue: ModernBertForSequenceClassification is still marked as V0-only, while the tests are being updated to run it on V1. This will likely lead to test failures. Please see my detailed comment.

github-actions · 2025-07-28T11:39:56Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: wang.yuqi <[email protected]>

noooop · 2025-07-29T02:48:04Z

@DarkLight1337

Ready for review

vllm/model_executor/models/modernbert.py

vllm/model_executor/models/bert_with_rope.py

Isotr0py

Let's merge this PR first. Will keep investigating ModernBert CUDA graph issue.

DarkLight1337 · 2025-07-30T04:17:32Z

I think the API server failure is related to pooling model memory usage somehow...

DarkLight1337 · 2025-07-30T14:50:52Z

We are currently testing precompiled wheels in CI, maybe it's not working properly

Signed-off-by: wang.yuqi <[email protected]>

noooop · 2025-07-31T04:31:45Z

Merging main into this pr will cause the tests to fail.

@DarkLight1337 @Isotr0py

(╯‵□′)╯︵┻━┻

DarkLight1337 · 2025-07-31T06:53:36Z

LGTM now

DarkLight1337 · 2025-07-31T06:53:51Z

Actually wait, let me unblock pooling tests

noooop · 2025-07-31T08:46:21Z

@DarkLight1337

Compare this PR with #21929,

Merging main into this pr will cause the tests to fail.

#21964 did not solve this problem

DarkLight1337 · 2025-07-31T08:50:45Z

Let's just merge this PR then. It seems the tests pass

DarkLight1337 · 2025-07-31T08:50:58Z

#21964 did not solve this problem

Please comment on that PR

noooop · 2025-07-31T08:57:16Z

@DarkLight1337

Is it possible that some commit in main is conflicting with this PR?

DarkLight1337 · 2025-07-31T09:03:15Z

Let's see if the CI passes after merge

noooop · 2025-07-31T09:04:04Z

anyhow, this is best I can do,

Other issues are too complex for me, let's solve them in other PRs.

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: shuw <[email protected]>

Signed-off-by: wang.yuqi <[email protected]>

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: x22x22 <[email protected]>

Signed-off-by: wang.yuqi <[email protected]>

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: jingyu <[email protected]>

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: Noam Gat <[email protected]>

+ v1

c410f9e

Signed-off-by: wang.yuqi <[email protected]>

gemini-code-assist bot reviewed Jul 28, 2025

View reviewed changes

noooop added 3 commits July 28, 2025 20:15

+ config_updated

be1d139

Signed-off-by: wang.yuqi <[email protected]>

+ fix NomicBertModel

d30f507

Signed-off-by: wang.yuqi <[email protected]>

- max_num_seqs

bcd706a

Signed-off-by: wang.yuqi <[email protected]>

noooop marked this pull request as ready for review July 29, 2025 02:43

noooop requested review from DarkLight1337, ywang96, simon-mo, WoosukKwon, youkaichao, robertgshaw2-redhat, mgoin, tlrmchlsmth, houseroad and hmellor as code owners July 29, 2025 02:43

mergify bot added the qwen Related to Qwen models label Jul 29, 2025

noooop changed the title ~~[Model][CI] Have as many pooling models as possible support v1~~ [Model][CI] Let more pooling models support v1 Jul 29, 2025

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 29, 2025

DarkLight1337 reviewed Jul 29, 2025

View reviewed changes

vllm/model_executor/models/modernbert.py Show resolved Hide resolved

DarkLight1337 reviewed Jul 29, 2025

View reviewed changes

vllm/model_executor/models/bert_with_rope.py Show resolved Hide resolved

noooop mentioned this pull request Jul 29, 2025

[Model]: Fused MoE for nomic-embed-text-v2-moe #18321

Merged

noooop marked this pull request as draft July 29, 2025 11:09

DarkLight1337 reviewed Jul 29, 2025

View reviewed changes

vllm/model_executor/models/bert_with_rope.py Show resolved Hide resolved

noooop marked this pull request as ready for review July 29, 2025 15:33

noooop mentioned this pull request Jul 30, 2025

[Frontend] Add LLM.reward specific to reward models #21720

Merged

4 tasks

Isotr0py approved these changes Jul 30, 2025

View reviewed changes

noooop force-pushed the pooling_v1 branch from 437e14f to bcd706a Compare July 30, 2025 17:22

fix

2c4d932

Signed-off-by: wang.yuqi <[email protected]>

noooop changed the title ~~[Do not Merge] This pr might have triggered a CI bug~~ [Model][CI] Let more pooling models support v1 Jul 31, 2025

vllm-bot merged commit 2836dd7 into vllm-project:main Jul 31, 2025
69 of 71 checks passed

noooop mentioned this pull request Jul 31, 2025

For VLLM_USE_PRECOMPILED, only compiled .so files should be extracted #21964

Merged

4 tasks

noooop mentioned this pull request Jul 31, 2025

Update transformers to v4.55 #21931

Merged

noooop deleted the pooling_v1 branch July 31, 2025 11:57

noooop restored the pooling_v1 branch July 31, 2025 12:05

wenscarl pushed a commit to wenscarl/vllm that referenced this pull request Aug 4, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

0dff2af

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: shuw <[email protected]>

wenscarl pushed a commit to wenscarl/vllm that referenced this pull request Aug 4, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

a825da5

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: shuw <[email protected]>

juuice-lee pushed a commit to juuice-lee/vllm-moe.code that referenced this pull request Aug 5, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

f3afc63

Signed-off-by: wang.yuqi <[email protected]>

noooop deleted the pooling_v1 branch August 5, 2025 11:36

vadiklyutiy pushed a commit to CentML/vllm that referenced this pull request Aug 5, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

8ba8d76

Signed-off-by: wang.yuqi <[email protected]>

x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

57d70cb

Signed-off-by: wang.yuqi <[email protected]>

x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

14a266e

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: x22x22 <[email protected]>

x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

f97b7ed

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: x22x22 <[email protected]>

npanpaliya pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Aug 6, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

f0e27ae

Signed-off-by: wang.yuqi <[email protected]>

jingyu-ml pushed a commit to jingyu-ml/vllm that referenced this pull request Aug 8, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

255d980

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: jingyu <[email protected]>

jinzhen-lin pushed a commit to jinzhen-lin/vllm that referenced this pull request Aug 9, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

f81fa24

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>

noamgat pushed a commit to noamgat/vllm that referenced this pull request Aug 9, 2025

[Model][CI] Let more pooling models support v1 (vllm-project#21747)

3b2b270

Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: Noam Gat <[email protected]>

Uh oh!

[Model][CI] Let more pooling models support v1 #21747

[Model][CI] Let more pooling models support v1 #21747

Uh oh!

Conversation

noooop commented Jul 28, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

github-actions bot commented Jul 28, 2025

Uh oh!

noooop commented Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Isotr0py left a comment

Choose a reason for hiding this comment

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

noooop commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DarkLight1337 commented Jul 31, 2025

Uh oh!

DarkLight1337 commented Jul 31, 2025

Uh oh!

noooop commented Jul 31, 2025

Uh oh!

DarkLight1337 commented Jul 31, 2025

Uh oh!

DarkLight1337 commented Jul 31, 2025

Uh oh!

Uh oh!

noooop commented Jul 31, 2025

Uh oh!

DarkLight1337 commented Jul 31, 2025

Uh oh!

noooop commented Jul 31, 2025

Uh oh!

Uh oh!

noooop commented Jul 28, 2025 •

edited by github-actions bot

Loading

noooop commented Jul 31, 2025 •

edited

Loading