Skip to content

Commit 9139458

Browse files
jeejeeleejingyu
authored andcommitted
[Misc] Expand SUPPORTED_HIDDEN_SIZES for DeepEP low-latency kernels (vllm-project#21818)
Signed-off-by: Jee Jee Li <[email protected]> Signed-off-by: jingyu <[email protected]>
1 parent 34565e6 commit 9139458

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/layers/fused_moe/deepep_ll_prepare_finalize.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ class DeepEPLLPrepareAndFinalize(mk.FusedMoEPrepareAndFinalize):
4040

4141
# DeepEP low-latency kernels are compiled only for certain
4242
# specific hidden sizes.
43-
SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 7168]
43+
SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 6144, 7168]
4444

4545
def __init__(self,
4646
buffer: deep_ep.Buffer,

0 commit comments

Comments
 (0)