Pull requests: vllm-project/vllm-ascend
[CI] Make AttentionBackend interface compatible to fix broken CI
module:tests
vllm-break
#1893
opened Jul 20, 2025 by
wangxiyuan
[2/4][Refactor] Refactor torchair utils
module:core
module:ops
module:quantization
module:tests
#1892
opened Jul 20, 2025 by
wangxiyuan
[Perf][MoE] Improve MoE multistream parallel performance.
module:ops
module:quantization
#1891
opened Jul 19, 2025 by
whx-sjtu
[Perf] Avoid performing index selection of sin/cos cache every layer
#1890
opened Jul 19, 2025 by
whx-sjtu
[1/4][Refactor] Refactor torchair worker
module:core
#1885
opened Jul 19, 2025 by
wangxiyuan
Add Custom Kernels For LoRA Performance
module:tests
#1884
opened Jul 19, 2025 by
taoxudonghaha
[CI]Add e2e test for 310p
e2e-310p-test
module:tests
ready-for-test
#1879
opened Jul 18, 2025 by
zhangxinyuehfad
Add super kernel in moe
module:core
module:ops
module:quantization
#1877
opened Jul 18, 2025 by
NNUCJ
[CI][main] Add qwen3_moe W8A8 quantized model test case
module:tests
#1876
opened Jul 18, 2025 by
zhoux77899
[CI][v0.9.1] Add qwen3_moe W8A8 quantized model test case
module:tests
#1874
opened Jul 18, 2025 by
zhoux77899
Add graph mode for Qwen2.5 and Qwen3
module:core
module:ops
module:tests
#1873
opened Jul 18, 2025 by
NicholasTao
[Doc]Add Chinese translation for documentation
documentation
#1870
opened Jul 18, 2025 by
aidoczh
[Feature] Optimize forward metadata collection across dp ranks
#1857
opened Jul 17, 2025 by
jianzs
[MoE][Dist] Fix Qwen MoE accuracy bug in DP scenario
#1856
opened Jul 17, 2025 by
MengqingCao
Draft
SwiftBalancer Zero OverHead Expert Movement
module:core
module:ops
module:quantization
#1855
opened Jul 17, 2025 by
raindaywhu
[WIP][V0.9.1] add support for flashcomm2 in qwen2
merge-conflicts
#1850
opened Jul 17, 2025 by
David9857
support cos_sin_cache prefetch for qwen2
merge-conflicts
#1846
opened Jul 17, 2025 by
Pr0Wh1teGivee
[BugFix] Fix a bug of running chunked-prefill with torchair. (#1378)
#1844
opened Jul 17, 2025 by
MengqingCao
cherry-pick vllm-project#1651 from v0.9.1-dev
merge-conflicts
module:tests
#1842
opened Jul 17, 2025 by
22dimensions
[Feature] Enable inference support for Deepseekr1-w8a8-MTP
module:quantization
#1834
opened Jul 17, 2025 by
Irving11-BKN
[1/N][CustomOp] Register RMSNorm instead of overwrite forward_oot
merge-conflicts
module:core
module:ops
module:tests
#1833
opened Jul 16, 2025 by
MengqingCao
[0.9.1][Dist][Bugfix] Fix mc2 process group to resolve self.cpu_group is None
#1831
opened Jul 16, 2025 by
MengqingCao
[Doc] Update support feature
documentation
#1828
opened Jul 16, 2025 by
wangxiyuan