[WIP] Add support for custom DeepSeek modelling in ACL Graph mode #677

yiz-liu · 2025-04-27T08:18:28Z

Enable ACL Graph mode for custom DeepSeek modelling.

None.

Test it with any DeepSeek model.

Signed-off-by: Yizhou Liu <[email protected]>

ganyi1996ppo · 2025-04-29T08:07:01Z

vllm_ascend/models/deepseek_v2.py

+
+        return final_hidden_states.view(num_tokens, hidden_dim)
+
+    def _forward(self, hidden_states: torch.Tensor) -> torch.Tensor:


Have we tested torchair on this, access attn_metadata in global context may cause some graph related issue, better confirmed it with torchair

ganyi1996ppo · 2025-04-29T08:14:29Z

vllm_ascend/attention/mla_v1.py

-
-        return output_padded
+        if trace_flag:
+            torch.ops.vllm.unified_ascend_mla_attention_with_output(


Can we use the attention interface unified_ascend_attention_with_output which already registered in attention_v1.py

github-actions bot added module:ops module:core labels Apr 27, 2025

[Feature] Add support for custom DeepSeek modeling in ACL Graph mode

593b87e

Signed-off-by: Yizhou Liu <[email protected]>

yiz-liu force-pushed the feat-deepseek-graph branch from 3b0c106 to 593b87e Compare April 27, 2025 08:21

ganyi1996ppo reviewed Apr 29, 2025

View reviewed changes

Provide feedback


		return final_hidden_states.view(num_tokens, hidden_dim)

		def _forward(self, hidden_states: torch.Tensor) -> torch.Tensor: