[1/N][Refactor] Remove the custom split_qkv #4042

gcanlin · 2025-11-06T16:33:52Z

What this PR does / why we need it?

Since vLLM PR #23190, I think we no longer need to worry about the extra communication overhead that TP adds on the encoder side. We can simply set --mm-encoder-tp-mode data to enable encoder-side DP. In that mode, tp_size equals 1 inside split_qkv, so all_gather_interleave is never triggered and the custom split_qkv is no longer needed.
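To illustrate the reasoning above, here is a minimal pure-Python sketch of the control flow, not the actual vLLM or vLLM Ascend code. The names split_qkv_sketch and CountingGather are hypothetical, and the gather stand-in simply returns its input and counts calls instead of performing a real all_gather_interleave. It only shows that the collective is skipped when tp_size is 1.

```python
from typing import Callable, List, Sequence, Tuple


class CountingGather:
    """Hypothetical stand-in for a collective: counts calls, returns input unchanged."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, shard: Sequence[float]) -> Sequence[float]:
        self.calls += 1
        return shard


def split_qkv_sketch(
    qkv: Sequence[float],
    tp_size: int,
    gather: Callable[[Sequence[float]], Sequence[float]],
) -> Tuple[List[float], List[float], List[float]]:
    """Toy model of split_qkv: the gather runs only when tp_size > 1."""
    if tp_size > 1:
        # Under TP each rank holds a shard, so a collective would be required here.
        qkv = gather(qkv)
    # Split the fused projection into equal q, k, v thirds.
    third = len(qkv) // 3
    return list(qkv[:third]), list(qkv[third:2 * third]), list(qkv[2 * third:])


gather = CountingGather()

# Encoder-side DP: the encoder sees tp_size == 1, so no collective is issued.
q, k, v = split_qkv_sketch([1, 2, 3, 4, 5, 6], tp_size=1, gather=gather)
calls_with_dp = gather.calls

# Encoder-side TP: the gather path is taken once per call.
split_qkv_sketch([1, 2, 3, 4, 5, 6], tp_size=2, gather=gather)
calls_with_tp = gather.calls
```

With tp_size=1 the counter stays at zero, which mirrors the claim that the communication path is never entered under encoder-side DP.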

I will complete it tomorrow.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.11.0
vLLM main: vllm-project/vllm@83f478b

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

github-actions · 2025-11-06T16:34:37Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

[1/N][Refactor] remove the override of split_qkv

f9003eb

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin changed the title ~~[1/N][Refactor] remove the override of split_qkv~~ [1/N][Refactor] Remove the override of split_qkv Nov 6, 2025

gcanlin changed the title ~~[1/N][Refactor] Remove the override of split_qkv~~ [1/N][Refactor] Remove the custom split_qkv Nov 6, 2025
