[Speculative Decoding][MTP] Support static CacheKV C8 quantization and optimize memory usage #5155

Merged
freeliuzc merged 2 commits into PaddlePaddle:develop from freeliuzc:merge_mtp_c8_and_mem_opt
Nov 21, 2025

Commits

Commits on Nov 20, 2025