[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement.#29546
Open
yewentao256 wants to merge 2 commits intomainfrom
Open
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement.#29546yewentao256 wants to merge 2 commits intomainfrom
yewentao256 wants to merge 2 commits intomainfrom