Skip to content

[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #144

[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup

[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #144

build-and-run (opencl)

succeeded Dec 4, 2025 in 6m 58s