Roofline quantized conv3d/2d layer #3419
base: main
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3419. Note: links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures as of commit 44ce800 with merge base 73730e8.
(force-pushed from 8fba23d to 79cdaec)
```python
    and not is_sm_at_least_100()
)

if skip_conv_benchmarks:
```
I feel the conditions are a bit convoluted here. Maybe:

```python
if do_benchmarks:
    if op_name in ("conv2d", "conv3d") and not is_sm_at_least_100():
        print(f"warning: skipping {op_name} benchmarks, unsupported on this GPU")
    else:
        # can also move this part to a function to make it clearer
        ...
```
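If that inner part is factored out as the comment suggests, a minimal runnable sketch could look like the following. `run_benchmarks` is a hypothetical stand-in for the factored-out benchmark body, and I'm assuming `is_sm_at_least_100` is importable from `torchao.utils` like the other SM helpers:

```python
from torchao.utils import is_sm_at_least_100  # assumption: same module as other SM helpers


def maybe_run_benchmarks(op_name: str, do_benchmarks: bool) -> None:
    if not do_benchmarks:
        return
    # conv benchmarks need SM 10.0+; warn and skip instead of failing
    if op_name in ("conv2d", "conv3d") and not is_sm_at_least_100():
        print(f"warning: skipping {op_name} benchmarks, SM 10.0+ required")
        return
    run_benchmarks(op_name)  # hypothetical: the factored-out benchmark body
```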
```python
r_speedup = None
# use roofline model to estimate gemm time using equivalent GEMM dims
r_bf16_gemm_time_s = float(
    bf16_gemm_time_sympy.subs(M, gemm_M).subs(K, gemm_K).subs(N, gemm_N)
)
```
Are the memory operations of conv the same as for linear as well?
ao/torchao/testing/training/roofline_utils.py, line 332 in 0975a40:

```python
mem_gemm_time_s = (
```
As conv is an implicit GEMM, I'm assuming the memory operations for gemm and conv should be the same.
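For reference, a minimal sketch of the implicit-GEMM dimension mapping that the `gemm_M`/`gemm_K`/`gemm_N` substitution above relies on. The peak-FLOPS figure and conv shapes here are illustrative assumptions; the real roofline expressions live in `roofline_utils.py`:

```python
import sympy

M, K, N = sympy.symbols("M K N")

# illustrative compute-bound roofline: total MACs * 2 / peak FLOPS
PEAK_BF16_FLOPS = 989e12  # assumption: H100 dense bf16 tensor-core peak
bf16_gemm_time_sympy = (2 * M * K * N) / PEAK_BF16_FLOPS

# conv3d -> implicit GEMM:
#   M = batch * output spatial volume
#   K = in_channels * kernel volume
#   N = out_channels
n_batch, c_in, c_out = 4, 64, 128
d_out, h_out, w_out = 16, 32, 32
k_d, k_h, k_w = 3, 3, 3

gemm_M = n_batch * d_out * h_out * w_out
gemm_K = c_in * k_d * k_h * k_w
gemm_N = c_out

r_bf16_gemm_time_s = float(
    bf16_gemm_time_sympy.subs(M, gemm_M).subs(K, gemm_K).subs(N, gemm_N)
)
print(f"estimated bf16 gemm time: {r_bf16_gemm_time_s:.3e} s")
```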
```python
# real gemm benchmark time, also not added yet
# if enabled, also measure observed gemm time
# gemm benchmarks for conv not implemented, as conv uses implicit GEMM
```
We should run the conv ops, I think?
We're running the conv op in the benchmarks.
That's a bit different, I think; that one is measuring e2e speedup. We can do the same thing that linear is doing:
```python
def get_gemm_times(
```
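A conv counterpart to `get_gemm_times` could time the raw op directly with `torch.utils.benchmark`; the sketch below is one way to do that, with `get_conv_times` and the example shapes being hypothetical rather than the PR's actual API:

```python
import torch
import torch.nn.functional as F
from torch.utils import benchmark


def get_conv_times(x_shape, w_shape, dtype=torch.bfloat16):
    """Hypothetical analogue of get_gemm_times: time just the conv op."""
    x = torch.randn(*x_shape, device="cuda", dtype=dtype)
    w = torch.randn(*w_shape, device="cuda", dtype=dtype)
    timer = benchmark.Timer(
        stmt="F.conv3d(x, w)",
        globals={"F": F, "x": x, "w": w},
    )
    return timer.timeit(100).median  # seconds


# example: N=4, C_in=64, D=H=W=32 input with a 128-filter 3x3x3 kernel
conv_time_s = get_conv_times((4, 64, 32, 32, 32), (128, 64, 3, 3, 3))
```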