Commit ae1a5dd
Fix invalid horizontal stride 0 for destination in predicated sub-DW operations
Fixed invalid horizontal stride 0 for destination operand in emitLSCVectorLoad_subDW() function. When doUniformLoad is true, the code was setting hStride to 0 for destination operand, which violates Intel GPU architecture constraints and causes instruction validation errors for i8/i16 predicated load/store operations. Modified the logic to use default stride for uniform loads and appropriate stride (2 or 4) for non-uniform loads, eliminating the invalid stride 0 case.1 parent 460d83f commit ae1a5dd
File tree
2 files changed
+5
-3
lines changed- IGC/Compiler
- CISACodeGen
- tests/EmitVISAPass
2 files changed
+5
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17717 | 17717 | | |
17718 | 17718 | | |
17719 | 17719 | | |
17720 | | - | |
17721 | | - | |
| 17720 | + | |
| 17721 | + | |
| 17722 | + | |
| 17723 | + | |
17722 | 17724 | | |
17723 | 17725 | | |
17724 | 17726 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| |||
0 commit comments