Commit 5655a9d
committed
Adds mask and bias copying from shared memory to registers
Implements tensor copying operations to move mask and bias data from shared memory to register storage before computation.
Creates register tensors with matching shapes and uses retiled copy views to efficiently transfer the data, preparing for subsequent processing steps.1 parent a092dfd commit 5655a9d
1 file changed
+8
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
621 | 621 | | |
622 | 622 | | |
623 | 623 | | |
| 624 | + | |
| 625 | + | |
| 626 | + | |
| 627 | + | |
| 628 | + | |
| 629 | + | |
| 630 | + | |
| 631 | + | |
624 | 632 | | |
625 | 633 | | |
626 | 634 | | |
| |||
0 commit comments