[TMA] Enable unswizzled tma layouts #6238

peterbell10 · 2025-03-18T21:40:55Z

No description provided.

peterbell10 · 2025-03-18T21:41:33Z

include/triton/Dialect/TritonGPU/IR/TritonGPUAttrDefs.td

          swizzlingByteWidth = 32;
        } else {
-          llvm_unreachable("unsupported shared memory layout for MMAv3");
+          llvm_unreachable("unsupported NVMMA layout (MMAv3 or TMA)");


Not directly related, but seeing this MMAv3 message was a bit confusing on blackwell.

Yeah, it was dropped before the shared memory encoding refactoring

python/triton/language/semantic.py

peterbell10 · 2025-03-22T00:29:14Z

lib/Dialect/TritonNvidiaGPU/Transforms/OptimizeDescriptorEncoding.cpp

+  auto rank = tensorType.getRank();
+  if (rank == 1 || contigDimSizeInBytes < 32 || shape[rank - 2] < 8) {
    return ttg::SwizzledSharedEncodingAttr::get(ctx, 1, 1, 1, order, ctaLayout);
  }


Okay, now defaulting to unswizzled encodings if there are fewer than 8 rows of data.

peterbell10 · 2025-03-22T00:30:18Z

test/TritonNvidiaGPU/optimize_descriptor_encoding.mlir

+// CHECK-DAG: #[[NVMMA_32:.*]] = #ttg.nvmma_shared<{swizzlingByteWidth = 32, transposed = false, elementBitWidth = 8}>
+tt.func public @tma_scatter(%arg0: !tt.ptr<i8> {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: tensor<32xi32, #blocked>, %arg4: tensor<32x32xi8, #blocked1>) {
+  // CHECK: tt.make_tensor_descriptor {{.*}} : <i8>, <tensor<1x32xi8, #[[NVMMA_32]]>>
+  // CHECK: tt.descriptor_scatter {{.*}} : !tt.tensordesc<tensor<1x32xi8, #[[NVMMA_32]]>>, {{.*}}


This test ensures that gather/scatter still use swizzling despite the descriptor's block shape only having 1 row.

ThomasRaoux

LGTM

[TMA] Support unswizzled tma layouts

d013790

peterbell10 requested a review from ptillet as a code owner March 18, 2025 21:40

peterbell10 commented Mar 18, 2025

View reviewed changes

Fix ndim > 2

70a813b

peterbell10 requested a review from ThomasRaoux March 18, 2025 22:38

Remove 8-row requirements

839da8d

peterbell10 commented Mar 22, 2025

View reviewed changes

ThomasRaoux approved these changes Mar 22, 2025

View reviewed changes

peterbell10 merged commit 3887b80 into main Mar 22, 2025
8 checks passed

peterbell10 deleted the pb/unswizzled-tma branch March 22, 2025 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TMA] Enable unswizzled tma layouts #6238

[TMA] Enable unswizzled tma layouts #6238

Uh oh!

peterbell10 commented Mar 18, 2025

Uh oh!

peterbell10 Mar 18, 2025

Uh oh!

Jokeren Mar 18, 2025

Uh oh!

Uh oh!

peterbell10 Mar 22, 2025

Uh oh!

peterbell10 Mar 22, 2025

Uh oh!

ThomasRaoux left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[TMA] Enable unswizzled tma layouts #6238

[TMA] Enable unswizzled tma layouts #6238

Uh oh!

Conversation

peterbell10 commented Mar 18, 2025

Uh oh!

peterbell10 Mar 18, 2025

Choose a reason for hiding this comment

Uh oh!

Jokeren Mar 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

peterbell10 Mar 22, 2025

Choose a reason for hiding this comment

Uh oh!

peterbell10 Mar 22, 2025

Choose a reason for hiding this comment

Uh oh!

ThomasRaoux left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants