Skip to content

Conversation

@PMylon
Copy link
Contributor

@PMylon PMylon commented Sep 3, 2025

This commit adapts the LDS layout selection logic in Stream Pipeliner so that we pick a common swizzled shared memory layout with vecSize = max kWidth of all users.
Fixes https://github.com/ROCm/triton-internal/issues/1089.
Fixes https://github.com/ROCm/triton-internal/issues/1155.

@PMylon PMylon force-pushed the enable_async_cp_for_dO branch from 0ede830 to 5e05ad2 Compare September 4, 2025 07:12
@PMylon PMylon force-pushed the enable_async_cp_for_dO branch from 5e05ad2 to 3e7959d Compare September 4, 2025 11:23
@PMylon PMylon force-pushed the enable_async_cp_for_dO branch from 74b01ce to 8fc179f Compare September 4, 2025 18:01
@PMylon PMylon changed the title [WIP] Rework pipeliner LDS layout selection logic & enable async copy [AMD] Redesign pipeliner LDS layout selection logic Sep 5, 2025
@PMylon PMylon changed the title [AMD] Redesign pipeliner LDS layout selection logic [AMD] Redesign stream pipeliner LDS layout selection logic Sep 5, 2025
@PMylon PMylon marked this pull request as ready for review September 5, 2025 12:42
@antiagainst
Copy link
Collaborator

Can we also add a new mimimal lit test with different users to check the new logic?

@antiagainst antiagainst merged commit 8b792c8 into triton-lang:main Sep 8, 2025
9 checks passed
yiqian1 pushed a commit to yiqian1/triton that referenced this pull request Sep 9, 2025
…ng#8053)

This commit adapts the LDS layout selection logic in Stream Pipeliner so
that we pick a common swizzled shared memory layout with vecSize = max
kWidth of all users.
ZelboK pushed a commit to ZelboK/triton that referenced this pull request Sep 9, 2025
…ng#8053)

This commit adapts the LDS layout selection logic in Stream Pipeliner so
that we pick a common swizzled shared memory layout with vecSize = max
kWidth of all users.
jayfurmanek pushed a commit to ROCm/triton that referenced this pull request Sep 26, 2025
…ng#8053)

This commit adapts the LDS layout selection logic in Stream Pipeliner so
that we pick a common swizzled shared memory layout with vecSize = max
kWidth of all users.
agron911 added a commit to agron911/triton that referenced this pull request Feb 5, 2026
…out selection logic (#8053)'

Summary:
This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
 ---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 5, 2026
…out selection logic (#8053)'

Summary:
This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
 ---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 5, 2026
…out selection logic (#8053)'

Summary:
This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
 ---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)' (facebookexperimental#847)

Summary:

This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Reviewed By: stashuk-olek

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)' (facebookexperimental#847)

Summary:

This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Reviewed By: stashuk-olek

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)' (facebookexperimental#847)

Summary:

This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Reviewed By: stashuk-olek

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)' (facebookexperimental#847)

Summary:

This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Reviewed By: stashuk-olek

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)'

Summary:
This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
 ---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Differential Revision: D92336368
agron911 added a commit to agron911/triton that referenced this pull request Feb 10, 2026
…out selection logic (#8053)' (facebookexperimental#847)

Summary:

This is a cherry-pick of an upstream PR: triton-lang/triton#8053

Upstream commit message:
```
> [AMD] Redesign stream pipeliner LDS layout selection logic (#8053)

> This commit adapts the LDS layout selection logic in Stream Pipeliner so
> that we pick a common swizzled shared memory layout with vecSize = max
> kWidth of all users.
```

***Do not remove the following line from this commit***
Reactor Cherry-pick Revision: 8b792c8
---

This diff was generated by running:
```
buck run fbcode//triton/tools/reactor:reactor -- cherrypick --num-commits 3
```

Reviewed By: stashuk-olek

Differential Revision: D92336368
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants