@AlexAUT AlexAUT commented Nov 3, 2025

Adds `async_copy.global_to_local` to Gluon for `gfx1250` and forwards `commit_group` and `wait_group` from `CDNA4`. Note that we do not reuse `global_load_to_shared` from `CDNA4`, because we have relaxed constraints on memory layouts and on completion order with regard to loads to registers.
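
For readers unfamiliar with the commit/wait pattern, here is a rough pure-Python model of the bookkeeping. It is not the Gluon API: the class `AsyncCopyQueue`, its method signatures, and the tile names are hypothetical, and it assumes the usual semantics where `wait_group(n)` returns once at most `n` committed groups are still outstanding and groups retire oldest-first.

```python
from collections import deque


class AsyncCopyQueue:
    """Toy model of commit-group / wait-group bookkeeping (NOT the Gluon API)."""

    def __init__(self):
        self.uncommitted = []   # copies issued since the last commit_group()
        self.pending = deque()  # committed groups still in flight, oldest first
        self.completed = []     # copies known to have landed in local memory

    def global_to_local(self, name):
        # Issue an asynchronous copy; it joins the currently open group.
        self.uncommitted.append(name)

    def commit_group(self):
        # Close the open group so that wait_group() can track it.
        self.pending.append(self.uncommitted)
        self.uncommitted = []

    def wait_group(self, num_outstanding):
        # Assumption: block until at most `num_outstanding` groups remain,
        # retiring the oldest groups first.
        while len(self.pending) > num_outstanding:
            self.completed.extend(self.pending.popleft())


# Double-buffering sketch: issue two tiles, then wait only for the first.
q = AsyncCopyQueue()
q.global_to_local("tile0")
q.commit_group()
q.global_to_local("tile1")
q.commit_group()
q.wait_group(1)
print(q.completed)
q.wait_group(0)
print(q.completed)
```

Waiting with a non-zero count is what lets a kernel overlap the next tile's copy with compute on the current tile.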

@AlexAUT AlexAUT marked this pull request as ready for review November 3, 2025 16:28
@AlexAUT AlexAUT requested a review from peterbell10 as a code owner November 3, 2025 16:28
@antiagainst antiagainst merged commit 9ec56a2 into triton-lang:main Nov 3, 2025
9 checks passed
tmoreau89 pushed a commit to tmoreau89/triton that referenced this pull request Dec 1, 2025