Use pmaddubsw for non-RDom horizontal widening adds #7440

abadams · 2023-03-20T17:28:03Z

We were rewriting the following into a pmaddubsw:

RDom r(0, 2);
f(x) += cast<uint16_t>(g(2*x + r));
f.update().atomic().vectorize(r).vectorize(x, 8);

But not the equivalent unrolled version, which is more likely to be written:

f(x) = cast<uint16_t>(g(2*x)) + g(2*x + 1);

This pattern shows up in downsampling code.

This PR makes the unrolled form use pmaddubsw as well. It also adds support for horizontal widening adds of i16 to i32 using pmaddwd, and adds some missing AVX-512 variants of horizontal adds using pmaddubsw.
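For readers unfamiliar with the instructions, here is a rough scalar model of why pmaddubsw and pmaddwd can implement a horizontal widening add when the second operand is a vector of ones. This is only an illustrative sketch in plain C++, not Halide code and not code from this PR; the names model_pmaddubsw and model_pmaddwd are made up for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Scalar model of pmaddubsw (hypothetical helper, not a Halide API).
// a holds unsigned bytes, b holds signed bytes. Adjacent products are
// summed and the sum is saturated to the int16 range.
std::vector<int16_t> model_pmaddubsw(const std::vector<uint8_t> &a,
                                     const std::vector<int8_t> &b) {
    assert(a.size() == b.size() && a.size() % 2 == 0);
    std::vector<int16_t> out(a.size() / 2);
    for (size_t i = 0; i < out.size(); i++) {
        int sum = int(a[2 * i]) * int(b[2 * i]) +
                  int(a[2 * i + 1]) * int(b[2 * i + 1]);
        out[i] = static_cast<int16_t>(std::max(-32768, std::min(32767, sum)));
    }
    return out;
}

// Scalar model of pmaddwd (hypothetical helper, not a Halide API).
// Both inputs are int16. Adjacent products are summed into int32,
// wrapping on overflow (which cannot happen when b is all ones).
std::vector<int32_t> model_pmaddwd(const std::vector<int16_t> &a,
                                   const std::vector<int16_t> &b) {
    assert(a.size() == b.size() && a.size() % 2 == 0);
    std::vector<int32_t> out(a.size() / 2);
    for (size_t i = 0; i < out.size(); i++) {
        int64_t sum = int64_t(a[2 * i]) * b[2 * i] +
                      int64_t(a[2 * i + 1]) * b[2 * i + 1];
        // Wrap to 32 bits, as the instruction does.
        out[i] = static_cast<int32_t>(static_cast<uint32_t>(sum));
    }
    return out;
}
```

With b set to all ones, each output lane is g(2*x) + g(2*x + 1). For pmaddubsw the largest possible sum is 255 + 255 = 510, so the saturation never triggers and the int16 result has the same bits as the uint16 widening add.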

rootjalex · 2023-03-21T02:10:10Z

src/FindIntrinsics.cpp

+            // Rewrite combinations of deinterleaves into horizontal ops
+            rewrite(widening_add(slice(x, 0, 2, lanes), slice(x, 1, 2, lanes)),
+                    h_add(cast(op->type.with_lanes(lanes * 2), x), lanes)) ||
+            rewrite(widening_add(slice(x, 1, 2, lanes), slice(x, 0, 2, lanes)),
+                    h_add(cast(op->type.with_lanes(lanes * 2), x), lanes)) ||


I think it might be useful to have a mutator that attempts to perform this operation more generally, for other operations as well?
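As a sanity check on the two rules quoted above, the following plain C++ sketch models the identity they rely on: a widening add of the even and odd lanes of x equals a horizontal add of the widened x, in either operand order. The helpers here (strided_slice, widen, widening_add, horizontal_add) are hypothetical stand-ins written for this example; they only mimic the argument order seen in the diff and are not the Halide implementations.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Take `lanes` elements of x starting at `begin`, stepping by `stride`.
std::vector<uint8_t> strided_slice(const std::vector<uint8_t> &x,
                                   size_t begin, size_t stride, size_t lanes) {
    std::vector<uint8_t> out(lanes);
    for (size_t i = 0; i < lanes; i++) {
        out[i] = x[begin + i * stride];
    }
    return out;
}

// Cast every lane from uint8 to uint16.
std::vector<uint16_t> widen(const std::vector<uint8_t> &x) {
    return std::vector<uint16_t>(x.begin(), x.end());
}

// Lane-wise add where each operand is widened before the addition.
std::vector<uint16_t> widening_add(const std::vector<uint8_t> &a,
                                   const std::vector<uint8_t> &b) {
    assert(a.size() == b.size());
    std::vector<uint16_t> out(a.size());
    for (size_t i = 0; i < a.size(); i++) {
        out[i] = static_cast<uint16_t>(uint16_t(a[i]) + uint16_t(b[i]));
    }
    return out;
}

// Sum groups of adjacent lanes so that the result has `out_lanes` lanes.
std::vector<uint16_t> horizontal_add(const std::vector<uint16_t> &x,
                                     size_t out_lanes) {
    assert(out_lanes > 0 && x.size() % out_lanes == 0);
    size_t group = x.size() / out_lanes;
    std::vector<uint16_t> out(out_lanes, 0);
    for (size_t i = 0; i < x.size(); i++) {
        out[i / group] = static_cast<uint16_t>(out[i / group] + x[i]);
    }
    return out;
}
```

Because addition is commutative, swapping the even and odd slices gives the same result, which is presumably why the diff lists both operand orders with the same right-hand side.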

rootjalex

LGTM, left a small comment about a future possible codegen improvement but it's not necessary for this PR imo

Use pmaddubsw for non-RDom horizontal widening adds

153f49b

rootjalex reviewed Mar 21, 2023

rootjalex approved these changes Mar 21, 2023

Merge remote-tracking branch 'origin/main' into abadams/use_pmaddubsw_for_downsample

eafc641

abadams merged commit 2a51f71 into main Mar 24, 2023

ardier pushed a commit to ardier/Halide-mutation that referenced this pull request Mar 3, 2024

Use pmaddubsw for non-RDom horizontal widening adds (halide#7440)

08a4226
