@avamingli
Contributor

For a parallel join, we may benefit from gathering SegmentGeneralWorkers to SingleQE.

Gather(SegmentGeneralWorkers) join SingleQE, return join locus: SingleQE. We may win if the join is parallel-aware: with SingleQE on the inner side, there is a chance to generate a parallel join under SingleQE. In that case both sides are parallel and we may benefit. See ex 5_P_2_2 in cbdb_parallel.sql.
If the join is not parallel-aware, the benefit is uncertain, and a simple test showed lower performance, e.g. a parallel scan on a replicated table joined with a SingleQE that is a non-parallel plan.

SingleQE join Gather(SegmentGeneralWorkers), return join locus: SingleQE. We may win by gathering to SingleQE whether or not the join is parallel-aware. SingleQE is on the outer side and there could be a parallel plan under it, so we may benefit even without a shared hash table. Let the planner decide.
See ex 2_P_5_2 in cbdb_parallel.sql.

The final locus may be elided to Entry if possible.
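
The two cases above can be summarized as a small decision rule. The following is only an illustrative sketch in Python, not the planner's actual C code: the `Locus` enum and the function `gathered_join_locus` are hypothetical names invented here, and the function only answers whether a gathered path is worth offering to the planner, which still chooses by cost.

```python
from enum import Enum, auto
from typing import Optional


class Locus(Enum):
    # Hypothetical stand-ins for the two locus kinds discussed above.
    SEGMENT_GENERAL_WORKERS = auto()
    SINGLE_QE = auto()


def gathered_join_locus(outer: Locus, inner: Locus,
                        parallel_aware: bool) -> Optional[Locus]:
    """Return the join locus if gathering SegmentGeneralWorkers to
    SingleQE should be considered, or None if it should not."""
    if outer is Locus.SEGMENT_GENERAL_WORKERS and inner is Locus.SINGLE_QE:
        # Gather(SegmentGeneralWorkers) join SingleQE:
        # only considered when the join is parallel-aware.
        return Locus.SINGLE_QE if parallel_aware else None
    if outer is Locus.SINGLE_QE and inner is Locus.SEGMENT_GENERAL_WORKERS:
        # SingleQE join Gather(SegmentGeneralWorkers):
        # considered regardless of parallel-awareness.
        return Locus.SINGLE_QE
    # Other locus combinations are outside the scope of this sketch.
    return None
```

The sketch leaves out the final step in which the resulting locus may be elided to Entry.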

Authored-by: Zhang Mingli [email protected]

fix #ISSUE_Number


Change logs

Describe your change clearly, including what problem is being solved or what feature is being added.

If it breaks backward or forward compatibility, please clarify.

Why are the changes needed?

Describe why the changes are necessary.

Does this PR introduce any user-facing change?

If yes, please clarify the previous behavior and the change this PR proposes.

How was this patch tested?

Please detail how the changes were tested, including manual tests and any relevant unit or integration tests.

Contributor's Checklist

Here are some reminders and checklists before/when submitting your pull request, please check them:

  • Make sure your Pull Request has a clear title and commit message. You can take git-commit template as a reference.
  • Sign the Contributor License Agreement as prompted for your first-time contribution (one-time setup).
  • Learn the coding contribution guide, including our code conventions, workflow and more.
  • List your communication in the GitHub Issues or Discussions (if any, or if needed).
  • Document changes.
  • Add tests for the change
  • Pass make installcheck
  • Pass make -C src/test installcheck-cbdb-parallel
  • Feel free to request review and approval from the cloudberrydb/dev team when your PR is ready🥳

@avamingli avamingli self-assigned this Dec 11, 2023
@avamingli avamingli force-pushed the enable_single_join_segment_general_workers branch from f8081e1 to 429c978 Compare May 8, 2024 02:46
Contributor

@gfphoenix78 gfphoenix78 left a comment

+1

@avamingli avamingli force-pushed the enable_single_join_segment_general_workers branch from 429c978 to b11cb71 Compare August 6, 2024 13:55
@avamingli
Contributor Author

+1

Thanks for the review!

@avamingli
Contributor Author

Pushed, thanks.

@avamingli avamingli merged commit e0714d1 into apache:main Aug 7, 2024
@avamingli avamingli deleted the enable_single_join_segment_general_workers branch August 7, 2024 07:09