Skip to content

Conversation

@Xuanwo
Copy link
Member

@Xuanwo Xuanwo commented Oct 16, 2025

Which issue does this PR close?

Closes #6625

Rationale for this change

This PR adds TailCutLayer so users can use this layer to cancel long tail of requests like slower than P95.

What changes are included in this PR?

Are there any user-facing changes?


This PR was primarily authored with Amp using Claude-Sonnet-4.5 and then hand-reviewed by me. I AM responsible for every change made in this PR. I aimed to keep it aligned with our goals, though I may have missed minor issues. Please flag anything that feels off, I'll fix it quickly.

@Xuanwo Xuanwo requested a review from tisonkun October 16, 2025 12:45
Signed-off-by: Xuanwo <[email protected]>
@dosubot dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. releases-note/feat The PR implements a new feature or has a title that begins with "feat" labels Oct 16, 2025
Copy link
Member

@PsiACE PsiACE left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Interesting, LGTM

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Oct 17, 2025
@Xuanwo Xuanwo merged commit 3931832 into main Oct 17, 2025
717 of 718 checks passed
@Xuanwo Xuanwo deleted the auto-cancel branch October 17, 2025 10:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

lgtm This PR has been approved by a maintainer releases-note/feat The PR implements a new feature or has a title that begins with "feat" size:XL This PR changes 500-999 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

new feature: auto cancel and retry long tail request

2 participants