update Rollups process #775

matthiaskrgr · 2024-11-03T19:07:11Z

cc @compiler-errors @jieyouxu @GuillaumeGomez
Rendered

rustbot · 2024-11-03T19:07:15Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @Mark-Simulacrum (or someone else) some time within the next two weeks.

GuillaumeGomez · 2024-11-03T19:11:31Z

src/release/rollups.md


 ## Selecting Pull Requests

-The queue is sorted by rollup status. In general, a good rollup includes one or two `iffy` PRs (if available), a bunch of `maybe` (unmarked) PRs, and a large pile of `always` PRs. A rollup should never include `rollup=never` PRs.
+The queue is sorted by rollup status. In general, a good rollup contains a bunch of `maybe` (unmarked) PRs, and a large pile of `always` PRs. You can include one or two `iffy` PRs if you are confident that they will pass. 


I'm not sure including "iffy" PRs is a good idea in rollups. I personally tend to never includ them.

That's kinda the whole reason we have "iffy". If you never include them, then they may as well be marked "never".

Fair point.

I personally consider iffy PRs to be the whole point of rollups, tbh. Some PRs are clearly rollup=never if you are almost certain they will fail or if they have perf implications, but some of the iffy PRs are for the "it can fail, but we're not too sure" cases: it's great if it passes full CI, in which case we saved a bunch of time. It's also great if it fails full CI, in which case the failure time is amortized.

I adjusted this to say

You can include one or two iffy PRs to amortize the cost of testing full CI.

GuillaumeGomez

Apart from my comment, looks good to me. Thanks for the doc improvement!

src/release/rollups.md

compiler-errors · 2024-11-03T19:12:52Z

src/release/rollups.md


 ## Selecting Pull Requests

-The queue is sorted by rollup status. In general, a good rollup includes one or two `iffy` PRs (if available), a bunch of `maybe` (unmarked) PRs, and a large pile of `always` PRs. A rollup should never include `rollup=never` PRs.
+The queue is sorted by rollup status. In general, a good rollup contains a bunch of `maybe` (unmarked) PRs, and a large pile of `always` PRs. You can include one or two `iffy` PRs if you are confident that they will pass. 


That's kinda the whole reason we have "iffy". If you never include them, then they may as well be marked "never".

compiler-errors · 2024-11-03T19:30:37Z

src/release/rollups.md


-The actual absolute size of the rollup can depend based on experience, people new to making rollups might start with including 1 `iffy`, 4 `maybe`s, and 5 `always`s, but more experienced people might even make a rollup of 1-2 `iffy`s, 8 `maybe`s, and 10 `always`s! Massive rollups are rarely needed, but as your intuition grows you'll get better at judging risk when including PRs in a rollup.
+Don't try to make mega-rollups (15-20 PRs that merge half or more of the entire queue all at once) to keep the number of perf or bug regressions per merge as low as possible and keep potential regressions [bisectable].


Suggested change

Don't try to make mega-rollups (15-20 PRs that merge half or more of the entire queue all at once) to keep the number of perf or bug regressions per merge as low as possible and keep potential regressions [bisectable].

Limit the size of rollups, even if the queue is backed up -- large rollups run the risk of failing or merge conflicts, and smaller rollups keep potential regressions [bisectable]. On average, rollups are <N> PRs large, often varying from <N - M> to <N + M> depending on the number of `rollup=always` PRs that can be included.

choose some value for N and M.

note that thanks to unrolled builds, bisection can be done within a roll-up and cargo bisect rustc does that

note that thanks to unrolled builds, bisection can be done within a roll-up and cargo bisect rustc does that

But I think there is some time limit for rollup artifacts right? Something like 6 months?

There is¹ a time limit for all artifacts; for old regressions we only get nightly granularity AFAIR

Footnotes

or at least was, my source on this is only personal experience in bisection of old bugs, no idea where the current limits can be found out for certain ↩

Adopted this wording. I picked ~7 as the average size, with a usual bound of [5, 10] PRs.

Limit the size of rollups, even if the queue is backed up. Large rollups run the risk of failing
or merge conflicts, and smaller rollups keep potential regressions [bisectable]. On average, rollups
are 7 PRs large, often varying from 5 to 10 depending on the number of rollup=always PRs that can
be included.

tgross35 · 2024-11-03T23:37:16Z

src/release/rollups.md

    ```
    @bors r+ rollup=never p=5
    ````
-
+   where 5 is the number of PRs contained in your rollup.


I have been thinking that what the existing documentation says (rollups always p=5) might be better than p=pr_count that is usually done. There are a lot of cases where 7 PRs get rolled up with p=7 and sit in the queue for a number of hours, only to be leapfrogged by a p=8 rollup that contains much newer PRs. Seems to have a tendency to break FIFO.

Yes that's why I added the "Rollups should not overlap" rule.

we can just ping the contributors who are assigning rollup p>5 (I've done this a couple of times myself) and just point them to p=5 being the good default and having that respect queue order. It helps to elaborate on the reasoning for choice of p=5 and not arbitrary p=N where N is the number of PRs rolled up.

If we update the docs here and then still notice contributors assigning p > 5 to rollups, we can just ping them and point them to the update docs here.

(Thanks for the edits)

Just wanted to say, we should make these rollup advice as accessible and useful to onboard other contributors w/ r+ perms who may want to help to do rollups too.

(Somewhat unrelated, but also I feel like the p=??? advice is at times too vague to be useful, at least when I read some of the forge docs related to p=??? advice)

Yeah, I feel like the tool/subtree updates should receive same priority as rollups and all pinned at p=5 exactly, good point.

mmh imo it makes sense to prioritise subtree/module updates over rollups since doing a tool sync is often more complicated than rebasing PRs

Just, they should receive a consistent p between themselves, but otherwise it feels whatever to me.

Adjusted this to say sth like

Tools and subtree syncs should use p=5, like rollups, so they interleave between rollups, as tools and subtree syncs are typically more tricky to fix than rebasing PRs.

I'm not sure anymore if there is an easy way to generalize 😅

some subtrees are mostly self-contained and risk free to rollup, other (like clippy) are prone to creating merge conflicts and changes in tests, some, like cargo may also have test failures which the pre merge ci cannot catch.

some subtrees are always rollup=never'd by the author so they cannot be rolled up

jieyouxu · 2025-02-12T11:13:56Z

@matthiaskrgr sorry to also ping you here, do you intend to update this PR, or do you mind if I take over?

apiraino · 2025-06-20T09:15:02Z

I think it's fine by now if you @jieyouxu push this forward (when you have time, ofc). Let's not forget to co-author the patch.

@rustbot assign @jieyouxu

jieyouxu · 2025-06-22T03:05:25Z

r? @Mark-Simulacrum (release)

matthiaskrgr · 2025-06-25T20:54:44Z

ah I kinda forgot about this

I'll have another look this week.

@jieyouxu my general vibe is that I would rather avoid using --author/ committing something in someone elses name etc, and would rather have one commit by person A and another commit by person B that may or may not change/revert some content of the previous commit/ builds on top of it, just so that its clear what changed and who is the author of what. :)

edit: do you have a diff from mine to your changes? since you force pushed, the diff is all messed up :/

jieyouxu · 2025-06-26T00:26:07Z

edit: do you have a diff from mine to your changes? since you force pushed, the diff is all messed up :/

Sorry, I don't, I think I unintentionally squashed...

matthiaskrgr · 2025-06-26T05:29:33Z

Hm, I'm afraid something got messed up. when I try to reconstruct the diff between us, I can only see whitespace change, I also did not find the quote you mentioned here
#775 (comment)

edit: nevermind, I think I got it :D

compiler-errors · 2025-06-26T21:30:25Z

src/release/rollups.md

+Limit the size of rollups, *even if* the queue is backed up. Large rollups run the risk of failing
+or merge conflicts, and smaller rollups keep potential regressions [bisectable]. On average, rollups
+are 7 PRs large, often varying from 5 to 12 depending on the number of `rollup=always` PRs that can
+be included. Rollups of 15+ prs can be made for the sake of weeding out failures via try-jobs but should not be put into the merge queue as to not block other prs from being merged in the meantime


I object to this line you added. Please remove it.

Never have we ever used try jobs for this purpose, so I'm tempted to say you're just creating new procedures to justify closing other people's rollups.

if you wanna soften in the tone to encourage people to test rollups independently if they are at a high risk to merge, that's great, but you're asking for people to do extra work because of your personal preference of rollup size.

I have made (what I deem) "high-risk" rollups tested via try jobs in the past just to weed out failures and I found it pretty useful because then I don't unnecessarily block the queue.
What exactly are you concerned about?

Yeah, which is why I said:

if you wanna soften in the tone to encourage people to test rollups independently if they are at a high risk to merge, that's great, but you're asking for people to do extra work because of your personal preference of rollup size.

What I'm afraid of is you codifying justification to close other people's rollups because you think they're too large and pointing at this justification you added because of it.

Again, I think you can just discourage people from making large PRs unless they know what they're getting themselves into (and perhaps mention that it's usually unnecessary), and point them to try jobs if they really want to get things testing.

If you want to go the extra mile, it would be nice if you asked people who are working on bors2 for better automation to test "dist-like jobs" for this exact scenario, instead of just saying that the user needs to do it manually. Even better if we got CI to detect rollups and run a larger test suite on them so that they get weeded out before the next auto merge is done.

One more good idea if you really care about overlapping rollups not being made, at least not accidentally or without justification, is asking the bors2 folks if they could detect and mark PRs that are currently part of rollups that are currently mergeable + accepted. Then people could make disjoint rollups more easily; right now it's a pretty large ask to do. Paired with CI-level testing it would be very beneficial to weed out bad PRs and improve our merge rate in general.

compiler-errors · 2025-06-26T21:38:44Z

src/release/rollups.md

+   Tools and subtree syncs should use `p=5`, like rollups, so they interleave between rollups, as
+   tools and subtree syncs are typically more tricky to fix than rebasing PRs.
+   As the only exception, you shall chose a minimally higher priority (`x+1`) if a PR you include (such as a subtree sync) is already of `p=x`,
+   with the purpose of ensuring that the rollup is tested earlier than subtree sync it includes, as long as you don't skip over another rollup that is already waiting in queue.


This doesn't really make sense. I think there are plenty of cases where it's valid to make a rollup that's larger than the ones that are queued and give it higher priority.

I think the tone should be softened to urge the rollup maker to consider rollups that are already queued, but any hard suggestions here I'm afraid are just going to be used to justify closing people's rollups for no good reason.

To me it sounds a bit like you're saying "I want to be able invalidate other peoples rollups at any time" (by overwriting them with my larger rollup) but at the same time I "I don't want to have my own rollup closed for doing so".

In what situations do you deem it critical to override someones rollup?
Why not just wait it out until it is merged and then make another rollup of your own?

I think the rollup-oneupping has created a lot of unnecessary friction in the past, so its best to just avoid it altogether 🙂

To me it sounds a bit like you're saying "I want to be able invalidate other peoples rollups at any time" (by overwriting them with my larger rollup) but at the same time I "I don't want to have my own rollup closed for doing so".

First of all, claiming that this is "invalidating" other people's rollups or calling this "oneupping" makes it seem like this is a competition or race or something, which it definitely is not 😸

What I'm saying is that these should be guidelines for well-informed contributors to understand how to use rollups to help things get merged, not strict rules that must be followed to the tee. After all, I think it's very useful to give individuals discretion that is weighed against different things like what's queued and how busy the week is going and factors like that.

In what situations do you deem it critical to override someones rollup?

Well, I think it's a bit excessive to call it "critical", but there are (as of writing this comment I think) 23 rollup=always PRs in the queue right now. I think testing all of them in parallel with a couple of rollup=maybe PRs outweighs the benefit of testing a 8 PR rollup regardless of what it contains. That's what I mean by discretion above.

Of course there is always a risk that large rollups fail to merge, but I think it's unnecessary to have to wait for other smaller rollups to merge first just because they were created first. If they conflict with the larger rollup and that one does successfully merge, then great! We all win, since we're all just trying to get our PRs merged after all 😃 And if it doesn't, then we fall back to the smaller rollup.

I'm very much of the opinion that this is a non-problem because this is obviously not something that is happening all the time.

update Rollups process

11330a6

rustbot assigned Mark-Simulacrum Nov 3, 2024

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Nov 3, 2024

GuillaumeGomez reviewed Nov 3, 2024

View reviewed changes

compiler-errors reviewed Nov 3, 2024

View reviewed changes

tgross35 reviewed Nov 3, 2024

View reviewed changes

rustbot assigned jieyouxu and unassigned Mark-Simulacrum Jun 20, 2025

rustbot assigned Mark-Simulacrum and unassigned jieyouxu Jun 22, 2025

jieyouxu force-pushed the uproll branch from dc98abc to ae43065 Compare June 22, 2025 03:09

changes by Jieyou Xu, now as a separate commit

a59d59c

matthiaskrgr force-pushed the uproll branch from ae43065 to a59d59c Compare June 26, 2025 05:37

compiler-errors requested changes Jun 26, 2025

View reviewed changes

add more lessons learned

5e59a5d

matthiaskrgr force-pushed the uproll branch from 368a9e0 to 5e59a5d Compare June 26, 2025 21:31

compiler-errors reviewed Jun 26, 2025

View reviewed changes


		The actual absolute size of the rollup can depend based on experience, people new to making rollups might start with including 1 `iffy`, 4 `maybe`s, and 5 `always`s, but more experienced people might even make a rollup of 1-2 `iffy`s, 8 `maybe`s, and 10 `always`s! Massive rollups are rarely needed, but as your intuition grows you'll get better at judging risk when including PRs in a rollup.
		Don't try to make mega-rollups (15-20 PRs that merge half or more of the entire queue all at once) to keep the number of perf or bug regressions per merge as low as possible and keep potential regressions [bisectable].

	Don't try to make mega-rollups (15-20 PRs that merge half or more of the entire queue all at once) to keep the number of perf or bug regressions per merge as low as possible and keep potential regressions [bisectable].
	Limit the size of rollups, even if the queue is backed up -- large rollups run the risk of failing or merge conflicts, and smaller rollups keep potential regressions [bisectable]. On average, rollups are <N> PRs large, often varying from <N - M> to <N + M> depending on the number of `rollup=always` PRs that can be included.

update Rollups process #775

Are you sure you want to change the base?

update Rollups process #775

Uh oh!

Conversation

matthiaskrgr commented Nov 3, 2024

Uh oh!

rustbot commented Nov 3, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jieyouxu Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

GuillaumeGomez left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Noratrieb Nov 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

steffahn Feb 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Footnotes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matthiaskrgr Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jieyouxu Nov 4, 2024 • edited by oli-obk Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jieyouxu commented Feb 12, 2025

Uh oh!

apiraino commented Jun 20, 2025

Uh oh!

jieyouxu commented Jun 22, 2025

Uh oh!

matthiaskrgr commented Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jieyouxu commented Jun 26, 2025

jieyouxu Nov 4, 2024 •

edited

Loading

Noratrieb Nov 3, 2024 •

edited

Loading

steffahn Feb 17, 2025 •

edited

Loading

matthiaskrgr Nov 4, 2024 •

edited

Loading

jieyouxu Nov 4, 2024 •

edited by oli-obk

Loading

matthiaskrgr commented Jun 25, 2025 •

edited

Loading

matthiaskrgr commented Jun 26, 2025 •

edited

Loading

compiler-errors Jun 26, 2025 •

edited

Loading