Support #[global_allocator] without the allocator shim by bjorn3 · Pull Request #86844 · rust-lang/rust

bjorn3 · 2021-07-03T16:07:54Z

This makes it possible to use liballoc/libstd in combination with --emit obj if you use #[global_allocator]. This is what rust-for-linux uses right now and systemd may use in the future. Currently they have to depend on the exact implementation of the allocator shim to create one themself as --emit obj doesn't create an allocator shim.

Note that currently the allocator shim also defines the oom error handler, which is normally required too. Once #![feature(default_alloc_error_handler)] becomes the only option, this can be avoided. In addition when using only fallible allocator methods and either --cfg no_global_oom_handling for liballoc (like rust-for-linux) or --gc-sections no references to the oom error handler will exist.

To avoid this feature being insta-stable, you will have to define __rust_no_alloc_shim_is_unstable to avoid linker errors.

(Labeling this with both T-compiler and T-lang as it originally involved both an implementation detail and had an insta-stable user facing change. As noted above, the __rust_no_alloc_shim_is_unstable symbol requirement should prevent unintended dependence on this unstable feature.)

rust-highfive · 2021-07-03T16:07:56Z

Some changes occured to rustc_codegen_cranelift

cc @bjorn3

rust-highfive · 2021-07-03T16:07:57Z

r? @jackh726

(rust-highfive has picked a reviewer for you, use r? to override)

jyn514 · 2021-07-03T17:10:17Z

Hmm, r? @scottmcm maybe?

compiler/rustc_codegen_cranelift/src/allocator.rs

bors · 2021-08-06T21:39:16Z

☔ The latest upstream changes (presumably #87822) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2021-08-07T23:40:20Z

☔ The latest upstream changes (presumably #87743) made this pull request unmergeable. Please resolve the merge conflicts.

nbdd0121 · 2021-08-08T15:33:01Z

It took me a few while to understand this, so essentially instead of generating __rust_alloc that forwards to __rg_alloc or __rdl_alloc based on allocator kind, this PR makes #[global_allocator] generate __rust_alloc directly, while in case global allocator is absent, generate a shim that forwards __rust_alloc to __rdl_alloc.

Would it make sense to (in addition to this PR), just generate a

#[global_allocator]
static GLOBAL: System = System;

at somewhere in higher level when global allocator is absent, instead of having the logic duplicated in codegen? This would allow __rdl_alloc etc to be removed completely (and is more consistent with what document describes in https://doc.rust-lang.org/std/alloc/struct.System.html).

bjorn3 · 2021-08-08T15:37:18Z

Would it make sense to just generate a
#[global_allocator]
static GLOBAL: System = System;
at somewhere in higher level instead of having the logic duplicated in codegen? This would allow __rdl_alloc etc to be removed completely (and is more consistent with what document describes in https://doc.rust-lang.org/std/alloc/struct.System.html).

You can use say --crate-type cdylib --crate-type lib, in which case the cdylib would need the allocator shim, but the lib must not get the allocator shim. Both use the same object files, except for the allocator shim object file that only the bin gets.

compiler/rustc_builtin_macros/src/global_allocator.rs

wesleywiser · 2021-08-26T23:33:21Z

I posted a message in the wg-allocators Zulip stream to see if anyone there has opinions on this PR before it's merged.

Co-authored-by: Ralf Jung <post@ralfj.de>

I can't figure out how to link with the MSVC toolchain

bjorn3 · 2023-05-11T14:48:47Z

Rebased and just like with #106560 I disabled the test on MSVC for now.

pnkfelix · 2023-05-25T14:17:44Z

@bors r+

bors · 2023-05-25T14:17:46Z

📌 Commit 33d9b58 has been approved by pnkfelix

It is now in the queue for this repository.

bors · 2023-05-25T17:00:01Z

⌛ Testing commit 33d9b58 with merge a2b1646...

bors · 2023-05-25T19:33:28Z

☀️ Test successful - checks-actions
Approved by: pnkfelix
Pushing a2b1646 to master...

rust-timer · 2023-05-25T21:20:14Z

Finished benchmarking commit (a2b1646): comparison URL.

Overall result: ❌ regressions - ACTION NEEDED

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression
cc @rust-lang/wg-compiler-performance

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.2%	[0.2%, 0.2%]	1
Regressions ❌ (secondary)	0.5%	[0.3%, 0.7%]	10
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.2%	[0.2%, 0.2%]	1

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	3.4%	[3.4%, 3.4%]	1
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-1.3%	[-1.3%, -1.3%]	1
All ❌✅ (primary)	-	-	0

Cycles

This benchmark run did not return any relevant results for this metric.

Binary size

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.1%	[0.0%, 0.4%]	44
Regressions ❌ (secondary)	0.2%	[0.0%, 0.4%]	25
Improvements ✅ (primary)	-0.1%	[-0.1%, -0.1%]	2
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.1%	[-0.1%, 0.4%]	46

Bootstrap: 647.061s -> 648.599s (0.24%)

bjorn3 · 2023-05-25T21:28:14Z

library/alloc/src/alloc.rs

+        // Make sure we don't accidentally allow omitting the allocator shim in
+        // stable code until it is actually stabilized.
+        #[cfg(not(bootstrap))]
+        core::ptr::read_volatile(&__rust_no_alloc_shim_is_unstable);


I did expect the perf regression to come from this change. It is a single extra instruction on the allocation path to ensure __rust_no_alloc_shim_is_unstable must be defined if no allocator shim is linked in. The only way I can think of that guarantees that it isn't possible to link without defining __rust_no_alloc_shim_is_unstable would be to put __rust_alloc and an item referencing __rust_no_alloc_shim_is_unstable in the same COMDAT group, but this isn't possible for all object file formats and rust doesn't have a way to do this without global_asm!().

It's unfortunate that this symbol is now present in every single allocation path, especially where it enlarges the output binary for platforms like Wasm.

Is it instead possible to introduce a new function __rust_no_alloc_shim_is_unstable that would simply forward to __rust_alloc?

That way you still get the desired linker error if it's not declared, but it won't take any more space and won't cause as much of a performance hit.

That would prevent LLVM from optimizing allocations away I think as __rust_no_alloc_shim_is_unstable is not considered to be an allocator function by LLVM, while __rust_alloc is. That said, I just opened a draft PR which will enable doing away with the allocator shim entirely in the future, after which I hope it will be much easier to request stabilization of support for not using the allocator shim and thus remove this symbol entirely.

pnkfelix · 2023-05-30T20:14:27Z

@bjorn3 when you say:

I did expect the perf regression to come from this change.

are you referring to the increase in binary object file size?

bjorn3 · 2023-05-30T20:17:51Z

I'm referring to the instruction count regression. I hadn't noticed the binary object file size regression.

pnkfelix · 2023-05-30T20:18:40Z

The 0.2% hit to primary benchmark serde_derive check-incr_unchanged is easily justified by the feature addition here.
The more interesting question is 44 primary benchmarks saw a regression to their binary size. However, the only one of those of note, in my opinion, is ripgrep, which suffered a 0.43% increase to binary size on various opt scenarios.
marking as triaged.

@rustbot label: perf-regression-triaged

bjorn3 added A-linkage Area: linking into static, shared libraries and binaries A-allocators Area: Custom and system allocators T-lang Relevant to the language team T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jul 3, 2021

rust-highfive assigned jackh726 Jul 3, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 3, 2021

This comment has been minimized.

Sign in to view

bjorn3 force-pushed the global_alloc_improvements branch from ba27e9f to 51f054c Compare July 3, 2021 16:11

This comment has been minimized.

Sign in to view

bjorn3 force-pushed the global_alloc_improvements branch from e4a996a to b2b1a59 Compare July 3, 2021 16:40

rust-highfive assigned scottmcm and unassigned jackh726 Jul 3, 2021

bjorn3 marked this pull request as ready for review July 3, 2021 17:16

bjorn3 mentioned this pull request Jul 3, 2021

Implement "default_alloc_error_handler" feature rust-lang/rustc_codegen_cranelift#1182

Closed

ojeda mentioned this pull request Jul 4, 2021

Rust wanted features Rust-for-Linux/linux#354

Open

55 tasks

RalfJung reviewed Jul 6, 2021

View reviewed changes

compiler/rustc_codegen_cranelift/src/allocator.rs Outdated Show resolved Hide resolved

bjorn3 mentioned this pull request Jul 12, 2021

Start working on proof of concept for exposing Backtrace in core #77384

Closed

camelid added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 23, 2021

bjorn3 force-pushed the global_alloc_improvements branch from 355b470 to 4b56c58 Compare August 7, 2021 09:38

bjorn3 force-pushed the global_alloc_improvements branch from 4b56c58 to 74b52ae Compare August 8, 2021 09:47

nbdd0121 reviewed Aug 9, 2021

View reviewed changes

compiler/rustc_builtin_macros/src/global_allocator.rs Outdated Show resolved Hide resolved

bjorn3 and others added 8 commits May 11, 2023 14:35

Add test

8ea28a4

Fix allocator shim handling in miri

9506011

Fix fs miri test on AArch64

efb9c30

Improve miri comments

568deb7

Fix review comments

ffd8cb8

Co-authored-by: Ralf Jung <post@ralfj.de>

Fix test

34f6a83

Fix no-alloc-shim test on MSVC

8ace03e

Ignore test on MSVC for now

3082865

I can't figure out how to link with the MSVC toolchain

This comment has been minimized.

Sign in to view

Bless miri tests

33d9b58

bjorn3 mentioned this pull request May 25, 2023

Investigate why no GlobalAlloc-related symbols are generated Rust-for-Linux/linux#68

Closed

bjorn3 commented May 25, 2023

View reviewed changes

ojeda mentioned this pull request Jul 25, 2023

Rust unstable features needed for the kernel Rust-for-Linux/linux#2

Open

48 tasks

lqd mentioned this pull request Aug 5, 2023

Regression in global_allocator when using prefer-dynamic on 1.71.0 and above #114518

Open

bjorn3 mentioned this pull request Aug 8, 2023

do not add noalias in return position #106371

Merged

Mark-Simulacrum mentioned this pull request Mar 24, 2024

Tracking Issue for __rust_no_alloc_shim_is_unstable #123015

Open

3 tasks

saethlin mentioned this pull request Sep 18, 2024

read_volatile __rust_no_alloc_shim_is_unstable in alloc_zeroed #130497

Merged

bjorn3 mentioned this pull request Dec 20, 2024

[WIP] Use weak linkage instead of compiler generated shims #134522

Draft

anforowicz mentioned this pull request Apr 2, 2025

-Zemit-code-for-final-artifact-to-link (officially supported __rust_alloc replacement) rust-lang/compiler-team#858

Closed

3 tasks

chbaker0 mentioned this pull request Apr 2, 2025

__rust_alloc can no longer be used to provide a custom allocator #139265

Closed

ojeda mentioned this pull request Feb 8, 2026

Rust unstable features needed for the kernel — Done Rust-for-Linux/linux#1223

Open

60 tasks

Uh oh!

Conversation

bjorn3 commented Jul 3, 2021 • edited by pnkfelix Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rust-highfive commented Jul 3, 2021

Uh oh!

rust-highfive commented Jul 3, 2021

Uh oh!

This comment has been minimized.

This comment has been minimized.

jyn514 commented Jul 3, 2021

Uh oh!

Uh oh!

bors commented Aug 6, 2021

Uh oh!

bors commented Aug 7, 2021

Uh oh!

nbdd0121 commented Aug 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bjorn3 commented Aug 8, 2021

Uh oh!

Uh oh!

wesleywiser commented Aug 26, 2021

Uh oh!

bjorn3 commented May 11, 2023

Uh oh!

This comment has been minimized.

pnkfelix commented May 25, 2023

Uh oh!

bors commented May 25, 2023

Uh oh!

bors commented May 25, 2023

Uh oh!

bors commented May 25, 2023

Uh oh!

rust-timer commented May 25, 2023

Overall result: ❌ regressions - ACTION NEEDED

Instruction count

Max RSS (memory usage)

Cycles

Binary size

Uh oh!

bjorn3 May 25, 2023

Choose a reason for hiding this comment

Uh oh!

RReverser Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bjorn3 Dec 19, 2024

Choose a reason for hiding this comment

Uh oh!

pnkfelix commented May 30, 2023

Uh oh!

bjorn3 commented May 30, 2023

Uh oh!

pnkfelix commented May 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

bjorn3 commented Jul 3, 2021 •

edited by pnkfelix

Loading

nbdd0121 commented Aug 8, 2021 •

edited

Loading

RReverser Nov 7, 2024 •

edited

Loading