Fix linker-plugin-lto only doing thin lto #136840

Flakebi · 2025-02-10T23:45:01Z

When rust provides LLVM bitcode files to lld and the bitcode contains function summaries as used for thin lto, lld defaults to using thin lto. This prevents some optimizations that are only applied for fat lto. I ran into this with the amdgpu backend and was able to create a test there. Unfortunately, I wasn’t able to recreate the same test on x86, the difference between thin and fat lto seems less pronounced there.

The important part of the change is setting the ThinLTO=0 module flag. The rest of the changes are fixing the <function>.kd symbol getting exported for amdhsa with linker-plugin-lto.

Tracking issue: #135024

r? @workingjubilee, as you’ve been reviewing most other amdgpu patches, not sure if there should be other reviewers for lto.

When rust provides LLVM bitcode files to lld and the bitcode contains function summaries as used for thin lto, lld defaults to using thin lto. This prevents some optimizations that are only applied for fat lto. I ran into this with the amdgpu backend and was able to create a test there. Unfortunately, I wasn’t able to recreate the same test on x86, the difference between thin and fat lto seems less pronounced there. The important part of the change is setting the `ThinLTO=0` module flag. The rest of the changes are fixing the `<function>.kd` symbol getting exported for amdhsa with linker-plugin-lto.

rustbot · 2025-02-10T23:45:04Z

Could not assign reviewer from: workingjubilee.
User(s) workingjubilee are either the PR author, already assigned, or on vacation. Please use r? to specify someone else to assign.

rustbot · 2025-02-10T23:45:10Z

r? @jieyouxu

rustbot has assigned @jieyouxu.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

rustbot · 2025-02-10T23:45:12Z

This PR modifies tests/run-make/. If this PR is trying to port a Makefile
run-make test to use rmake.rs, please update the
run-make port tracking issue
so we can track our progress. You can either modify the tracking issue
directly, or you can comment on the tracking issue and link this PR.

cc @jieyouxu

workingjubilee · 2025-02-11T00:09:30Z

I have barely any idea about LTO besides "it happens and it involves dlopening a compiler and shoving its serialized data back in it" tbh soo

jieyouxu · 2025-02-11T00:35:31Z

Unfortunately I have no clue either, so

r? compiler

Flakebi · 2025-02-11T12:59:05Z

For reference, the code that switches to thin lto when the flag is not set is here: https://github.com/llvm/llvm-project/blob/e258bca9505f35e0a22cb213a305eea9b76d11ea/llvm/lib/Bitcode/Writer/BitcodeWriter.cpp#L4441-L4446

  // By default we compile with ThinLTO if the module has a summary, but the
  // client can request full LTO with a module flag.
  bool IsThinLTO = true;
  if (auto *MD =
          mdconst::extract_or_null<ConstantInt>(M.getModuleFlag("ThinLTO")))
    IsThinLTO = MD->getZExtValue();

The code in clang that sets the flag, which is replicated here for Rust is here: https://github.com/llvm/llvm-project/blob/560149b5e3c891c64899e9912e29467a69dc3a4c/clang/lib/CodeGen/BackendUtil.cpp#L1150

        if (!TheModule->getModuleFlag("ThinLTO") && !CodeGenOpts.UnifiedLTO)
          TheModule->addModuleFlag(llvm::Module::Error, "ThinLTO", uint32_t(0));

bjorn3 · 2025-02-11T17:39:59Z

compiler/rustc_codegen_llvm/src/context.rs

+    // Disable ThinLTO if fat lto is requested. Otherwise lld defaults to thin lto.
+    if sess.lto() == config::Lto::Fat {
+        llvm::add_module_flag_u32(llmod, llvm::ModuleFlagMergeBehavior::Override, "ThinLTO", 0);
+    }


What if a dependency is built with lto=true (aka lto=fat), but then the user wants to use thinLTO? I'm pretty sure the standard library is built with lto=true for example, but that shouldn't prevent thinLTO from ever working.

Good question, it seems to change somewhat, but still work in general. I added a test for this.
What changes: Without this change, the test passes when
lib is compiled with O0 and main with O3 and

lib uses lto=thin and main uses lto=thin

lib uses lto=thin and main uses lto=fat

lib uses lto=fat and main uses lto=thin

lib uses lto=fat and main uses lto=fat

With this change, all of these keep passing except for case 3 (lib uses lto=fat and main uses lto=thin).
When lib is compiled with O1, O2 or O3, case 3 passes as well.
I assume this is the important case, as the standard library is compiled with optimizations.
(And lto with O0 is kinda questionable, except maybe for nvptx and amdgpu, but they require lto=fat anyway.)

bjorn3 · 2025-02-11T17:40:35Z

compiler/rustc_codegen_ssa/src/back/link.rs

+        let dylib_pgo = crate_type == CrateType::Dylib || sess.opts.cg.profile_generate.enabled();
+        // When compiling for amdhsa, every kernel function generates a <function name>.kd symbol.
+        // This symbol gets removed again when using linker-plugin-lto. Disable gc_sections to keep
+        // the symbol.


Maybe the version script should list those symbols instead?

The symbol is in the version script with #135909. I’m not 100% sure why it doesn’t work with linker-plugin-lto (putting it in the version script works with normal lto). My guess is that it’s the same problem as with the defined symbols, lld doesn’t see the symbol defined at the start, because the backend hasn’t created it yet.

I looked into lld to be sure and it seems to be indeed the case that the <kernel>.kd symbol is removed with --gc-sections because it is not defined at the beginning when doing full linker lto.

For future reference, this is what happens in more details:

SymbolTable::scanVersionScript looks at entries in the linker script and calls the assignExact lambda. It ignores symbols that are not defined (unless undefined symbols are forbidden, then it aborts).

elf::parseVersionAndComputeIsPreemptible sets isExported of existing symbols to true.

Now bitcode inputs get compiled, this generates <kernel>.kd symbols for amdhsa.

MarkLive<ELFT>::run checks isExported and then marks these symbols as isLive.

demoteSymbolsAndComputeIsPreemptible demotes symbols that are not marked as isLive. This makes the symbol undefined.

Later on, in Add symbols to symtabs, undefined symbols are skipped

Not passing --gc-sections changes elf::markLive (with gc, this calls MarkLive<ELFT>::run) to mark all sections as needed and this skips further live checks.

It seems to be this issue, I’ll mention it in the comment: llvm/llvm-project#119479

rustbot assigned jieyouxu Feb 10, 2025

rustbot added A-run-make Area: port run-make Makefiles to rmake.rs S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Feb 10, 2025

Flakebi mentioned this pull request Feb 10, 2025

Tracking Issue for amdgpu target #135024

Open

16 tasks

rustbot assigned fee1-dead and unassigned jieyouxu Feb 11, 2025

bjorn3 reviewed Feb 11, 2025

View reviewed changes

Flakebi added 2 commits February 12, 2025 23:50

Add testcase for fat then thin lto

ef534d6

Add link to lld gc-sections issue

c04d9f1

This comment has been minimized.

Sign in to view

Fix formatting

0d63f96

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix linker-plugin-lto only doing thin lto #136840

Fix linker-plugin-lto only doing thin lto #136840

Flakebi commented Feb 10, 2025 •

edited

Loading

rustbot commented Feb 10, 2025

rustbot commented Feb 10, 2025

rustbot commented Feb 10, 2025

workingjubilee commented Feb 11, 2025

jieyouxu commented Feb 11, 2025

Flakebi commented Feb 11, 2025

bjorn3 Feb 11, 2025

Flakebi Feb 12, 2025

bjorn3 Feb 11, 2025

Flakebi Feb 11, 2025

Flakebi Feb 12, 2025

Flakebi Feb 12, 2025

This comment has been minimized.

Fix linker-plugin-lto only doing thin lto #136840

Are you sure you want to change the base?

Fix linker-plugin-lto only doing thin lto #136840

Conversation

Flakebi commented Feb 10, 2025 • edited Loading

rustbot commented Feb 10, 2025

rustbot commented Feb 10, 2025

rustbot commented Feb 10, 2025

workingjubilee commented Feb 11, 2025

jieyouxu commented Feb 11, 2025

Flakebi commented Feb 11, 2025

bjorn3 Feb 11, 2025

Choose a reason for hiding this comment

Flakebi Feb 12, 2025

Choose a reason for hiding this comment

bjorn3 Feb 11, 2025

Choose a reason for hiding this comment

Flakebi Feb 11, 2025

Choose a reason for hiding this comment

Flakebi Feb 12, 2025

Choose a reason for hiding this comment

Flakebi Feb 12, 2025

Choose a reason for hiding this comment

This comment has been minimized.

Flakebi commented Feb 10, 2025 •

edited

Loading