Depends on #120010
Support `R_AARCH64_AUTH_TLSDESC_ADR_PAGE21`, `R_AARCH64_AUTH_TLSDESC_LD64_LO12`
and `R_AARCH64_AUTH_TLSDESC_ADD_LO12` static relocations and
`R_AARCH64_AUTH_TLSDESC` dynamic relocation. IE/LE optimization is not
currently supported for AUTH TLSDESC.
Depends on #114525
Support `R_AARCH64_AUTH_GOT_ADR_PREL_LO21` and `R_AARCH64_AUTH_GOT_LD_PREL19`
GOT-generating relocations. A corresponding `RE_AARCH64_AUTH_GOT_PC` member
of `RelExpr` is added, which is an AUTH-specific variant of `R_GOT_PC`.
Depends on #113811
Support `R_AARCH64_AUTH_ADR_GOT_PAGE`, `R_AARCH64_AUTH_GOT_LO12_NC` and
`R_AARCH64_AUTH_GOT_ADD_LO12_NC` GOT-generating relocations. For preemptible
symbols, the dynamic relocation `R_AARCH64_AUTH_GLOB_DAT` is emitted. Otherwise,
we unconditionally emit an `R_AARCH64_AUTH_RELATIVE` dynamic relocation, since
pointers in the signed GOT need to be signed at dynamic link time.
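A minimal sketch of that rule, with illustrative names rather than lld's actual code:
```
// Which dynamic relocation a signed GOT entry gets. Unlike the unsigned
// GOT, even entries for non-preemptible symbols need a dynamic
// relocation, because the dynamic loader must sign the pointer at load
// time.
enum class SignedGotDynReloc { AuthGlobDat, AuthRelative };

static SignedGotDynReloc signedGotDynReloc(bool isPreemptible) {
  return isPreemptible ? SignedGotDynReloc::AuthGlobDat
                       : SignedGotDynReloc::AuthRelative;
}
```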
RelExpr enumerators are named `R_*`, which can be confused with ELF
relocation type names. Rename the target-specific ones to `RE_*` to
avoid confusion.
For consistency, the target-independent ones can be renamed as well, but
that's not urgent. The relocation processing mechanism with RelExpr has
non-trivial overhead compared with mold's approach; we might move more
code into Arch/*.cpp files and reduce the number of enumerators.
Pull Request: https://github.com/llvm/llvm-project/pull/118424
Assume PAC instructions are supported when the PAuth core info differs from (0,0). A (0,0) value means that an ELF file is incompatible with PAuth - see https://github.com/ARM-software/abi-aa/blob/2024Q3/pauthabielf64/pauthabielf64.rst#core-information. With PAC non-hint instructions supported, `autia1716; br x17` can be replaced with `braa x17, x16; nop`, where `braa` is an authenticated branch instruction that uses the IA key, the discriminator from x16 and the signed target address from x17.
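A minimal sketch of the gating condition, assuming the PAuth core info has already been parsed into platform and version fields (illustrative, not lld's code):
```
#include <cstdint>

// Per pauthabielf64, core info of (0, 0) marks an ELF file as
// incompatible with PAuth; any other value lets the linker assume PAC
// non-hint instructions such as braa are available.
static bool canUsePacNonHintInsns(uint64_t platform, uint64_t version) {
  return platform != 0 || version != 0;
}
```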
Rename lld::toString (to lld::elf::toStr) to simplify name lookup (we
have many llvm::toString overloads and another lld::toString(const
llvm::opt::Arg &)), so that we can remove the global `ctx` from toString
implementations.
The PAC PLT header contains an adrp instruction whose immediate field
must be filled in. In https://reviews.llvm.org/D62609, the adrp
instruction's address was calculated incorrectly. This patch resolves
the issue.
The test is already present in test/ELF/aarch64-feature-pac.s
Also rename `TargetInfo *getXXXTargetInfo` to `void setXXXTargetInfo`
and change it to set `ctx.target`. This ensures that when `ctx` becomes
a local variable, two lld invocations will not reuse the function-local
static variable.
---
Reland after commit c35214c131c0bc7f54dc18ceb75c75cba89f58ee
([ELF] Initialize TargetInfo members).
Also rename `TargetInfo *getXXXTargetInfo` to `void setXXXTargetInfo`
and change it to set `ctx.target`. This ensures that when `ctx` becomes
a local variable, two lld invocations will not reuse the function-local
static variable.
When Branch Target Identification (BTI) is enabled, all indirect branches
must target a BTI instruction. A long branch thunk is a source of
indirect branches. To date, LLD has assumed that the object
producer is responsible for putting a BTI instruction at every place the
linker might generate an indirect branch to. This is true for clang, but
not for GCC. GCC will elide the BTI instruction when it can prove that
there are no indirect branches from outside the translation unit(s). GNU
ld was fixed to generate a landing pad stub (GNU ld's term for a thunk)
for the destination when a long range stub was needed [1].
This means that using GCC compiled objects with LLD may lead to LLD
generating an indirect branch to a location without a BTI. The ABI [2]
has also been clarified to say that it is a static linker's
responsibility to generate a landing pad when the target does not have a
BTI.
This patch implements the same mechanism as GNU ld. When the output ELF
file sets the GNU_PROPERTY_AARCH64_FEATURE_1_BTI property, we check the
destination to see if it has a BTI instruction. If it does not, we
generate a landing pad consisting of:
```
BTI c
B <destination>
```
The `B <destination>` can be elided if the thunk can be placed so that
control flow drops through. For example:
```
BTI c
<destination>:
```
This will be common when -ffunction-sections is used.
The landing pad thunks are effectively alternative entry points for the
function. Direct branches are unaffected but any linker generated
indirect branch needs to use the alternative. We place these as close as
possible to the destination section.
There is some further optimization possible. Consider the case:
```
.text
fn1
...
fn2
...
```
If we need landing pad thunks for both fn1 and fn2, we could order them
so that the thunk for fn1 immediately precedes fn1. This could save a
single branch. However, I didn't think that would be worth the
additional complexity.
[1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106671
[2] https://github.com/ARM-software/abi-aa/issues/196
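A minimal sketch of the destination check, using HINT-space encodings from the Arm ARM (illustrative, not lld's actual code; treating PACIASP/PACIBSP as implicit BTI c is an assumption here):
```
#include <cstdint>

// Does the first instruction at the branch target already serve as a
// landing pad for an indirect branch through x16/x17?
static bool isBtiLandingPad(uint32_t insn) {
  switch (insn) {
  case 0xd503245f: // BTI c
  case 0xd50324df: // BTI jc
  case 0xd503233f: // PACIASP (implicit BTI c)
  case 0xd503237f: // PACIBSP (implicit BTI c)
    return true;
  default:
    return false;
  }
}
```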
Ctx was introduced in March 2022 as a more suitable place for such
singletons.
llvm/Support/thread.h includes <thread>, which transitively includes
sstream in libc++ and uses ios_base::in, so we cannot use `#define in ctx.sec`.
`symtab, config, ctx` are now the only variables using
LLVM_LIBRARY_VISIBILITY.
On AArch64, the endianness of instruction encodings is always little,
whereas the endianness of data depends on whether the output is LE or
BE. So getImplicitAddend must use the right one of read32() and
read32le(): read32() for data and read32le() for code. It was using
read32() throughout, causing instructions to be read as big-endian in
BE mode and producing the wrong addend.
Fixed, and updated the existing test to check both endiannesses. The
expected results for data must be byte-swapped, but the ones for code
need no adjustment.
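A minimal sketch of the distinction, with hand-rolled byte readers standing in for the read32()/read32le() helpers:
```
#include <cstdint>

static uint32_t read32le(const uint8_t *p) {
  return uint32_t(p[0]) | uint32_t(p[1]) << 8 | uint32_t(p[2]) << 16 |
         uint32_t(p[3]) << 24;
}
static uint32_t read32be(const uint8_t *p) {
  return uint32_t(p[0]) << 24 | uint32_t(p[1]) << 16 |
         uint32_t(p[2]) << 8 | uint32_t(p[3]);
}

// Instruction encodings are always little-endian on AArch64...
static uint32_t readInsn(const uint8_t *p) { return read32le(p); }
// ...whereas data follows the endianness of the output.
static uint32_t readData32(const uint8_t *p, bool bigEndian) {
  return bigEndian ? read32be(p) : read32le(p);
}
```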
Normally, AArch64 ELF objects use the SHT_RELA type of relocation
section, with addends stored in each relocation. But some legacy AArch64
object producers still use SHT_REL in some situations, storing the
addend in the initial value of the data item or instruction immediate
field that the relocation will modify. LLD was mishandling relocations
of this type in multiple ways.
Firstly, many of the cases in the `getImplicitAddend` switch statement
were apparently based on a misunderstanding. The relocation types that
operate on instructions should be expecting to find an instruction of
the appropriate type, and should extract its immediate field. But many
of them were instead behaving as if they expected to find a raw 64-, 32-
or 16-bit value, and wanted to extract the right range of bits. For
example, the relocation for R_AARCH64_ADD_ABS_LO12_NC read a 16-bit word
and extracted its bottom 12 bits, presumably on the thinking that the
relocation writes the low 12 bits of the value it computes. But the
input addend for SHT_REL purposes occupies the immediate field of an
AArch64 ADD instruction, which meant it should have been reading a
32-bit AArch64 instruction encoding, and extracting bits 10-21 where the
immediate field lives. Worse, the R_AARCH64_MOVW_UABS_G2 relocation was
reading 64 bits from the input section, and since it's only relocating a
32-bit instruction, the second half of those bits would have been
completely unrelated!
Adding to that confusion, most of the values being read were first
sign-extended, and _then_ had a range of bits extracted, which doesn't
make much sense. They should have first extracted some bits from the
instruction encoding, and then sign-extended that 12-, 19-, or 21-bit
result (or whatever else) to a full 64-bit value.
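For example, a sketch of the corrected order of operations for R_AARCH64_ADR_PREL_LO21 (illustrative, not the patch's exact code):
```
#include <cstdint>

// ADR splits its 21-bit immediate into immlo (bits 29-30) and immhi
// (bits 5-23). Extract the field from the encoding first, then
// sign-extend the 21-bit result to 64 bits.
static int64_t adrImplicitAddend(uint32_t insn) {
  uint64_t immlo = (insn >> 29) & 0x3;
  uint64_t immhi = (insn >> 5) & 0x7ffff;
  uint64_t imm21 = (immhi << 2) | immlo;
  return int64_t(imm21 << 43) >> 43; // sign-extend from bit 20
}
```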
Secondly, after the relocated value was computed, in most cases it was
being written into the target instruction field via a bitwise OR
operation. This meant that if the instruction field didn't initially
contain all zeroes, the wrong result would end up in it. That's not even
a 100% reliable strategy for SHT_RELA, which in some situations is used
for its repeatability (in the sense that applying the relocation twice
should cause the second answer to overwrite the first, so you can
relocate an image in advance to its most likely address, and then do it
again at load time if that turns out not to be available). But for
SHT_REL, when you expect nonzero immediate fields in normal use, it
couldn't possibly work. You could see the effect of this in the existing
test, which had a lot of FFFFFF in the expected output which there
wasn't any plausible justification for.
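A sketch of the overwrite approach for the ADD imm12 case (illustrative):
```
#include <cstdint>

// Clear the 12-bit immediate field of an AArch64 ADD (bits 10-21) before
// inserting the relocated value, rather than ORing into stale bits.
static uint32_t patchAddImm12(uint32_t insn, uint64_t value) {
  uint32_t imm12 = uint32_t(value) & 0xfff;
  return (insn & ~(0xfffu << 10)) | (imm12 << 10);
}
```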
Finally, one relocation type was actually missing: there was no support
for R_AARCH64_ADR_PREL_LO21 at all.
So I've rewritten most of the cases in `getImplicitAddend`; replaced the
bitwise ORs with overwrites; and replaced the previous test with a much
more thorough one, obtained by writing an input assembly file with
explicitly specified relocations on instructions that also have
carefully selected immediate fields, and then doing some yaml2obj
seddery to turn the RELA relocation section into a REL one.
The function getImplicitAddend() is incomplete, as it is possible to
cook up object files with other relocation addends. Although using
llvm-mc or the clang integrated assembler does not produce such object
files, a proprietary assembler known as armasm can:
https://developer.arm.com/documentation/101754/0622/armasm-Legacy-Assembler-Reference
armasm is in a frozen state, but it is still actively used in many
legacy codebases, as its directives, macros and operators are very
different from those of the clang integrated assembler. This makes
porting legacy code from armasm syntax impractical for many projects.
Some internal testing of projects using open-source clang and lld fell
over at link time when legacy armasm objects were included in the link.
The goal of this patch is to enable people with legacy armasm objects to
use lld as the linker. Sadly, armasm uses SHT_REL format relocations for
AArch64 rather than SHT_RELA, which causes lld to reject the objects. As
armasm is frozen, we know the small set of relocations that it
officially supports, and that set will not grow (outside the equivalent
of the .reloc directive, which I think we can rule out of scope as it is
not commonly used).
The benefit to lld is that it will ease migration from a proprietary to
an open-source toolchain. The drawback is the implementation of a small
number of SHT_REL relocations. Although this patch doesn't aim to
comprehensively cover all possible relocation addends, it does extend
lld to work with the relocation addends that armasm produces, using the
canonical aaelf64 document as a reference:
https://github.com/ARM-software/abi-aa/blob/main/aaelf64/aaelf64.rst
Florian pointed out that we're accidentally eliding the Android note for
static executables, as it's guarded behind the "can have memtag globals"
conditional. Of course, memtag globals are unsupported for static
executables, but we should still allow static binaries to produce the
Android note (as that's the only way they get MTE).
When the .eh_frame section is placed at a non-zero offset within its
output section, the relocation values within .eh_frame are computed
incorrectly. We had a similar issue in the .ARM.exidx section; it was
fixed in https://reviews.llvm.org/D148033.
While applying a relocation of the form S + A - P, the value of P (the
location of the relocation) was wrong: P = SecAddr + rel.offset, but
SecAddr pointed to the starting address of the output section rather
than the starting address of the .eh_frame section within that output
section.
This issue affects all targets that generate a .eh_frame section, so the
fix is applied to all the affected targets.
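A sketch of the corrected computation, with illustrative parameter names:
```
#include <cstdint>

// P is the address of the relocated location: the start of the output
// section, plus the offset of .eh_frame within it (the previously
// missing term), plus the relocation's offset.
static uint64_t relocPlace(uint64_t outSecAddr, uint64_t ehFrameOutSecOff,
                           uint64_t relOffset) {
  return (outSecAddr + ehFrameOutSecOff) + relOffset;
}
```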
As per the ABI at
https://github.com/ARM-software/abi-aa/blob/main/memtagabielf64/memtagabielf64.rst,
this patch interprets the SHT_AARCH64_MEMTAG_GLOBALS_STATIC section,
which contains R_NONE relocations to tagged globals, and emits a
SHT_AARCH64_MEMTAG_GLOBALS_DYNAMIC section, with the correct
DT_AARCH64_MEMTAG_GLOBALS and DT_AARCH64_MEMTAG_GLOBALSSZ dynamic
entries. This section describes, in a ULEB128-encoded stream, global
memory ranges that should be tagged with MTE.
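For reference, a minimal ULEB128 encoder of the kind such a stream relies on (the actual layout of the entries is specified by the memtag ABI document above):
```
#include <cstdint>
#include <vector>

static void encodeULEB128(uint64_t value, std::vector<uint8_t> &out) {
  do {
    uint8_t byte = value & 0x7f;
    value >>= 7;
    if (value)
      byte |= 0x80; // more bytes follow
    out.push_back(byte);
  } while (value);
}
```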
We are also out of bits to spare in the LLD Symbol class. As a result,
I've reused the 'needsTocRestore' bit, which is a PPC64 only feature.
Now, it's also used for 'isTagged' on AArch64.
An entry in SHT_AARCH64_MEMTAG_GLOBALS_STATIC is practically a guarantee
from an objfile that all references to the linked symbol are through the
GOT, and meet correct alignment requirements. As a result, we go through
all symbols and make sure that, for all symbols $SYM, all object files
that reference $SYM also have a SHT_AARCH64_MEMTAG_GLOBALS_STATIC entry
for $SYM. If this isn't the case, we demote the symbol to being
untagged. Symbols that are imported from other DSOs should always be
fine, as they're GOT-referenced (and thus the GOT entry either has the
correct tag or not, depending on whether it's tagged in the defining DSO
or not).
Additionally hand-tested by building {libc, libm, libc++, and libnetd}
on Android with some experimental MTE globals support in the
linker/libc.
Reviewed By: MaskRay, peter.smith
Differential Revision: https://reviews.llvm.org/D152921
With relative vtables, the caller jumps directly to the PLT entries in the shared object;
therefore a landing pad is needed for these entries.
Reproducer:
main.cpp
```
#include "v.hpp"
int main() {
  A* a = new B();
  a->do_something2();
  return 0;
}
```
v.hpp
```
struct A {
  virtual void do_something() = 0;
  virtual void do_something2();
};
struct B : public A {
  void do_something() override;
  void do_something2() override;
};
```
v.cpp
```
#include "v.hpp"
void A::do_something2() { }
void B::do_something() { }
void B::do_something2() { }
```
```
CC="clang++ --target=aarch64-unknown-linux-gnu -fuse-ld=lld -mbranch-protection=bti"
F=-fexperimental-relative-c++-abi-vtables
${=CC} $F -shared v.cpp -o v.so -z force-bti
${=CC} $F main.cpp -L./ v.so -Wl,-rpath=. -z force-bti
qemu-aarch64-static -L /usr/aarch64-linux-gnu -cpu max ./a.out
```
For v.so, the regular vtable entry is relocated by an R_AARCH64_ABS64 relocation referencing _ZN1B13do_something2Ev.
```
_ZTV1B:
.xword _ZN1B13do_something2Ev
```
Using relative vtable entries in a DSO has the downside of creating many PLT entries and making their addresses escape.
The relative vtable entry references a PLT entry _ZN1B13do_something2Ev@plt.
```
.L_ZTV1A.local:
.word (_ZN1A13do_something2Ev@PLT-.L_ZTV1A.local)-8
```
Fixes: #63580
Reviewed By: peter.smith, MaskRay
Differential Revision: https://reviews.llvm.org/D153264
Add BTI to those PLT entries which are accessed by a range extension thunk, since those perform an indirect call.
Fixes: #62140
Reviewed By: MaskRay
Differential Revision: https://reviews.llvm.org/D148704
This prepares for changing `relocations` from a SmallVector to a pointer.
Also change the `isec` parameter in `addAddendOnlyRelocIfNonPreemptible` to `GotSection &`.
The target-specific code (AArch64, PPC64) does not fit into the generic code and
adds virtual function overhead. Move relocateAlloc into ELF/Arch/ instead. This
removes many virtual functions (relaxTls*). In addition, this helps get rid of
getRelocTargetVA dispatch and many RelExpr members in the future.