llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-26 17:36:05 +00:00

Author	SHA1	Message	Date
Simi Pallipurath	f146763e07	Revert "Revert "[lld][Arm] Big Endian - Byte invariant support."" This reverts commit d8851384c6ac2a1cea15e05228dbde5f13654e23. Reason: Applied the fix for the Asan buildbot failures.	2023-06-22 16:10:18 +01:00
Simi Pallipurath	d8851384c6	Revert "[lld][Arm] Big Endian - Byte invariant support." This reverts commit 8cf8956897ce9bca3176c6339077b1ca17b27abc.	2023-06-20 17:27:44 +01:00
Simi Pallipurath	8cf8956897	[lld][Arm] Big Endian - Byte invariant support. Arm has BE8 big endian configuration called a byte-invariant(every byte has the same address on little and big-endian systems). When in BE8 mode: 1. Instructions are big-endian in relocatable objects but little-endian in executables and shared objects. 2. Data is big-endian. 3. The data encoding of the ELF file is ELFDATA2MSB. To support BE8 without an ABI break for relocatable objects,the linker takes on the responsibility of changing the endianness of instructions. At a high level the only difference between BE32 and BE8 in the linker is that for BE8: 1. The linker sets the flag EF_ARM_BE8 in the ELF header. 2. The linker endian reverses the instructions, but not data. This patch adds BE8 big endian support for Arm. To endian reverse the instructions we'll need access to the mapping symbols. Code sections can contain a mix of Arm, Thumb and literal data. We need to endian reverse Arm instructions as words, Thumb instructions as half-words and ignore literal data.The only way to find these transitions precisely is by using mapping symbols. The instruction reversal will need to take place after relocation. For Arm BE8 code sections (Section has SHF_EXECINSTR flag ) we inserted a step after relocation to endian reverse the instructions. The implementation strategy i have used here is to write all sections BE32 including SyntheticSections then endian reverse all code in InputSections via mapping symbols. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D150870	2023-06-20 14:08:21 +01:00
Stefan Pintilie	658f23fc46	[LLD] Emit DT_PPC64_OPT into the dynamic section As per section 4.2.2 of the PowerPC ELFv2 ABI, this value tells the dynamic linker which optimizations it is allowed to do. Specifically, the higher order bit of the two tells the dynamic linker that there may be multiple TOC pointers in the binary. When we resolve any NOTOC relocations during linking, we need to set this value because we may be calling TOC functions from NOTOC functions when the NOTOC function already clobbered the TOC pointer. In practice, this ensures that the PLT resolver always resolves the call to the GEP (global entry point) of the TOC function (which will set up the TOC for the TOC function). Original patch by nemanjai Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D150631	2023-06-05 12:18:29 -04:00
Ben Shi	229fcad7fc	[lld][ELF] Support relocations R_AVR_LO8_LDI_GS/R_AVR_HI8_LDI_GS Relocations R_AVR_LO8_LDI_GS/R_AVR_HI8_LDI_GS (indirect calls via function pointers) only cover range 128KiB. They are equivalent to R_AVR_LO8_LDI_PM/R_AVR_HI8_LDI_PM within this range. But for function addresses beyond this range, GNU-ld emits trampolines. And this patch implements corresponding thunks for them in lld. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D147364	2023-04-28 11:42:06 +08:00
Peter Smith	d0cdc5ddd7	[LLD][ELF][AArch64] Add AArch64 short range thunk support The AArch64 branch immediate instruction has a 128MiB range. This makes it suitable for use a short range thunk in the same way as short thunks are implemented in Arm and PPC. This patch adds support for short range thunks to AArch64. Adding short range thunk support should mean that OutputSections can grow to nearly 256 MiB in size without needing long-range indirect branches. Differential Revision: https://reviews.llvm.org/D148701	2023-04-24 13:48:22 +01:00
Daniel Kiss	60827df765	[lld][AArch64] Add BTI landing pad to PLT when it is accessed by a range extension thunk. Adding BTI to those PLT's which accessed with by a range extension thunk due to those preform an indirect call. Fixes: #62140 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D148704	2023-04-23 23:17:02 +02:00
Ben Shi	f1f6ca582e	[lld][ELF][NFC] Simplify method "Thunk *elf::addThunk()" Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D147124	2023-03-30 11:35:07 +08:00
Simi Pallipurath	2f68ddc604	[lld][ARM][2/3]Big Endian support - Word invariant support Changes: - Adding BE32 big endian Support for Arm. - Replace the writele and readle with their endian-aware versions. - Adding test cases for the big-endian be32 arm configuration. Patch by: Milosz Plichta. This patch merges all the changes from this patch https://reviews.llvm.org/D140203 as well. Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D140202	2023-03-29 10:21:00 +01:00
Fangrui Song	7198c87f42	[ELF][PPC64] Actually implement --no-power10-stubs When a caller that does not use TOC calls a function, a call stub is needed if the function may use TOC. --no-power10-stubs avoids PC-relative instructions in the code sequence. The --no-power10-stubs=no implementation added in D94627 is wrong. First, the first instruction incorrectly uses `mflr 0` (instead of `mflr 12`). Second, for the PLT case, it uses addis+addi with getVA instead of addis+ld with getGotPltVA.	2023-02-27 16:19:13 -08:00
Fangrui Song	8ce135123e	[ELF][PPC64] Merge PPC64R12SetupStub and PPC64PCRelPLTStub. NFC PPC64PCRelPLTStub (from D83669) duplicates lot of code from PPC64R12SetupStub. Just merge them. Note: PPC64R12SetupStub does not correctly handle long branch to a non-preemptible non-TOC code.	2023-02-27 14:33:18 -08:00
Simi Pallipurath	674f094d85	[lld][ARM][NFCI][1/3]Big Endian support - Removing assumptions Change: - Replacing the memcpy that assume little endian with the endian-aware write. Shouldn't affect the output for now, just a prerequisite for the next patches. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D140201	2023-02-15 11:42:49 +00:00
Ties Stuij	6f9ff1beee	[lld][ARM] support position independent thunks for Armv4(T) - Position independent thunks now work for both Armv4 and Armv4T - Armv4 arm->arm thunks don't emit a BX anymore, which doesn't exist for the arch. This fixes https://github.com/llvm/llvm-project/issues/50764. - Armv4 and Armv4T both have the same arm->arm behaviour. Which also is desirable for the above ticket. Reviewed By: MaskRay, peter.smith Differential Revision: https://reviews.llvm.org/D141272	2023-01-13 11:54:41 +00:00
Ties Stuij	747fc27ee4	[lld][ARM] don't use short thumb thunks if no branch range extension In ThumbThunk::isCompatibleWith, we check if we can use short thunks if we are within branch range. However these short thumb thunks will generate b.w instructions, and these are not available on pre branch range extension architectures. On these architectures (v4, v5, and most of v6), we could replace the b.w with a Thumb b (2) instruction, but that would in an ideal situation only give us an extra range of 2048 bytes on top of the 4MB range of a BL, if a thunk section happens to be placed on the outer range of a BL and the stars are aligned. It doesn't seem worth it. What would be worth it is a state change to Arm and a subsequent branch to either Arm or Thumb code. But that's the subject of another patch. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D140633	2023-01-09 11:45:49 +00:00
Ties Stuij	62c605771a	[lld][ARM] support absolute thunks for Armv4T Thumb and interworking changes: - BLX: The Arm architecture versions that support the branch and link instruction (BLX), can rewrite BLs in place when a state change from Arm<->Thumb is required. Armv4T does not have BLX and so needs thunks for state changes. - v4T Thumb long branches needed their own thunk. We could have used the v6M implementation, but v6M doesn't have Arm state and must resolve to rather inefficient stack reshuffling. We also can't reuse v7 thumb thunks as they use MOVV/MOVT, which wasn't available yet for v4T. - Remove the `lack of BLX' warning. LLVM only supports Arm Architecture versions upwards of v4, which we now all support in LLD. - renamed existing thunks to better reflect their use: ARMV5ABSLongThunk -> ARMV5LongLdrPcThunk, ARMV5PILongThunk -> ARMV4PILongThunk - removed isCompatibleWith method from ARMV5ABSLongThunk and ARMV5PILongThunk, as they were identical to the ARMThunk parent class implementation. Support for (efficient) position independent thunks for v4T will be added in a follow-up patch, including possible related thunk renaming and code comment cleanup. Reviewed By: MaskRay, peter.smith Differential Revision: https://reviews.llvm.org/D139888	2022-12-21 11:04:32 +00:00
Fangrui Song	4191fda69c	[ELF] Change most llvm::Optional to std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 19:19:15 -08:00
Stefan Pintilie	c1f3cffee1	[PowerPC][LLD] Change PPC64R2SaveStub to only use non-PC-relative code Currently the PPC64R2SaveStub thunk will produce Power 10 code by default. This produced an issue when linking older code that made use of the st_other=1 bit but was never meant to be linked or run on Power 10. This patch makes it so that only the R_PPC64_REL24_NOTOC relocation can produce Power 10 code. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D129580	2022-07-13 19:34:33 -05:00
Fangrui Song	38fbedab32	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 20:44:34 -08:00
Fangrui Song	27bb799095	[ELF] Clean up headers. NFC	2022-02-07 21:53:34 -08:00
Alexandre Ganea	83d59e05b2	Re-land [LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html The previous land f860fe362282ed69b9d4503a20e5d20b9a041189 caused issues in https://lab.llvm.org/buildbot/#/builders/123/builds/8383, fixed by 22ee510dac9440a74b2e5b3fe3ff13ccdbf55af3. Differential Revision: https://reviews.llvm.org/D108850	2022-01-20 14:53:26 -05:00
Alexandre Ganea	e6b153947d	Revert [LLD] Remove global state in lldCommon It seems to be causing issues on https://lab.llvm.org/buildbot/#/builders/123/builds/8383	2022-01-16 11:03:06 -05:00
Alexandre Ganea	f860fe3622	[LLD] Remove global state in lldCommon Move all variables at file-scope or function-static-scope into a hosting structure (lld::CommonLinkerContext) that lives at lldMain()-scope. Drivers will inherit from this structure and add their own global state, in the same way as for the existing COFFLinkerContext. See discussion in https://lists.llvm.org/pipermail/llvm-dev/2021-June/151184.html Differential Revision: https://reviews.llvm.org/D108850	2022-01-16 08:57:57 -05:00
Fangrui Song	bf45624ba0	[ELF][PPC32] Support .got2 in an output section description I added `PPC32Got2Section` D62464 to support .got2 but did not implement .got2 in another output section. PR52799 has a linker script placing .got2 in .rodata, which causes a null pointer dereference because a MergeSyntheticSection's file is nullptr. Add the support.	2021-12-23 11:32:44 -08:00
Fangrui Song	baa3eb0dd9	[ELF] Change some non-null pointer parameters to references. NFC	2021-12-22 20:51:11 -08:00
Fangrui Song	ad45df91ad	[ELF][PPC64] Remove unneeded PPC64PCRelLongBranchThunk This reverts the PPC64PCRelLongBranchThunk part from D86706. PPC64PCRelLongBranchThunk is the same as PPC64R12SetupStub. Use `__gep_setup_` instead of `__long_branch_pcrel_` for the stub symbol name as it more closely indicates the operation. (Note: GNU ld uses `.long_branch.` and `.plt_branch.`). Reviewed By: NeHuang, nemanjai Differential Revision: https://reviews.llvm.org/D114656	2021-11-30 11:33:17 -08:00
Fangrui Song	3b4dd68de5	[ELF][PPC64] Make --power10-stubs/--no-power10-stubs proper aliases for --power10-stubs={auto,no} This allows --power10-stubs= and --[no-]power10-stubs to override each other (they are position dependent in GNU ld). Also improve --help messages and the manpage. Note: GNU ld's default "auto" mode uses heuristics to decide whether Power10 instructions are used. Arguably it is a design mistake of R_PPC64_REL24_NOTOC (acked by the relevant folks on a libc-alpha discussion). We don't implement "auto", so the default --power10-stubs is the same as "yes".	2021-11-26 11:51:45 -08:00
Alex Richardson	6d87ca08ae	[ELF] Refactor DynamicReloc to fix incorrect relocation addends This patch changes the DynamicReloc class to store an enum instead of the overloaded useSymVA member to make it easier to understand and fix incorrect addends being written in some corner cases. The change is motivated by a follow-up review that checks the value of implicit Elf_Rel addends written to the output file. This patch fixes an incorrect output when using `-z rela` for i386 files with R_386_GOT32 relocations (not that this really matters since it's an unsupported configuration). Storing the relocation expression kind also addresses an incorrect addend FIXME in ppc64-abs64-dyn.s introduced in D63383. DynamicReloc now also has a special case for the MIPS TLS relocations (DynamicReloc::AgainstSymbolWithTargetVA) since the R_MIPS_TLS_TPREL{32/64} the symbol VA to the GOT for preemptible symbols. I'm not sure if the symbol value actually should be written for R_MIPS_TLS_TPREL32, but this patch does not attempt to change that behaviour. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100490	2021-07-09 10:41:40 +01:00
Stefan Pintilie	f0adf3a24c	[PowerPC][LLD] Make sure that the correct Thunks are used. This fixes an issue where mixed TOC / NOTOC calls can call the incorrect thunks if a previous thunk already exists. The issue appears when a TOC funciton calls a NOTOC callee and then a different NOTOC function calls the same NOTOC callee. In this case the linker would sometimes incorrectly call the same thunk for both cases. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101837	2021-05-06 12:00:04 -05:00
Albion Fung	36192790d8	[PowerPC][PC Rel] Implement option to omit Power10 instructions from stubs Implemented the option to omit Power10 instructions from save stubs via the option --no-power10-stubs or --power10-stubs=no on lld. --power10-stubs= will override the other option. --power10-stubs=auto also exists to use the default behaviour (ie allow Power10 instructions in stubs). Differential Revision: https://reviews.llvm.org/D94627	2021-03-04 13:27:46 -05:00
Peter Smith	e35929e026	[LLD][ELF][ARM] Refactor inBranchRange to use addend for PC Bias In AArch32 ARM, the PC reads two instructions ahead of the currently executiing instruction. This evaluates to 8 in ARM state and 4 in Thumb state. Branch instructions on AArch32 compensate for this by subtracting the PC bias from the addend. For a branch to symbol this will result in an addend of -8 in ARM state and -4 in Thumb state. The existing ARM Target::inBranchRange function accounted for this implict addend within the function meaning that if the addend were to be taken into account by the caller then it would be double counted. This complicates the interface for all Targets as callers wanting to account for addends had to account for the ARM PC-bias. In certain situations such as: https://github.com/ClangBuiltLinux/linux/issues/1305 the PC-bias compensation code didn't match up. In particular normalizeExistingThunk() didn't put the PC-bias back in as Arm thunks did not store the addend. The simplest fix for the problem is to add the PC bias in normalizeExistingThunk when restoring the addend. However I think it is worth refactoring the Arm inBranchRange implementation so that fewer calls to getPCBias are needed for other Targets. I wasn't able to remove getPCBias completely but hopefully the Relocations.cpp code is simpler now. In principle a test could be written to replicate the linux kernel build failure but I wasn't able to reproduce with a small example that I could build up from scratch. Fixes https://github.com/ClangBuiltLinux/linux/issues/1305 Differential Revision: https://reviews.llvm.org/D97550	2021-03-02 11:02:33 +00:00
Stefan Pintilie	d224175230	[PowerPC][LLD] Extend R2 save stub to support offsets of more than 26 bits The R2 save stub will now support offsets up to 64 bits. There are three cases that will be used. 1) The offset fits in 26 bits. ``` b <26 bit offset> ``` 2) The offset does not fit in 26 bits but fits in 34 bits. ``` paddi r12, 0, <34 bit offset>, 1 mtctr r12 bctr ``` 3) The offset does not fit in 34 bits. Since this is an R2 save stub we can use the TOC in R2. We are not loading the offset but the actual address we want to branch to. ``` addis r12, r2, <address in TOC lo> ld r12 <address in TOC hi>(r12) mtctr r12 bctr ``` In case 1) the stub is only 8 bytes while in cases 2) and 3) the stub will be 20 bytes. Reviewed By: MaskRay, sfertile, NeHuang Differential Revision: https://reviews.llvm.org/D87916	2020-09-25 06:39:14 -05:00
Victor Huang	967e29ff8c	[LLD][PowerPC][test] Update thunk range error report for PPC64PCRelLongBranchThunk Update the thunk range error report for PPC64PCRelLongBranchThunk and add a range error test case for PPC64R12SetupStub. Differential Revision: https://reviews.llvm.org/D87381	2020-09-22 07:37:54 -05:00
Fangrui Song	94921e9f8a	[ELF] Define a reportRangeError() overload for thunks and tidy up recent PPC64 thunk range errors Prefer `errorOrWarn` to `fatal` for recoverable errors and graceful degradation when --noinhibit-exec is specified. Mention the destination symbol, otherwise the diagnostic is not really actionable. Two errors are not tested but the patch does not intend to add the coverage. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D87486	2020-09-14 09:55:59 -07:00
Victor Huang	bfc7636612	[LLD][PowerPC] Add a pc-rel based long branch thunk In this patch, a pc-rel based long branch thunk is added for the local call protocol that caller and callee does not use TOC. Reviewed By: sfertile, nemanjai Differential Revision: https://reviews.llvm.org/D86706	2020-08-28 10:40:48 -05:00
Victor Huang	7b391245d8	[PowerPC] Fix thunk alignment issue when using pc-rel instruction Thunk alignment is added in thie patch when using pc-rel instructions to avoid crossing the 64 byte boundary. Patched by: nemanjai, NeHuang Reviewed By: sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D85973	2020-08-17 09:09:36 -05:00
Victor Huang	6c64f05b90	[PowerPC] Add compatibility check for PPC PLT stubs Compatibility checks for PPC64PltCallStub and PPC64PCRelPLTStub are added in this patch to prevent the usage of incompatible thunk/stub. Reviewed By: sfertile, nemanjai, stefanp Differential Revision: https://reviews.llvm.org/D85459	2020-08-07 13:45:18 +00:00
Victor Huang	8dbea4785c	[PowerPC] Support for R_PPC64_REL24_NOTOC calls where the caller has no TOC and the callee is not DSO local This patch supports the situation where caller does not have a valid TOC and calls using the R_PPC64_REL24_NOTOC relocation and the callee is not DSO local. In this case the call cannot be made directly since the callee may or may not require a valid TOC pointer. As a result this situation require a PC-relative plt stub to set up r12. Reviewed By: sfertile, MaskRay, stefanp Differential Revision: https://reviews.llvm.org/D83669	2020-07-29 19:49:28 +00:00
Victor Huang	91cce1a2bc	[PowerPC] Implement R_PPC64_REL24_NOTOC local calls, callee requires a TOC The PC Relative code now allows for calls that are marked with the relocation R_PPC64_REL24_NOTOC. This indicates that the caller does not have a valid TOC pointer in R2 and does not require R2 to be restored after the call. This patch is added to support local calls to callees that require a TOC Reviewed By: sfertile, MaskRay, nemanjai, stefanp Differential Revision: https://reviews.llvm.org/D83504	2020-07-20 17:46:49 +00:00
Stefan Pintilie	beb52b12cb	[PowerPC] Support PCRelative Callees for R_PPC64_REL24 Relocation The R_PPC64_REL24 is used in function calls when the caller requires a valid TOC pointer. If the callee shares the same TOC or does not clobber the TOC pointer then a direct call can be made. If the callee does not share the TOC a thunk must be added to save the TOC pointer for the caller. Up until PC Relative was introduced all local calls on medium and large code models were assumed to share a TOC. This is no longer the case because if the caller requires a TOC and the callee is PC Relative then the callee can clobber the TOC even if it is in the same DSO. This patch is to add support for a TOC caller calling a PC Relative callee that clobbers the TOC. Reviewed By: sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D82950	2020-07-09 09:50:19 -05:00
Leonard Chan	723b5a1785	[lld][ELF][AArch64] Handle R_AARCH64_PLT32 relocation This is the followup to D77647 which implements handling for the new R_AARCH64_PLT32 relocation type in lld. This relocation would benefit the PIC-friendly vtables feature described in D72959. Differential Revision: https://reviews.llvm.org/D81184	2020-06-23 16:10:07 -07:00
Fangrui Song	07837b8f49	[ELF] Use namespace qualifiers (lld:: or elf::) instead of `namespace lld { namespace elf {` Similar to D74882. This reverts much code from commit bd8cfe65f5fee4ad573adc2172359c9552e8cdc0 (D68323) and fixes some problems before D68323. Sorry for the churn but D68323 was a mistake. Namespace qualifiers avoid bugs where the definition does not match the declaration from the header. See https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions (D74515) Differential Revision: https://reviews.llvm.org/D79982	2020-05-15 08:49:53 -07:00
Fangrui Song	9195b01911	[ELF][PPC64] Enable R_PPC64_REL14 trunks The thunk implementation is available but an assertion disallows it. Linux kernel has such a use case: in arch/powerpc/kernel/exceptions-64s.S:handle_page_fault, beq+ ret_from_except_lite may get out of range. Link: https://github.com/ClangBuiltLinux/linux/issues/951 Differential Revision: https://reviews.llvm.org/D76904	2020-04-04 10:59:17 -07:00
Fangrui Song	70389be7a0	[ELF][PPC32] Support range extension thunks with addends * Generalize the code added in D70637 and D70937. We should eventually remove the EM_MIPS special case. * Handle R_PPC_LOCAL24PC the same way as R_PPC_REL24. Reviewed By: Bdragon28 Differential Revision: https://reviews.llvm.org/D73424	2020-01-25 22:32:42 -08:00
Fangrui Song	deb5819d62	[ELF] Rename relocateOne() to relocate() and pass `Relocation` to it Symbol information can be used to improve out-of-range/misalignment diagnostics. It also helps R_ARM_CALL/R_ARM_THM_CALL which has different behaviors with different symbol types. There are many (67) relocateOne() call sites used in thunks, {Arm,AArch64}errata, PLT, etc. Rename them to `relocateNoSym()` to be clearer that there is no symbol information. Reviewed By: grimar, peter.smith Differential Revision: https://reviews.llvm.org/D73254	2020-01-25 12:00:18 -08:00
Fangrui Song	bb87364f26	[ELF][PPC64] Improve "call lacks nop" diagnostic and make it compatible with GCC<5.5 and GCC<6.4 GCC before r245813 (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79439) did not emit nop after b/bl. This can happen with recursive calls. r245813 was back ported to GCC 5.5 and GCC 6.4. This is common, for example, libstdc++.a(locale.o) shipped with GCC 4.9 and many objects in netlib lapack can cause lld to error. gold allows such calls to the same section. Our __plt_foo symbol's `section` field is used for ThunkSection, so we can't implement a similar loosen rule easily. But we can make use of its `file` field which is currently NULL. Differential Revision: https://reviews.llvm.org/D71639	2019-12-29 23:05:11 -08:00
Fangrui Song	fb2944bd7f	[ELF][PPC32] Implement IPLT code sequence for non-preemptible IFUNC Similar to D71509 (EM_PPC64), on EM_PPC, the IPLT code sequence should be similar to a PLT call stub. Unlike EM_PPC64, EM_PPC -msecure-plt has small/large PIC model differences. * -fpic/-fpie: R_PPC_PLTREL24 r_addend=0. The call stub loads an address relative to `_GLOBAL_OFFSET_TABLE_`. * -fPIC/-fPIE: R_PPC_PLTREL24 r_addend=0x8000. (A partial linked object file may have an addend larger than 0x8000.) The call stub loads an address relative to .got2+0x8000. Just assume large PIC model for now. This patch makes: // clang -fuse-ld=lld -msecure-plt -fno-pie -no-pie a.c // clang -fuse-ld=lld -msecure-plt -fPIE -pie a.c #include <stdio.h> static void impl(void) { puts("meow"); } void thefunc(void) __attribute__((ifunc("resolver"))); void resolver(void) { return &impl; } int main(void) { thefunc(); void (theptr)(void) = &thefunc; theptr(); } work on Linux glibc. -fpie will crash because the compiler and the linker do not agree on the value which r30 stores (_GLOBAL_OFFSET_TABLE_ vs .got2+0x8000). Differential Revision: https://reviews.llvm.org/D71621	2019-12-29 22:42:53 -08:00
Fangrui Song	45acc35ac2	[ELF][PPC64] Implement IPLT code sequence for non-preemptible IFUNC Non-preemptible IFUNC are placed in in.iplt (.glink on EM_PPC64). If there is a non-GOT non-PLT relocation, for pointer equality, we change the type of the symbol from STT_IFUNC and STT_FUNC and bind it to the .glink entry. On EM_386, EM_X86_64, EM_ARM, and EM_AARCH64, the PLT code sequence loads the address from its associated .got.plt slot. An IPLT also has an associated .got.plt slot and can use the same code sequence. On EM_PPC64, the PLT code sequence is actually a bl instruction in .glink . It jumps to `__glink_PLTresolve` (the PLT header). and `__glink_PLTresolve` computes the .plt slot (relocated by R_PPC64_JUMP_SLOT). An IPLT does not have an associated R_PPC64_JUMP_SLOT, so we cannot use `bl` in .iplt . Instead, create a call stub which has a similar code sequence as PPC64PltCallStub. We don't save the TOC pointer, so such scenarios will not work: a function pointer to a non-preemptible ifunc, which resolves to a function defined in another DSO. This is the restriction described by https://sourceware.org/glibc/wiki/GNU_IFUNC (though on many architectures it works in practice): Requirement (a): Resolver must be defined in the same translation unit as the implementations. If an ifunc is taken address but not called, technically we don't need an entry for it, but we currently do that. This patch makes // clang -fuse-ld=lld -fno-pie -no-pie a.c // clang -fuse-ld=lld -fPIE -pie a.c #include <stdio.h> static void impl(void) { puts("meow"); } void thefunc(void) __attribute__((ifunc("resolver"))); void resolver(void) { return &impl; } int main(void) { thefunc(); void (theptr)(void) = &thefunc; theptr(); } work on Linux glibc and FreeBSD. Calling a function pointer pointing to a Non-preemptible IFUNC never worked before. Differential Revision: https://reviews.llvm.org/D71509	2019-12-29 22:40:03 -08:00
Fangrui Song	c8f0d3e130	[ELF][PPC64] Support long branch thunks with addends Fixes PPC64 part of PR40438 // clang -target ppc64le -c a.cc // .text.unlikely may be placed in a separate output section (via -z keep-text-section-prefix) // The distance between bar in .text.unlikely and foo in .text may be larger than 32MiB. static void foo() {} __attribute__((section(".text.unlikely"))) static int bar() { foo(); return 0; } __attribute__((used)) static int dummy = bar(); This patch makes such thunks with addends work for PPC64. AArch64: .text -> `__AArch64ADRPThunk_ (adrp x16, ...; add x16, x16, ...; br x16)` -> target PPC64: .text -> `__long_branch_ (addis 12, 2, ...; ld 12, ...(12); mtctr 12; bctr)` -> target AArch64 can leverage ADRP to jump to the target directly, but PPC64 needs to load an address from .branch_lt . Before Power ISA v3.0, the PC-relative ADDPCIS was not available. .branch_lt was invented to work around the limitation. Symbol::ppc64BranchltIndex is replaced by PPC64LongBranchTargetSection::entry_index which take addends into consideration. The tests are rewritten: ppc64-long-branch.s tests -no-pie and ppc64-long-branch-pi.s tests -pie and -shared. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D70937	2019-12-05 10:17:45 -08:00
Fangrui Song	bf535ac4a2	[ELF][AArch64] Support R_AARCH64_{CALL26,JUMP26} range extension thunks with addends Fixes AArch64 part of PR40438 The current range extension thunk framework does not handle a relocation relative to a STT_SECTION symbol with a non-zero addend, which may be used by jumps/calls to local functions on some RELA targets (AArch64, powerpc ELFv1, powerpc64 ELFv2, etc). See PR40438 and the following code for examples: // clang -target $target a.cc // .text.cold may be placed in a separate output section. // The distance between bar in .text.cold and foo in .text may be larger than 128MiB. static void foo() {} __attribute__((section(".text.cold"))) static int bar() { foo(); return 0; } __attribute__((used)) static int dummy = bar(); This patch makes such thunks with addends work for AArch64. The target independent part can be reused by PPC in the future. On REL targets (ARM, MIPS), jumps/calls are not represented as STT_SECTION + non-zero addend (see MCELFObjectTargetWriter::needsRelocateWithSymbol), so they don't need this feature, but we need to make sure this patch does not affect them. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D70637	2019-12-02 10:07:24 -08:00
Nico Weber	5976a3f5aa	Fix a few typos in lld/ELF to cycle bots	2019-10-28 21:41:47 -04:00

1 2 3

122 Commits