llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-22 21:06:04 +00:00

Author	SHA1	Message	Date
Pranav Kant	8389d6fad7	[lld] Merge equivalent symbols found during ICF (#134342) Fixes a correctness issue for AArch64 when ADRP and LDR instructions are outlined in separate sections and sections are fed to ICF for deduplication. See test case (based on https://github.com/llvm/llvm-project/issues/129122) for details. All rodata.* sections are folded into a single section with ICF. This leads to all f2_* function sections getting folded into one (as their relocation target symbols g* belong to .rodata.g* sections that have already been folded into one). Since relocations still refer to the original g* symbols, we end up creating duplicate GOT entries for all such symbols. This PR addresses that by tracking such folded symbols and creating one GOT entry for all such symbols. Fixes https://github.com/llvm/llvm-project/issues/129122 Co-authored by: @jyknight	2025-04-21 15:17:03 -07:00
Fangrui Song	712264b83c	[ELF] Merge parseSymbolVersion and computeIsPreemptible ICF needs isPreemptible, which can be combined with parseSymbolVersion.	2024-12-08 17:50:17 -08:00
Fangrui Song	10b98473e8	[ELF] Simplify --print-icf-sections	2024-11-29 18:26:29 -08:00
Fangrui Song	1cd627562b	[ELF] Remove unneeded Twine in ELFSyncStream	2024-11-24 12:13:02 -08:00
Fangrui Song	3b75a5c4c8	[ELF] Replace message(...) with Msg(ctx)	2024-11-16 15:34:42 -08:00
Fangrui Song	58a971f42f	[ELF] Replace context-less toString(x) with toStr(ctx, x) so that we can remove the global `ctx` from toString implementations. Rename lld::toString (to lld::elf::toStr) to simplify name lookup (we have many llvm::toString and another lld::toString(const llvm::opt::Arg &)).	2024-11-16 11:58:10 -08:00
Fangrui Song	c13258ac49	[ELF] Replace log with Log(ctx)	2024-11-07 09:30:20 -08:00
Fangrui Song	f1dccda1b5	[ELF] Pass Ctx & to Symbols	2024-10-06 17:05:43 -07:00
Fangrui Song	bab5d5b6b0	[ELF] Pass Ctx & to ICF and SymbolTable	2024-09-29 14:45:00 -07:00
Fangrui Song	df0864e761	[ELF] Move elf::symtab into Ctx Remove the global variable `symtab` and add a member variable (`std::unique_ptr<SymbolTable>`) to `Ctx` instead. This is one step toward eliminating global states. Pull Request: https://github.com/llvm/llvm-project/pull/109612	2024-09-23 10:33:43 -07:00
Fangrui Song	eba30b3370	[ELF] Replace config-> with ctx.arg. in [IS]*.cpp	2024-09-21 12:47:47 -07:00
Fangrui Song	e82f0838ae	[ELF] --icf: don't fold a section without relocation and a section with relocations for SHT_CREL Similar to commit 686cff17cc310884e48ae963bf7507f96950cc90 for SHT_REL (#57693). CREL hasn't been tested with ICF before. And avoid a pitfall that eqClass[0] might interfere with ICF.	2024-09-18 23:06:12 -07:00
Fangrui Song	40e8e4ddcb	[ELF] Move partitions into ctx. NFC Ctx was introduced in March 2022 as a more suitable place for such singletons.	2024-09-15 14:52:56 -07:00
Fangrui Song	4629aa1797	[ELF] Move script into Ctx. NFC Ctx was introduced in March 2022 as a more suitable place for such singletons. We now use default-initialization for `LinkerScript` and should pay attention to non-class types (e.g. `dot` is initialized by commit 503907dc505db1e439e7061113bf84dd105f2e35).	2024-08-21 21:23:28 -07:00
Fangrui Song	0af07c0787	[ELF] Support relocatable files using CREL with explicit addends ... using the temporary section type code 0x40000020 (`clang -c -Wa,--crel,--allow-experimental-crel`). LLVM will change the code and break compatibility (Clang and lld of different versions are not guaranteed to cooperate, unlike other features). CREL with implicit addends is not supported. --- Introduce `RelsOrRelas::crels` to iterate over SHT_CREL sections and update users to check `crels`. (The decoding performance is critical and error checking is difficult. Follow `skipLeb` and `R_LEB128` handling, do not use `llvm::decodeULEB128`, which compiles to a lot of code.) A few users (e.g. .eh_frame, LLDDwarfObj, s390x) require random access. Pass `/*supportsCrel=*/false` to `relsOrRelas` to allocate a buffer and convert CREL to RELA (`relas` instead of `crels` will be used). Since allocating a buffer increases memory usage, the conversion is only performed when absolutely necessary. --- Non-alloc SHT_CREL sections may be created in -r and --emit-relocs links. SHT_CREL and SHT_RELA components need reencoding since r_offset/r_symidx/r_type/r_addend may change. (r_type may change because relocations referencing a symbol in a discarded section are converted to `R_*_NONE`). * SHT_CREL components: decode with `RelsOrRelas` and re-encode (`OutputSection::finalizeNonAllocCrel`) * SHT_RELA components: convert to CREL (`relToCrel`). An output section can only have one relocation section. * SHT_REL components: print an error for now. SHT_REL to SHT_CREL conversion for -r/--emit-relocs is complex and unsupported yet. Link: https://discourse.llvm.org/t/rfc-crel-a-compact-relocation-format-for-elf/77600 Pull Request: https://github.com/llvm/llvm-project/pull/98115	2024-08-01 10:22:03 -07:00
Fangrui Song	6efc3774bd	[ELF] Add Relocs and invokeOnRelocs. NFC Relocs is to simplify CREL support (#98115) while invokeOnRelocs simplifies some relsOrRelas call sites that will use the CREL iterator.	2024-07-27 10:55:17 -07:00
Fangrui Song	45146082e6	[ELF] relocateNonAlloc & ICF: replace random access iterators of relocations with input iterators. NFC Also replace one `this->file` with a local variable as some function calls are opaque to the compiler, causing unneeded reload.	2024-04-10 23:11:24 -07:00
Fangrui Song	f6455606bb	[ELF] Move getSymbol/getRelocTargetSym from ObjFile<ELFT> to InputFile. NFC This removes lots of unneeded `template getFile<ELFT>()`.	2024-03-10 23:01:26 -07:00
Fangrui Song	f4b4bc2f18	[ELF] --icf: switch to xxh3_64bits for more efficient section content hashing. Also clean up the tests a bit and properly test the formatting of --print-icf-sections.	2023-07-18 17:57:31 -07:00
Fangrui Song	8d85c96e0e	[lld] StringRef::{starts,ends}with => {starts,ends}_with. NFC The latter form is now preferred to be similar to C++20 starts_with. This replacement also removes one function call when startswith is not inlined.	2023-06-05 14:36:19 -07:00
Fangrui Song	686cff17cc	[ELF] --icf: don't fold a section without relocation and a section with relocations for SHT_REL Fix https://github.com/llvm/llvm-project/issues/57693	2023-02-22 23:34:49 -08:00
Fangrui Song	2bf5d86422	[ELF] Change rawData to content() and data() to contentMaybeDecompress() Clarify data() which may trigger decompression and make it feasible to refactor the member variable rawData.	2022-11-20 22:43:22 +00:00
Fangrui Song	14f996dca8	[ELF] Move inputSections/ehInputSections into Ctx. NFC	2022-10-16 00:49:48 -07:00
Fangrui Song	9c626d4a0d	[ELF] Remove symtab indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection.	2022-10-01 14:46:49 -07:00
Fangrui Song	34fa860048	[ELF] Remove ctx indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection. We can move other global variables into ctx without indirection concern. In the long term we may consider passing Ctx as a parameter to various functions and eliminate global state as much as possible and then remove `Ctx::reset`.	2022-10-01 12:06:33 -07:00
Sam Clegg	2cd4cd9a32	[lld][ELF] Rename SymbolTable::symbols() to SymbolTable::getSymbols(). NFC This change renames this method to match its original name and the name used in the wasm linker. Back in d8f8abbd4a2823f223bd7bc56445541fb221b512 the ELF SymbolTable method `getSymbols()` was replaced with `forEachSymbol`. Then in a2fc96441788fba1e4709d63677f34ed8e321dae `forEachSymbol` was replaced with a `llvm::iterator_range`. Then in e9262edf0d11a907763098d8e101219ccd9c43e9 we came full circle and the `llvm::iterator_range` was replaced with a `symbols()` accessor that was identical to the original `getSymbols()`. `getSymbols` also matches the name used elsewhere in the ELF linker as well as in both the COFF and wasm backends (e.g. `InputFiles.h` and `SyntheticSections.h`) Differential Revision: https://reviews.llvm.org/D130787	2022-08-19 14:56:08 -07:00
Fangrui Song	c72973608d	[ELF] Combine EhInputSection removal and MergeInputSection removal. NFC	2022-07-29 00:39:57 -07:00
Fangrui Song	9a572164d5	[ELF] Move InputFiles global variables (memoryBuffers, objectFiles, etc) into Ctx. NFC	2022-06-29 18:53:38 -07:00
Nico Weber	7effcbda49	Rename parallelForEachN to just parallelFor Patch created by running: rg -l parallelForEachN | xargs sed -i '' -c 's/parallelForEachN/parallelFor/' No behavior change. Differential Revision: https://reviews.llvm.org/D128140	2022-06-19 17:49:00 -04:00
Fangrui Song	6c814931bc	[ELF] Don't use multiple inheritance for OutputSection. NFC Add an OutputDesc class inheriting from SectionCommand. An OutputDesc wraps an OutputSection. This change allows InputSection::getParent to be inlined. Differential Revision: https://reviews.llvm.org/D120650	2022-03-08 11:23:42 -08:00
Fangrui Song	b01430a04f	[ELF] Don't rely on Symbols.h's transitive inclusion of InputFiles.h. NFC	2022-02-23 19:18:24 -08:00
Fangrui Song	ae1ba6194f	[ELF] Replace uncompressed InputSectionBase::data() with rawData. NFC In many call sites we know uncompression cannot happen (non-SHF_ALLOC, or the data (even if compressed) must have been uncompressed by a previous pass). Prefer rawData in these cases. data() increases code size and prevents optimization on rawData.	2022-02-21 00:39:26 -08:00
Fangrui Song	27bb799095	[ELF] Clean up headers. NFC	2022-02-07 21:53:34 -08:00
Fangrui Song	196aedb843	[ELF] Change vector<InputSection *> to SmallVector. NFC My x86-64 lld executable is 8KiB smaller.	2022-02-01 00:14:21 -08:00
Fangrui Song	7330fd236e	[ELF] Simplify Symbol::includeInDynsym	2022-01-15 23:27:45 -08:00
Fangrui Song	e1b6b5be46	[ELF] Avoid referencing SectionBase::repl after ICF It is fairly easy to forget SectionBase::repl after ICF. Let ICF rewrite a Defined symbol's `section` field to avoid references to SectionBase::repl in subsequent passes. This slightly improves the --icf=none performance due to less indirection (maybe for --icf={safe,all} as well if most symbols are Defined). With this change, there is only one reference to `repl` (--gdb-index D89751). We can undo f4fb5fd7523f8e3c3b3966d43c0a28457b59d1d8 (`Move Repl to SectionBase.`) but move `repl` to `InputSection` instead. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D116093	2021-12-24 12:09:48 -08:00
Fangrui Song	7051aeef7a	[ELF] Rename BaseCommand to SectionCommand. NFC BaseCommand was picked when PHDRS/INSERT/etc were not implemented. Rename it to SectionCommand to match `sectionCommands` and make it clear that the commands are used in SECTIONS (except a special case for SymbolAssignment). Also, improve naming of some BaseCommand variables (base -> cmd).	2021-11-25 20:24:23 -08:00
Fangrui Song	6188fd4957	[ELF] Rename OutputSection::sectionCommands to commands. NFC This partially reverts r315409: the description applies to LinkerScript, but not to OutputSection. The name "sectionCommands" is used in both LinkerScript::sectionCommands and OutputSection::sectionCommands, which may lead to confusion. "commands" in OutputSection has no ambiguity because there are no other types of commands.	2021-11-25 16:47:07 -08:00
Fangrui Song	ecc93ed2d7	[ELF] Replace InputSectionBase::{areRelocsRela,firstRelocation,numRelocation} with relSecIdx For `InputSection` `.foo`, its `InputSectionBase::{areRelocsRela,firstRelocation,numRelocation}` basically encode the information of `.rel[a].foo`. However, one uint32_t (the relocation section index) suffices. See the implementation of `relsOrRelas`. This change decreases sizeof(InputSection) from 184 to 176 on 64-bit Linux. The maximum resident set size linking a large application (1.2G output) decreases by 0.39%. Differential Revision: https://reviews.llvm.org/D112513	2021-10-27 09:51:07 -07:00
Andrew Ng	88ce27c39c	[LLD][ELF] Improve ICF for relocations to ineligible sections via "aliases" ICF was not able to merge equivalent sections because of relocations to sections ineligible for ICF that use alternative symbols, e.g. symbol aliases or section relative relocations. Merging in this scenario has been enabled by giving the sections that are ineligible for ICF a unique ID, i.e. an equivalence class of their own. This approach also provides another benefit as it improves the hashing that is used to perform the initial equivalence grouping for ICF. This is because the ICF ineligible sections can now contribute a unique value towards the hashes instead of the same value of zero. This has been seen to reduce link time with ICF by ~68% for objects compiled with -fprofile-instr-generate. In order to facilitate this use of a unique ID, the existing inconsistent approach to the setting of the InputSection eqClass in ICF has been changed so that there is a clear distinction between the eqClass values of ICF eligible sections and those of the ineligible sections that have a unique ID. This inconsistency could have caused incorrect equivalence class equality in the past, although it appears that no issues were encountered in actual use. Differential Revision: https://reviews.llvm.org/D88830	2020-10-15 12:43:14 +01:00
Fangrui Song	164a02d0fa	[ELF] --icf: don't fold sections referencing sections with LSDA after D84610	2020-08-07 13:42:25 -07:00
Fangrui Song	21b4f8060a	[ELF] --icf: don't fold text sections with LSDA Fix PR36272 and PR46835 A .eh_frame FDE references a text section and (optionally) a LSDA (in .gcc_except_table). Even if two text sections have identical content and relocations (e.g. a() and b()), we cannot fold them if their LSDA are different. ``` void foo(); void a() { try { foo(); } catch (int) { } } void b() { try { foo(); } catch (float) { } } ``` Scan .eh_frame pieces with LSDA and disallow referenced text sections to be folded. If two .gcc_except_table have identical semantics (usually identical content with PC-relative encoding), we will lose folding opportunity. For ClickHouse (an exception-heavy application), this can reduce --icf=all efficiency from 9% to 5%. There may be some percentage we can reclaim without affecting correctness, if we analyze .eh_frame and .gcc_except_table sections. gold 2.24 implemented a more complex fix (resolution to https://sourceware.org/bugzilla/show_bug.cgi?id=21066) which combines the checksum of .eh_frame CIE/FDE pieces. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D84610	2020-08-05 09:16:28 -07:00
Fangrui Song	07837b8f49	[ELF] Use namespace qualifiers (lld:: or elf::) instead of `namespace lld { namespace elf {` Similar to D74882. This reverts much code from commit bd8cfe65f5fee4ad573adc2172359c9552e8cdc0 (D68323) and fixes some problems before D68323. Sorry for the churn but D68323 was a mistake. Namespace qualifiers avoid bugs where the definition does not match the declaration from the header. See https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions (D74515) Differential Revision: https://reviews.llvm.org/D79982	2020-05-15 08:49:53 -07:00
Reid Kleckner	932f0276ea	[Support] Move LLD's parallel algorithm wrappers to support Essentially takes the lld/Common/Threads.h wrappers and moves them to the llvm/Support/Parallel.h algorithm header. The changes are: - Remove policy parameter, since all clients use `par`. - Rename the methods to `parallelSort` etc to match LLVM style, since they are no longer C++17 pstl compatible. - Move algorithms from llvm::parallel:: to llvm::, since they have "parallel" in the name and are no longer overloads of the regular algorithms. - Add range overloads - Use the sequential algorithm directly when 1 thread is requested (skips task grouping) - Fix the index type of parallelForEachN to size_t. Nobody in LLVM was using any other parameter, and it made overload resolution hard for for_each_n(par, 0, foo.size(), ...) because 0 is int, not size_t. Remove Threads.h and update LLD for that. This is a prerequisite for parallel public symbol processing in the PDB library, which is in LLVM. Reviewed By: MaskRay, aganea Differential Revision: https://reviews.llvm.org/D79390	2020-05-05 15:21:05 -07:00
Fangrui Song	eb4663d8c6	[lld][COFF][ELF][WebAssembly] Replace --[no-]threads /threads[:no] with --threads={1,2,...} /threads:{1,2,...} --no-threads is a name copied from gold. gold has --no-thread, --thread-count and several other --thread-count-*. There are needs to customize the number of threads (running several lld processes concurrently or customizing the number of LTO threads). Having a single --threads=N is a straightforward replacement of gold's --no-threads + --thread-count. --no-threads is used rarely. So just delete --no-threads instead of keeping it for compatibility for a while. If --threads= is specified (ELF,wasm; COFF /threads: is similar), --thinlto-jobs= defaults to --threads=, otherwise all available hardware threads are used. There is currently no way to override a --threads={1,2,...}. It is still a debate whether we should use --threads=all. Reviewed By: rnk, aganea Differential Revision: https://reviews.llvm.org/D76885	2020-03-31 08:46:12 -07:00
Russell Gallop	e7cb374433	[LLD][ELF] Add time-trace to ELF LLD This adds some of LLD specific scopes and picks up optimisation scopes via LTO/ThinLTO. Makes use of TimeProfiler multi-thread support added in 77e6bb3c. Differential Revision: https://reviews.llvm.org/D71060	2020-02-06 12:14:13 +00:00
Kazuaki Ishizaki	7ae3d33546	[lld] Fix trivial typos in comments Reviewed By: ruiu, MaskRay Differential Revision: https://reviews.llvm.org/D72196	2020-01-06 10:25:48 -08:00
Rui Ueyama	69da7e29de	Revert an accidental commit af5ca40b47b3e85c3add81ccdc0b787c4bc355ae	2019-12-13 15:17:40 +09:00
Rui Ueyama	af5ca40b47	temporary	2019-12-13 14:35:03 +09:00
Fangrui Song	6e513a5382	[ELF] Move a computeIsPreemptible() pass into ICF. NFC Address post-commit review for D71163. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D71326	2019-12-10 22:21:05 -08:00

164 Commits