llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-29 11:46:06 +00:00

Author	SHA1	Message	Date
Kazu Hirata	f97c610d1f	[memprof] Add MemProfReader::takeMemProfData (#116769 ) This patch adds MemProfReader::takeMemProfData, a function to return the complete MemProf profile from the reader. We can directly pass its return value to InstrProfWriter::addMemProfData without having to deal with the indivual components of the MemProf profile. The new function is named "take", but it doesn't do std::move yet because of type differences (DenseMap v.s. MapVector). The end state I'm trying to get to is roughly as follows: - MemProfReader accepts IndexedMemProfData as a parameter as opposed to the three individual components (frames, call stacks, and records). - MemProfReader keeps IndexedMemProfData as a class member without decomposing it into its individual components. - MemProfReader returns IndexedMemProfData like: IndexedMemProfData takeMemProfData() { return std::move(MemProfData); }	2024-11-19 19:33:26 -08:00
lifengxiang1025	314e9b1cff	[llvm-profdata] fix typo (#116754 )	2024-11-20 10:52:16 +08:00
Aiden Grossman	842fd15375	[llvm-exegesis] Add explicit support for setting DF in X86 (#115644 ) While llvm-exegesis has explicit support for setting EFLAGS which contains DF, it can be nice sometimes to explicitly set DF, especially given that it is modeled as a separate register within LLVM. This patch adds the ability to do that by lowering setting the value to 0 or 1 to cld and std respectively.	2024-11-18 12:06:52 -08:00
Matt Arsenault	a6fc489bb7	AMDGPU: Add gfx950 subtarget definitions (#116307 ) Mostly a stub, but adds some baseline tests and tests for removed instructions.	2024-11-18 10:41:14 -08:00
Fangrui Song	2444b6f0df	[llvm-objcopy] Replace custom -- parsing with DashDashParsing The custom -- parsing from https://reviews.llvm.org/D102665 can be replaced with the generic feature from https://reviews.llvm.org/D152286 Pull Request: https://github.com/llvm/llvm-project/pull/116565	2024-11-18 09:09:06 -08:00
Lang Hames	224290d448	[ORC] Add LazyObjectLinkingLayer, lazy-linking support to llvm-jitlink (#116002 ) LazyObjectLinkingLayer can be used to add object files that will not be linked into the executor unless some function that they define is called at runtime. (References to data members defined by these objects will still trigger immediate linking) To implement lazy linking, LazyObjectLinkingLayer uses the lazyReexports utility to construct stubs for each function in a given object file, and an ObjectLinkingLayer::Plugin to rename the function bodies at link-time. (Data symbols are not renamed) The llvm-jitlink utility is extended with a -lazy option that can be passed before input files or archives to add them using the lazy linking layer rather than the base ObjectLinkingLayer.	2024-11-18 11:17:36 +11:00
Aiden Grossman	5bb9465d35	[llvm-exegesis] Reserve members in array This patch reserves members in the RegisterSetUp vector as we statically know the size.	2024-11-17 02:32:04 +00:00
Kazu Hirata	0d38f64e7d	[memprof] Remove MemProf format Version 0 (#116442 ) This patch removes MemProf format Version 0 now that version 2 and 3 seem to be working well. I'm not touching version 1 for now because some tests still rely on version 1. Note that Version 0 is identical to Version 1 except that the MemProf section of the indexed format has a MemProf version field.	2024-11-15 15:37:00 -08:00
Cyndy Ishida	2d48489cc3	[Clang][Darwin] Introduce `SubFrameworks` as a SDK default location (#115048 ) * Have clang always append & pass System/Library/SubFrameworks when determining default sdk search paths. * Teach clang-installapi to traverse there for framework input. * Teach llvm-readtapi that the library files (TBD or binary) in there should be considered private. resolves: rdar://137457006	2024-11-15 09:27:08 -08:00
Matin Raayai	bb3f5e1fed	Overhaul the TargetMachine and LLVMTargetMachine Classes (#111234 ) Following discussions in #110443, and the following earlier discussions in https://lists.llvm.org/pipermail/llvm-dev/2017-October/117907.html, https://reviews.llvm.org/D38482, https://reviews.llvm.org/D38489, this PR attempts to overhaul the `TargetMachine` and `LLVMTargetMachine` interface classes. More specifically: 1. Makes `TargetMachine` the only class implemented under `TargetMachine.h` in the `Target` library. 2. `TargetMachine` contains target-specific interface functions that relate to IR/CodeGen/MC constructs, whereas before (at least on paper) it was supposed to have only IR/MC constructs. Any Target that doesn't want to use the independent code generator simply does not implement them, and returns either `false` or `nullptr`. 3. Renames `LLVMTargetMachine` to `CodeGenCommonTMImpl`. This renaming aims to make the purpose of `LLVMTargetMachine` clearer. Its interface was moved under the CodeGen library, to further emphasis its usage in Targets that use CodeGen directly. 4. Makes `TargetMachine` the only interface used across LLVM and its projects. With these changes, `CodeGenCommonTMImpl` is simply a set of shared function implementations of `TargetMachine`, and CodeGen users don't need to static cast to `LLVMTargetMachine` every time they need a CodeGen-specific feature of the `TargetMachine`. 5. More importantly, does not change any requirements regarding library linking. cc @arsenm @aeubanks	2024-11-14 13:30:05 -08:00
Kyungwoo Lee	d23c5c2d65	[CGData] Global Merge Functions (#112671 ) This implements a global function merging pass. Unlike traditional function merging passes that use IR comparators, this pass employs a structurally stable hash to identify similar functions while ignoring certain constant operands. These ignored constants are tracked and encoded into a stable function summary. When merging, instead of explicitly folding similar functions and their call sites, we form a merging instance by supplying different parameters via thunks. The actual size reduction occurs when identically created merging instances are folded by the linker. Currently, this pass is wired to a pre-codegen pass, enabled by the `-enable-global-merge-func` flag. In a local merging mode, the analysis and merging steps occur sequentially within a module: - `analyze`: Collects stable function hashes and tracks locations of ignored constant operands. - `finalize`: Identifies merge candidates with matching hashes and computes the set of parameters that point to different constants. - `merge`: Uses the stable function map to optimistically create a merged function. We can enable a global merging mode similar to the global function outliner (https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753/), which will perform the above steps separately. - `-codegen-data-generate`: During the first round of code generation, we analyze local merging instances and publish their summaries. - Offline using `llvm-cgdata` or at link-time, we can finalize all these merging summaries that are combined to determine parameters. - `-codegen-data-use`: During the second round of code generation, we optimistically create merging instances within each module, and finally, the linker folds identically created merging instances. Depends on #112664 This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-11-13 17:34:07 -08:00
Shilei Tian	de0fd64bed	[AMDGPU] Introduce a new generic target `gfx9-4-generic` (#115190 ) This patch introduces a new generic target, `gfx9-4-generic`. Since it doesn’t support FP8 and XF32-related instructions, the patch includes several code reorganizations to accommodate these changes.	2024-11-12 23:11:05 -05:00
Kazu Hirata	4048c64306	[llvm] Remove redundant control flow statements (NFC) (#115831 ) Identified with readability-redundant-control-flow.	2024-11-12 10:09:42 -08:00
Lang Hames	dc11c06015	[ORC] Move absoluteSymbols from Core.h to new AbsoluteSymbols.h header. NFC. Continuing Core.h clean-up. If you see any errors about a missing absoluteSymbols function you need to include the new AbsoluteSymbols.h header.	2024-11-11 11:39:53 +11:00
Douglas	d822c099ee	[JITLink] Use `rsplit` on `-sectcreate` argument in llvm-jitlink (#115511 ) This accounts for cases where the file path may contain an `@` symbol. In such cases, the split occurs too early causing argument parsing to fail.	2024-11-10 19:19:02 +11:00
Thomas Fransham	f8fea5d49b	[llvm] Add explicit visibility macros to YAMLTraits classes (#111484 ) These symbols need to be exported for llvm-pdbutil when using windows shared library builds. Exclude the YAML traits declared in llvm-pdbutil so there not declared as dllimported which will causing missing symbol errors for windows shared library builds. This is part of the work to enable LLVM_BUILD_LLVM_DYLIB and plugins on window.	2024-11-09 19:42:43 +02:00
Antonio Frighetto	60972a893e	[ObjectYAML][ELF] Allow verdaux entry offset to be user-defined	2024-11-08 18:44:02 +01:00
Yingwei Zheng	3356eb3b15	[llvm-reduce] Reduce samesign flag from icmp (#115492 )	2024-11-09 00:12:09 +08:00
zhijian lin	e373ba46bb	[llvm-objdump] Implement decoding auxiliary header for xcoff with llvm-objdump --private-headers (#105682 ) Implement decoding auxiliary header of XCOFF object file with llvm-objdump --private-headers	2024-11-07 10:53:11 -05:00
Thomas Fransham	c87d198cd9	[ORC] Switch to new visibility macros for JIT debug symbols (#113848 ) Use LLVM_ALWAYS_EXPORT for __jit_debug_descriptor and __jit_debug_register_code so there exported even if LLVM is not built as a shared library. This is part of the work to enable LLVM_BUILD_LLVM_DYLIB and plugins on windows #109483.	2024-11-07 17:11:40 +02:00
Sjoerd Meijer	6720ce75f6	[Docs][llvm-exegesis] Clarify AArch64 support (#114989 ) Claiming AArch64 support for llvm-exegesis is a bit of a stretch in my opinion as only a couple of opcodes with GPR64 operands will work for snippet benchmarking, so I propose to clarify that AArch64 support is very experimental. Also added some clarifications about its libpfm4 dependency.	2024-11-07 10:48:52 +00:00
abhishek-kaushik22	d2aff182d3	Revert "TLS loads opimization (hoist)" (#114740 ) This reverts commit c31014322c0b5ae596da129cbb844fb2198b4ef4. Based on the discussions in #112772, this pass is not needed after the introduction of `llvm.threadlocal.address` intrinsic. Fixes https://github.com/llvm/llvm-project/issues/112771.	2024-11-07 10:10:28 +01:00
Ryan Mansfield	bbc3af0577	[dsymutil] Add missing newlines in error messages. (#115191 ) Errors like "cannot create bundle: Not a directory" or "error: a.out.dSYM: Is a directory" were being emitted without a newline.	2024-11-06 15:54:47 -08:00
Kazu Hirata	3297858c19	[llvm-readobj] Use heterogenous lookups with std::map (NFC) (#114929 ) Heterogenous lookups allow us to call find with StringRef, avoiding a temporary heap allocation of std::string.	2024-11-05 09:38:11 -08:00
Youngsuk Kim	ba60f6dc03	Remove leftover uses of llvm::Type::getPointerTo() (#114993 ) `llvm::Type::getPointerTo()` is to be deprecated. Replace remaining uses of it.	2024-11-05 10:00:23 -05:00
Kyungwoo Lee	ffcf3c8688	[CGData][llvm-cgdata] Support for stable function map (#112664 ) This introduces a new cgdata format for stable function maps. The raw data is embedded in the __llvm_merge section during compile time. This data can be read and merged using the llvm-cgdata tool, into an indexed cgdata file. Consequently, the tool is now capable of handling either outlined hash trees, stable function maps, or both, as they are orthogonal. Depends on #112662. This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-11-04 17:32:50 -08:00
Kazu Hirata	bc3c83fe78	[sancov] Use heterogeneous lookups with std::map (NFC) (#113406 )	2024-11-04 09:39:55 -08:00
Aiden Grossman	074209034f	[llvm-exegesis] Use older instructions to load lower vregs (#114768 ) This patch makes X86 llvm-exegesis unconditionally use older instructions to load the lower vector registers, rather than trying to use AVX512 for everything when available. This fixes a case where we would try and load AVX512 registers using the older instructions if such a snippet was constructed while -mcpu was set to something that did not support AVX512. This would lead to a machine code verification error rather than resulting in incomplete snippet setup, which seems to be the intention of how this should work. Fixes #114691.	2024-11-04 09:04:31 -08:00
Shilei Tian	390300d9f4	[PassBuilder] Add `ThinOrFullLTOPhase` to optimizer pipeline (#114577 )	2024-11-03 23:25:29 -05:00
Shilei Tian	dc45ff1d2a	[PassBuilder] Add `ThinOrFullLTOPhase` to early simplication EP call backs (#114547 ) The early simplication pipeline is used in non-LTO and (Thin/Full)LTO pre-link stage. There are some passes that we want them in non-LTO mode, but not at LTO pre-link stage. The control is missing currently. This PR adds the support. To demonstrate the use, we only enable the internalization pass in non-LTO mode for AMDGPU because having it run in pre-link stage causes some issues.	2024-11-03 23:24:10 -05:00
Teresa Johnson	bb3915149a	[MemProf] Support for random hotness when writing profile (#113998 ) Add support for generating random hotness in the memprof profile writer, to be used for testing. The random seed is printed to stderr, and an additional option enables providing a specific seed in order to reproduce a particular random profile.	2024-10-29 22:10:33 -07:00
tf2spi	f23bdbbaff	Add DILabel functions for LLVM-C (#112840 ) Addresses #112799	2024-10-28 10:59:53 -07:00
Fangrui Song	92412c106f	[llvm-objdump] Handle -M for --macho --macho -d uses the `parseInputMachO` code path, which does not handle -M. Add -M handling for --macho as well. Close #61019 Pull Request: https://github.com/llvm/llvm-project/pull/113795	2024-10-28 09:40:15 -07:00
Edd Dawson	d4dd770289	[llvm-cxxfilt] De-emphasize "function" in llvm-cxxfilt docs and --help (#113309 ) llvm-cxxfilt can demangle names of data symbols, in addition to function names. $ llvm-cxxfilt _ZN6garden5gnomeE garden::gnome And type names too, on request: $ llvm-cxxfilt -t i int Update some overly specific the wording in the --help and documentation that suggests otherwise.	2024-10-23 13:03:30 +01:00
Mészáros Gergely	7ab6d39a4d	[LLVM][CMake][MSVC] Wrap linker flags for ICX on Windows (#112680 ) The Intel C++ Compiler (ICX) passes linker flags through the driver unlike MSVC and clang-cl, and therefore needs them to be prefixed with `/Qoption,link` (the equivalent of `-Wl,` for gcc on *nix). Use `LINKER:` prefix wherever supported by cmake, when that's not possible fall-back to `${CMAKE_CXX_LINKER_WRAPPER_FLAG}`. CMake replaces these with `/Qoption,link` for ICX and with the empty string for MSVC and clang-cl. For `target_link_libraries` neither `LINKER:` (not supported prior to CMake 3.32) nor `${CMAKE_CXX_LINKER_WRAPPER_FLAG}` (does not begin with `-` would be taken as a library name) works, use `-Qoption,link` directly within a conditional generator expression that we're linking with ICX. For MSVC and clang-cl no functional change is intended. Tested by compiling with ICX and setting `CMAKE_(EXE\|SHARED\|STATIC\|MODULE)_LINKER_FLAGS_INIT` to `-Werror=unknown-argument`. RFC: https://discourse.llvm.org/t/rfc-cmake-linker-flags-need-wl-equivalent-for-intel-c-icx-on-windows/82446	2024-10-23 13:03:25 +02:00
Carl Ritson	076aac59ac	[AMDGPU] Add a new target for gfx1153 (#113138 )	2024-10-23 12:56:58 +09:00
Jinsong Ji	d985197156	[NFC][Fuzzer] Refactor to avoid a false warning from gcc (#112944 ) This is one of the many PRs to fix errors with LLVM_ENABLE_WERROR=on. Built by GCC 11. Refactor the code to avoid the false warning llvm-project/llvm/tools/llvm-isel-fuzzer/llvm-isel-fuzzer.cpp llvm-project/llvm/tools/llvm-isel-fuzzer/llvm-isel-fuzzer.cpp: In function ‘int LLVMFuzzerInitialize(int, char*)’: llvm-project/llvm/tools/llvm-isel-fuzzer/llvm-isel-fuzzer.cpp:141:43: error: ISO C++ forbids zero-size array ‘argv’ [-Werror=pedantic] 141 \| ExitOnError ExitOnErr(std::string(argv[0]) + ": error:"); \|	2024-10-22 17:39:05 -04:00
Jonas Devlieghere	5886454669	[dsymutil] Provide an option to ignore object timestamp mismatches (#113238 ) Provide a option (--no-object-timestamp) to ignore object file timestamp mismatches. We already have a similar option for Swift modules (--no-swiftmodule-timestamp). rdar://123975869	2024-10-22 09:48:02 -07:00
Jonas Devlieghere	2ccbea1d73	[dsymutil] Share one BinaryHolder between debug map parsing & linking (#113234 ) I (re)discovered that dsymutil was instantiating two BinaryHolders: one for parsing the debug map and one for linking. That really defeats the purpose of the BinaryHolder as it serves as a cache. Fix the issue and remove an old FIXME.	2024-10-22 09:40:34 -07:00
Kazu Hirata	da66f6a280	[llvm-jitlink] Use heterogenous lookups with std::map (NFC) (#113245 )	2024-10-22 07:58:50 -07:00
Kazu Hirata	61a286ac08	[tools] Don't call StringRef::str() when calling StringMap::find (NFC) (#113119 ) StringMap::find takes StringRef. We don't need to create an instance of std::string from StringRef only to convert it right back to StringRef.	2024-10-21 06:50:34 -07:00
Ronan Keryell	d582442bec	[llvm-cxxfilt] Add --quote option to quote demangled function names (#111871 ) This is useful when looking at LLVM/MLIR assembly produced from C++ sources. For example cir.call @_ZN3aie4tileILi1ELi4EE7programIZ4mainE3$_0EEvOT_(%2, %7) : will be translated to cir.call @"void aie::tile<1, 4>::program<main::$_0>(main::$_0&&)"(%2, %7) : which can be parsed as valid MLIR by the right mlir-lsp-server. If a symbol is already quoted, do not quote it more. --------- Co-authored-by: James Henderson <jh7370@my.bristol.ac.uk>	2024-10-21 08:54:04 +01:00
Kazu Hirata	3bddf85e52	[sancov] Avoid repeated map lookups (NFC) (#113026 )	2024-10-20 10:41:42 -07:00
NAKAMURA Takumi	4a011ac84f	[Coverage] Introduce "partial fold" on BranchRegion (#112694 ) Currently both True/False counts were folded. It lost the information, "It is True or False before folding." It prevented recalling branch counts in merging template instantiations. In `llvm-cov`, a folded branch is shown as: - `[True: n, Folded]` - `[Folded, False n]` In the case If `n` is zero, a branch is reported as "uncovered". This is distinguished from "folded" branch. When folded branches are merged, `Folded` may be dissolved. In the coverage map, either `Counter` is `Zero`. Currently both were `Zero`. Since "partial fold" has been introduced, either case in `switch` is omitted as `Folded`. Each `case:` in `switch` is reported as `[True: n, Folded]`, since `False` count doesn't show meaningful value. When `switch` doesn't have `default:`, `switch (Cond)` is reported as `[Folded, False: n]`, since `True` count was just the sum of `case`(s). `switch` with `default` can be considered as "the statement that doesn't have any `False`(s)".	2024-10-20 12:30:35 +09:00
Kazu Hirata	f4136b3265	[llvm-diff] Avoid repeated hash lookups (NFC) (#113025 )	2024-10-19 14:42:25 -07:00
Kazu Hirata	a99bf0f6c9	[llvm-readtapi] Simplify code with StringMap::operator[] (NFC) (#112824 )	2024-10-18 08:40:05 -07:00
Shimin Cui	0205667010	[LTO] Add function alias as function instead of data (#112599 ) On AIX, for undefined functions, only the dotnamed symbols (the address of the function) are generated after linking (i.e., no named function symbol is generated). Currently, all alias symbols are added as defined data symbols when parsing symbols in LTOModule (the Link Time Optimization library used by linker to optimization code at link time). On AIX, if the function alias is used in the native object, and only its dotnamed symbol is generated, the linker will have problem to match the dotnamed symbol from the native object and the defined symbol marked as data from the bitcode at LTO linktime. This patch is to add function alias as function instead of data.	2024-10-17 12:10:05 -04:00
Lang Hames	7937fe1a17	[ORC][llvm-jitlink] Add support for forced loading of archive members. This patch adds support for forced loading of archive members, similar to the behavior of the -all_load and -ObjC options in ld64. To enable this, the StaticLibraryDefinitionGenerator class constructors are extended with a VisitMember callback that is called on each member file in the archive at generator construction time. This callback can be used to unconditionally add the member file to a JITDylib at that point. To test this the llvm-jitlink utility is extended with -all_load (all platforms) and -ObjC (darwin only) options. Since we can't refer to symbols in the test objects directly (these would always cause the member to be linked in, even without the new flags) we instead test side-effects of force loading: execution of constructors and registration of Objective-C metadata. rdar://134446111	2024-10-16 15:21:51 +11:00
Tyler Kenney	1b6a46ab8e	[ORC][COFF] Remove the `ExecutionSession&` argument to `COFFPlatform` factory & constructor (#112419 ) We can get a reference to the `ExecutionSession` from the `ObjectLinkingLayer` argument, so there's no need to pass it in separately. This mirrors recent changes to `ElfNixPlatform` and `MachOPlatform` by @lhames in `3dba4ca155` and `cc20dd285a`.	2024-10-16 10:57:58 +11:00
Sunho Kim	04af63b267	[ORC] Introduce IRPartitionLayer for common partition functionality.	2024-10-12 01:21:05 +09:00

1 2 3 4 5 ...

15643 Commits