llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-05-02 09:06:06 +00:00

Author	SHA1	Message	Date
Matt Arsenault	70feafdb27	IR/AMDGPU: Autoupgrade amdgpu-unsafe-fp-atomics attribute (#101698 ) Delete the attribute and annotate any atomicrmw instructions in the function with new metadata.	2024-08-12 14:56:53 +04:00
Alexis Engelke	b7cd564fa3	[IR] Don't verify module flags on every access (#102153 ) 8b4306ce050bd5 introduced validity checks for every module flag access, because the auto-upgrader uses named metadata before verifying the module. This causes overhead for all other accesses, and the check is, in fact, only needed at that single place. Change the upgrader to be careful when accessing module flags before the module is verified and remove the checks on all other occasions. There are two tangential optimizations included: first, when querying a specific flag, don't enumerate all other flags into a vector as well. Second, don't use a Twine for getNamedMetadata(), which has materialization overhead -- all call sites use simple strings that can be implicitly converted to a StringRef.	2024-08-06 18:33:26 +02:00
Justin Holewinski	9374f83a73	Outline X86 autoupgrade patterns (#97851 ) Outlining these patterns has a significant impact on the overall stack frame size of llvm::UpgradeIntrinsicCall. This is helpful for scenarios where compilation threads are stack-constrained. The overall impact is low when using clang as the host compiler, but very pronounced when using MSVC 2022 with release builds. Clang: 1,624 -> 824 bytes MSVC: 23,560 -> 6,120 bytes	2024-07-06 09:24:36 -04:00
Matt Arsenault	f55bcc5dbe	AMDGPU: Add amdgpu.no.fine.grained.memory when upgrading old atomic intrinsics (#89655 ) This should replicate the old intrinsic behavior better when codegen of the raw instruction will require metadata in the future.	2024-06-27 19:52:23 +02:00
Matt Arsenault	4477ff6836	AMDGPU: Remove ds_fmin/ds_fmax intrinsics (#96739 ) These have been replaced with atomicrmw.	2024-06-27 15:35:24 +02:00
Stephen Tozer	d75f9dd1d2	Revert "[IR][NFC] Update IRBuilder to use InsertPosition (#96497 )" Reverts the above commit, as it updates a common header function and did not update all callsites: https://lab.llvm.org/buildbot/#/builders/29/builds/382 This reverts commit 6481dc57612671ebe77fe9c34214fba94e1b3b27.	2024-06-24 18:00:22 +01:00
Stephen Tozer	6481dc5761	[IR][NFC] Update IRBuilder to use InsertPosition (#96497 ) Uses the new InsertPosition class (added in #94226) to simplify some of the IRBuilder interface, and removes the need to pass a BasicBlock alongside a BasicBlock::iterator, using the fact that we can now get the parent basic block from the iterator even if it points to the sentinel. This patch removes the BasicBlock argument from each constructor or call to setInsertPoint. This has no functional effect, but later on as we look to remove the `Instruction *InsertBefore` argument from instruction-creation (discussed [here](https://discourse.llvm.org/t/psa-instruction-constructors-changing-to-iterator-only-insertion/77845)), this will simplify the process by allowing us to deprecate the InsertPosition constructor directly and catch all the cases where we use instructions rather than iterators.	2024-06-24 17:27:43 +01:00
Matt Arsenault	70c8b9c24a	AMDGPU: Remove ds atomic fadd intrinsics (#95396 ) These have been replaced with atomicrmw fadd	2024-06-23 10:30:20 +02:00
Simon Pilgrim	2615e69ec2	[IR] AutoUpgrade.cpp - don't directly dereference pointers from dyn_cast Static analysis was reporting that dyn_cast<> can return null on failure - use cast<> instead	2024-06-21 17:42:01 +01:00
hev	46edc02eaa	[LoongArch] Adjust LA64 data layout by using n32:64 in layout string (#93814 ) Although i32 type is illegal in the backend, LA64 has pretty good support for i32 types by using W instructions. By adding n32 to the DataLayout string, middle end optimizations will consider i32 to be a native type. One known effect of this is enabling LoopStrengthReduce on loops with i32 induction variables. This can be beneficial because C/C++ code often has loops with i32 induction variables due to the use of `int` or `unsigned int`. If this patch exposes performance issues, those are better addressed by tuning LSR or other passes.	2024-06-06 14:05:56 +08:00
Doug Wyatt	ddecadabeb	[clang backend] In AArch64's DataLayout, specify a minimum function alignment of 4. (#90702 ) This addresses an issue where the explicit alignment of 2 (for C++ ABI reasons) was being propagated to the back end and causing under-aligned functions (in special sections). This is an alternate approach suggested by @efriedma-quic in PR #90415. Fixes #90358	2024-05-05 19:05:15 -07:00
Kazu Hirata	4e6f6fda8b	[IR] Use StringRef::operator== instead of StringRef::equals (NFC) (#90550 ) I'm planning to remove StringRef::equals in favor of StringRef::operator==. - StringRef::operator== outnumbers StringRef::equals by a factor of 22 under llvm/ in terms of their usage. - The elimination of StringRef::equals brings StringRef closer to std::string_view, which has operator== but not equals. - S == "foo" is more readable than S.equals("foo"), especially for !Long.Expression.equals("str") vs Long.Expression != "str".	2024-04-30 12:23:31 -07:00
Maciej Gabka	bfc0317153	Move several vector intrinsics out of experimental namespace (#88748 ) This patch moves the following intrinsics out of the experimental namespace: * vector.interleave2/deinterleave2 * vector.reverse * vector.splice. All these intrinsics have existed in LLVM for more than a year and are widely used, so they should not be considered experimental.	2024-04-29 10:16:45 +01:00
Paul Walker	0fa1f1f2d1	[LLVM][SVE] Separate the int and floating-point variants of addqv. (#89762 ) We only use common intrinsics for operations that treat their element type as a container of bits.	2024-04-26 11:25:55 +01:00
Alex Voicu	1120d8e6f7	[clang][CodeGen] Add AS for Globals to SPIR & SPIRV datalayouts (#88455 ) Currently neither the SPIR nor the SPIRV targets specify the AS for globals in their datalayout strings. This is problematic because CodeGen/LLVM will default to AS0 in this case, which produces Globals that end up in the private address space for e.g. OCL, HIPSPV or SYCL. This patch addresses it by completing the datalayout string.	2024-04-16 11:37:29 +01:00
Arthur Eubanks	5d6d8dcd29	[clang][llvm] Remove "implicit-section-name" attribute (#87906 ) D33412/D33413 introduced this to support a clang pragma to set section names for a symbol depending on if it would be placed in bss/data/rodata/text, which may not be known until the backend. However, for text we know that only functions will go there, so just directly set the section in clang instead of going through a completely separate attribute. Autoupgrade the "implicit-section-name" attribute to directly setting the section on a Function.	2024-04-11 12:29:29 -07:00
Stephen Tozer	ed5fe66370	[RemoveDIs][BC] Reject intrinsic->record upgrades for old-format modules (#87494 ) Fixes issue noted at: https://github.com/llvm/llvm-project/pull/86274 When loading bitcode lazily, we may request debug intrinsics be upgraded to debug records during the module parsing phase; later on we perform this upgrade when materializing the module functions. If we change the module's debug info format between parsing and materializing however, then the requested upgrade is no longer correct and leads to an assertion. This patch fixes the issue by adding an extra check in the autoupgrader to see if the upgrade is no longer suitable, and either exit-out or fall back to the correct intrinsic->intrinsic upgrade if one is required.	2024-04-04 10:53:36 +01:00
Stephen Tozer	bdc77d1ecc	[RemoveDIs][NFC] Rename DPLabel->DbgLabelRecord (#85918 ) This patch renames DPLabel to DbgLabelRecord, in accordance with the ongoing DbgRecord rename. This rename was fairly trivial, since DPLabel isn't as widely used as DPValue and has no real conflicts in either its full or abbreviated name. As usual, the entire replacement was done automatically, with `s/DPLabel/DbgLabelRecord/` and `s/DPL/DLR/`.	2024-03-20 13:11:28 +00:00
Stephen Tozer	ffd08c7759	[RemoveDIs][NFC] Rename DPValue -> DbgVariableRecord (#85216 ) This is the major rename patch that prior patches have built towards. The DPValue class is being renamed to DbgVariableRecord, which reflects the updated terminology for the "final" implementation of the RemoveDI feature. This is a pure string substitution + clang-format patch. The only manual component of this patch was determining where to perform these string substitutions: `DPValue` and `DPV` are almost exclusively used for DbgRecords, except for: - llvm/lib/target, where 'DP' is used to mean double-precision, and so appears as part of .td files and in variable names. NB: There is a single existing use of `DPValue` here that refers to debug info, which I've manually updated. - llvm/tools/gold, where 'LDPV' is used as a prefix for symbol visibility enums. Outside of these places, I've applied several basic string substitutions, with the intent that they only affect DbgRecord-related identifiers; I've checked them as I went through to verify this, with reasonable confidence that there are no unintended changes that slipped through the cracks. The substitutions applied are all case-sensitive, and are applied in the order shown: ``` DPValue -> DbgVariableRecord DPVal -> DbgVarRec DPV -> DVR ``` Following the previous rename patches, it should be the case that there are no instances of any of these strings that are meant to refer to the general case of DbgRecords, or anything other than the DPValue class. The idea behind this patch is therefore that pure string substitution is correct in all cases as long as these assumptions hold.	2024-03-19 20:07:07 +00:00
Orlando Cazalet-Hyams	835c1b56a8	[RemoveDIs] Auto-upgrade debug intrinsics to DbgRecords (default false) (#85650 ) If --load-bitcode-into-experimental-debuginfo-iterators is true then debug intrinsics are auto-upgraded to DbgRecords (the new debug info format). The upgrade is trivial because the two representations are semantically identical. llvm.dbg.value with 4 operands and llvm.dbg.addr intrinsics are upgraded in the same way as usual, but converted directly into DbgRecords instead of debug intrinsics.	2024-03-19 13:28:43 +00:00
Fraser Cormack	67c5a98cae	[IR][NFC] Suppress warnings in ternary operators Just doing this the same way as in AMDGPUPromoteAlloca.cpp	2024-03-18 17:17:08 +00:00
Daniel Kiss	4b0276d1c9	Revert "[llvm][AArch64] Autoupgrade function attributes from Module attributes." (#85291 ) Reverts llvm/llvm-project#82763 because it caused regressions with inlining. See https://github.com/llvm/llvm-project/pull/84494#issuecomment-1996047458	2024-03-14 21:11:37 +01:00
Emma Pilkington	4490003a22	[AMDGPU] Rename COV module flag to amdhsa_code_object_version (#79905 ) The previous name 'amdgpu_code_object_version', was misleading since this is really a property of the HSA OS. The new spelling also matches the asm directive I added in bc82cfb.	2024-03-06 09:51:48 -05:00
Dani	ded5de11fa	[llvm][AArch64] Autoupgrade function attributes from Module attributes. (#82763 ) sign-return-address and similar module attributes should be propagated to the function level before modules get merged because module flags may contradict and this information is not recoverable. Generated code will match with the normal linking flow. Refactored version of (#80640). Run the attribute copy only during IRMove.	2024-03-04 11:12:52 +01:00
Daniel Kiss	b13c8e5099	Revert "[llvm][AArch64] Autoupgrade function attributes from Module attributes. (#80640 )" This reverts commit 531e8c26b3f2626e7f1a997e0e8b61d67d10aded.	2024-02-23 10:24:15 +01:00
Dani	531e8c26b3	[llvm][AArch64] Autoupgrade function attributes from Module attributes. (#80640 ) `sign-return-address` and similar module attributes should be propagated to the function level before modules got merged because module flags may contradict and this information is not recoverable. Generated code will match with the normal linking flow.	2024-02-23 09:04:33 +01:00
Shubham Sandeep Rastogi	6ce03ff3fe	Revert "[IR] Use range-based for loops (NFC)" This reverts commit e8512786fedbfa6ddba70ceddc29d7122173ba5e. This revert is done because llvm::drop_begin over an empty ArrayRef doesn't return an empty range, and therefore can lead to an invalid address returned instead. See discussion in https://github.com/llvm/llvm-project/pull/80737 for more context.	2024-02-05 15:33:21 -08:00
Kazu Hirata	e8512786fe	[IR] Use range-based for loops (NFC)	2024-01-31 23:54:05 -08:00
Nathan Sidwell	0880742a60	[NFC] Rename internal fns (#77994 ) Internal functions should use a lowerCaseName, thus renamed.	2024-01-20 14:23:37 -05:00
Kazu Hirata	c6cfd5350e	[llvm] Use StringRef::contains (NFC)	2024-01-19 00:19:36 -08:00
Alex MacLean	430a40d12e	[NVPTX] extend type support for nvvm.{min,max,mulhi,sad} (#78385 ) Ensure intrinsics and auto-upgrades support i16, i32, and i64 for `nvvm.{min,max,mulhi,sad}` - `nvvm.min` and `nvvm.max`: These are auto-upgraded to `select` instructions but it is still nice to support the 16 bit variants just in case any generators of IR are still trying to use these intrinsics. - `nvvm.sad` added both the 16 and 64 bit variants, also marked this instruction as speculatable. These directly correspond to the PTX `sad.{u16,s16,u64,s64}` instructions. - `nvvm.mulhi` added the 16 bit variants. These directly correspond to the PTX `mul.hi.{s,u}16` instructions.	2024-01-17 16:18:39 -08:00
Kazu Hirata	c0cb80338f	[IR] Use StringRef::consume_front (NFC)	2024-01-14 00:53:26 -08:00
Nathan Sidwell	31626dadce	[llvm][NFC] Refactor AutoUpgrader arm/aarch64 (#74145 ) Break out and refactor AArch64 & ARM intrinsic updating. There's a fair amount of commonality, but let's avoid continually checking the same prefixes.	2024-01-05 13:50:44 -05:00
Kazu Hirata	395f9ce30e	Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-16 10:14:44 -08:00
Jessica Del	32f9983c06	[AMDGPU] - Add address space for strided buffers (#74471 ) This is an experimental address space for strided buffers. These buffers can have structs as elements and a stride > 1. These pointers allow the indexed access in units of stride, i.e., they point at `buffer[index * stride]`. Thus, we can use the `idxen` modifier for buffer loads. We assign address space 9 to 192-bit buffer pointers which contain a 128-bit descriptor, a 32-bit offset and a 32-bit index. Essentially, they are fat buffer pointers with an additional 32-bit index.	2023-12-15 15:49:25 +01:00
Kazu Hirata	586ecdf205	[llvm] Use StringRef::{starts,ends}_with (NFC) (#74956 ) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-11 21:01:36 -08:00
Nikita Popov	a87738f86b	[AutoUpgrade] Don't try to upgrade struct return of non-intrinsic This code should only be run for intrinsics known to LLVM (otherwise it will crash), not for everything that starts with "llvm.".	2023-12-08 17:18:20 +01:00
Nikita Popov	e309667769	[AutoUpgrade] Simplify vclz upgrade (NFC) We can use Intrinsic::getDeclaration() here, we just have to pass the correct arguments. This function accepts only the mangled types, not all argument types.	2023-12-04 16:30:00 +01:00
Nathan Sidwell	d04a4a06ab	[llvm] Adjust Autoupdater's llvm prefix detection (#74142 ) Use consume_front to swallow the 'llvm.' prefix, and 'empty' to check there's at least one character left.	2023-12-02 11:57:41 -05:00
Nathan Sidwell	91b2559a6a	[nvptx] Fix autoupdater's intrinsic matcher (#73330 ) Fix nvptx autoupdater's intrinsic matcher's typo'd names that used `_` (underbar), rather than '.' (dot), as a separator.	2023-12-01 14:52:38 -05:00
Nathan Sidwell	adc6b43ee1	[llvm][NFC] Autoupdater AMD intrinsic detection (#73331 ) Check atomic prefix before looking for atomic instructions	2023-12-01 14:50:39 -05:00
Nathan Sidwell	770dc47659	[llvm][NFC] Refactor autoupdater's 'c' intrinsics (#73333 ) With these three intrinsics it's probably faster to check the number of arguments first and then check the names. We can also handle ctlz and cttz in the same block.	2023-11-30 13:29:03 +09:00
Nathan Sidwell	fcf5ac84a6	[llvm][NFC] Autoupdater x86 intrinsic selection (#73046 ) Sort x86 intrinsics and use prefix checking.	2023-11-25 08:02:39 -05:00
Nathan Sidwell	d34ac0ee72	[llvm][NFC] Autoupdater x86 detection (#72808 ) Sort x86 intrinsics for better readability and use common prefixes to reduce number of comparisons.	2023-11-21 12:52:12 -05:00
Simon Pilgrim	939fd6c37c	[AutoUpgrade] Use StringRef::starts_with/ends_with instead of startswith/endswith. NFC. startswith/endswith wrap starts_with/ends_with and will eventually go away (to more closely match string_view)	2023-11-06 13:27:36 +00:00
Harald van Dijk	a21abc782a	[X86] Align i128 to 16 bytes in x86 datalayouts This is an attempt at rebooting https://reviews.llvm.org/D28990 I've included AutoUpgrade changes to modify the data layout to satisfy the compatible layout check. But this does mean alloca, loads, stores, etc in old IR will automatically get this new alignment. This should fix PR46320. Reviewed By: echristo, rnk, tmgross Differential Revision: https://reviews.llvm.org/D86310	2023-10-11 10:23:38 +01:00
Youngsuk Kim	e5026f0179	[llvm] Remove uses of Type::getPointerTo() (NFC) Partial progress towards removing in-tree uses of `getPointerTo()`, by employing the following options: * Drop the call entirely if the sole purpose of it is to support a no-op bitcast (remove the no-op bitcast as well). * Replace with `PointerType::get()`/`PointerType::getUnqual()` This is a NFC cleanup effort. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D155232	2023-09-22 19:44:38 -04:00
Anton Korobeynikov	51d5d7bbae	Extend `retcon.once` coroutines lowering to optionally produce a normal result (#66333 ) One of the main users of this kind of coroutine is Swift. There, yield-once (`retcon.once`) coroutines are used to temporarily "expose" pointers to internal fields of various objects, creating borrow scopes. However, in some cases it might also be useful to allow these coroutines to produce a normal result, but there is no convenient way to represent this (as compared to switched-resume coroutines, where C++ `co_return` is transformed to a member / callback call on the promise object). The extension is simple: we allow the continuation function to have a non-void result and accept optional extra arguments via a special `llvm.coro.end.result` intrinsic that essentially forwards them as normal results.	2023-09-15 09:54:38 -07:00
Matt Arsenault	edecb60481	Reapply "AMDGPU: Drop and auto-upgrade llvm.amdgcn.ldexp to llvm.ldexp" This reverts commit d9333e360a7c52587ab6e4328e7493b357fb2cf3.	2023-09-13 08:38:48 +03:00
Nathan Sidwell	b045c36ab9	[llvm][NFC]Refactor AutoUpgrader case 'n'. The NVPTX intrinsics are under 'n'. Use the consume_front API, so fix that. Refactor the helper function to group matchers on the first component and check that first. Do similarly with the final set of intrinsics, which have a lot of commonality in the matching. Finally reorder the argument/return type checking wrt name checking -- the former is going to be cheaper, so do that first before checking the name. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D158445	2023-08-22 16:32:53 -04:00


497 Commits