llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-19 10:26:42 +00:00

Author	SHA1	Message	Date
Mark de Wever	566916131e	[libc++][CI] Use latest Docker image. (#130497 ) This image was updated in #130433.	2025-03-19 18:16:46 +01:00
Alexander Shaposhnikov	297f0b3f4c	[CudaSPIRV] Allow using integral non-type template parameters as attribute args (#131546 ) Allow using integral non-type template parameters as attribute arguments of reqd_work_group_size and work_group_size_hint. Test plan: ninja check-all	2025-03-19 10:11:18 -07:00
Kerry McLaughlin	d09ecb07c2	[AArch64][CodeGen] Add +lsfe atomicrmw tests where the result is unused. (#132022 ) Following the discussion on #131174, update generate-tests.py script to emit atomicrmw tests where the result is unused and add a note to explain why these do use ST[F]ADD.	2025-03-19 17:10:02 +00:00
Sarah Spall	dd17c649ab	[HLSL] Implement min and max overloads using templates (#131666 ) Replace min and max overload implementation using macros with one using templates. Enable overloads of the forms: vector<T,N> min/max(vector<T,N> p0, T p1) vector<T,N> min/max(T p0, vector<T,N> p1) Add new tests. Closes #131170	2025-03-19 09:57:46 -07:00
Reid Kleckner	5475834737	[sancov] Use comdats when one already exists (#131929 ) This code avoids adding comdat groups to interposable linkage types (weak, linkonce (non-ODR)) to avoid changing semantics, since comdat elimination happens before weak/strong prevailaing symbol resolution. However, if the function is already in a comdat, we can add to the group without changing the semantics of the linked program. Fixes an issue uncovered in PR #126240	2025-03-19 09:47:32 -07:00
Andy Kaylor	39ce99589b	[CIR] Upstream cir-canonicalize pass (#131891 ) This change introduces the cir-canonicalize pass. This is a simple cir-to-cir transformation that eliminates empty scopes and redundant branches. It will be expanded in future changes to simplify other redundant instruction sequences. MLIR verification and mlir-specific command-line option handling is also introduced here.	2025-03-19 09:42:03 -07:00
Krzysztof Parzyszek	cd26dd5595	[flang][OpenMP] Use OmpDirectiveSpecification in simple directives (#131162 ) The `OmpDirectiveSpecification` contains directive name, the list of arguments, and the list of clauses. It was introduced to store the directive specification in METADIRECTIVE, and could be reused everywhere a directive representation is needed. In the long term this would unify the handling of common directive properties, as well as creating actual constructs from METADIRECTIVE by linking the contained directive specification with any associated user code.	2025-03-19 11:34:40 -05:00
Peng Liu	b350bc2c0b	[libc++] Verify forward_list self-merging is a no-op (#129985 ) https://wg21.link/LWG3088 requires that `forward_list::merge()` is a no-op when passed `*this`, which aligns with the behavior of `list::merge`. Although libc++'s implementation of `forward_list::merge()` already meets this requirement, there were no tests to verify this behavior. This patch adds the necessary tests to ensure that self-merging remains a no-op and prevents any future regressions on this. Closes #104942.	2025-03-19 12:28:11 -04:00
Kristof Beyls	9a078a372e	2024 Security Group Transparency Report (#132011 ) This adds the Security Response Group's transparency report for 2024.	2025-03-19 16:26:41 +00:00
Peng Liu	617646ab8d	[libc++] Use __alloc_traits in <deque> whenever it is available for consistency (#126595 ) When an allocator-aware container already defines a member type alias `__alloc_traits` for `std::allocator_traits<allocator_type>`, we should consistently use `__alloc_traits`. Mixing the usage of both `__alloc_traits` and `std::allocator_traits` can lead to inconsistencies and confusion.	2025-03-19 12:23:35 -04:00
LLVM GN Syncbot	99997d717b	[gn build] Port 7af0bfe62fff	2025-03-19 16:17:16 +00:00
John Harrison	7af0bfe62f	[lldb-dap] Ensure logging statements are written as a single chunk. (#131916 ) I noticed this while debugging some unit tests that the logs occasionally would intersperse two log statements. Previously, it was possible for a log statement to have two messages interspersed since the timestamp and log statement were two different writes to the log stream. I created a Log helper to ensure we have a lock while attempting to write to the logs.	2025-03-19 09:16:53 -07:00
James Y Knight	25613b32ab	NFC: fix whitespace lint in bazel file.	2025-03-19 12:16:09 -04:00
Peng Liu	70480fcff6	[libc++] Validate vector<bool> copy/move-assignment operators in realistic scenarios (#119817 ) The existing tests for `vector<bool>` copy- and move-assignment operators are limited to 3 bits only, which are inadequate to cover realistic scenarios. Most `vector<bool>` operations have code paths that are executed only when multiple storage words are involved, with each storage word typically comprising 64 bits on a 64-bit platform. Furthermore, the existing tests fail to cover all combinations `POCCA`/`POCMA`, along with different allocator equality and/or reallocation scenarios, leaving some critical code paths untested. This patch enhances the test coverage by introducing new tests covering up to 5 storage words, ensuring that partial words in the front or tail, and whole words in the middle are all properly tested. Moreover, these new tests ensure that the copy- and move-assignment operators are tested under all combinations of `POCCA`/`POCMA` and various allocator equality scenarios, both with or without reallocations.	2025-03-19 12:08:20 -04:00
Peng Liu	029cb8aff1	[libc++] Fix copy_backward for vector<bool> with small storage types (#131560 ) This patch fixes an issue in libc++ where `std::copy_backward` and `ranges::copy_backward` incorrectly copy `std::vector<bool>` with small storage types (e.g., `uint8_t`, `uint16_t`). The problem arises from flawed bit mask computations involving integral promotions and sign-bit extension, leading to unintended zeroing of bits. This patch corrects the bitwise operations to ensure correct bit-level copying. Fixes #131718.	2025-03-19 11:58:02 -04:00
TatWai Chong	1456eabca8	[mlir][tosa] Finalize profile-based validation for TOSA v1.0 (#131208 ) - When the operand type of an operation changes to a profile-dependent type, the compliance metadata must be updated. Update compliance check for the following: - CONV2D, CONV3D, DEPTHWISE_CONV2D, and TRANSPOSE_CONV2D, as zero points have changed to variable inputs. - PAD, because pad_const has been changed to a variable input. - GATHER and SCATTER, as indices has changed to index_t. - Add an int16 extension check for CONCAT. - Add a compliance check for COND_IF, WHILE_LOOP, VARIABLE, VARIABLE_READ, and VARIABLE_WRITE. - Correct the profile requirements for IDENTITY, TABLE, MATMUL and LOGICAL-like operations. - Remove unnecessary checks for non-v1.0 operations. - Add condition requirements (anyOf and allOf) to the type mode of metadata for modes that have multiple profile/extension considerations.	2025-03-19 08:57:22 -07:00
Peng Liu	a5b3d3a03f	[libc++] Fix {std, ranges}::copy for vector<bool> with small storage types (#131545 ) The current implementation of `{std, ranges}::copy` fails to copy `vector<bool>` correctly when the underlying storage type (`__storage_type`) is smaller than `int`, such as `unsigned char`, `unsigned short`, `uint8_t` and `uint16_t`. The root cause is that the unsigned small storage type undergoes integer promotion to (signed) `int`, which is then left and right shifted, leading to UB (before C++20) and sign-bit extension (since C++20) respectively. As a result, the underlying bit mask evaluations become incorrect, causing erroneous copying behavior. This patch resolves the issue by correcting the internal bitwise operations, ensuring that `{std, ranges}::copy` operates correctly for `vector<bool>` with any custom (unsigned) storage types. Fixes #131692.	2025-03-19 11:55:51 -04:00
dong-miao	480202f0d1	[RISCV] Add Zilsd and Zclsd Extensions (#131094 ) This commit adds the Load/Store pair instructions (Zilsd) and Compressed Load/Store pair instructions (Zclsd). [Specification link](https://github.com/riscv/riscv-isa-manual/blob/main/src/zilsd.adoc).	2025-03-19 08:53:41 -07:00
Peng Liu	c5195ae2d0	[libc++] Fix {std, ranges}::equal for vector<bool> with small storage types (#130394 ) The current implementation of `{std, ranges}::equal` fails to correctly compare `vector<bool>`s when the underlying storage type is smaller than `int` (e.g., `unsigned char`, `unsigned short`, `uint8_t` and `uint16_t`). See [demo](https://godbolt.org/z/j4s87s6b3)). The problem arises due to integral promotions on the intermediate bitwise operations, leading to incorrect final equality comparison results. This patch fixes the issue by ensuring that `{std, ranges}::equal` operate properly for both aligned and unaligned bits. Fixes #126369.	2025-03-19 11:51:21 -04:00
John Harrison	5720a792a9	[lldb-dap] Waiting for the test binary to exit prior to dumping logs. (#131917 ) This should ensure we have the full logs prior to dumping the logs. Additionally, printing log dumps to stderr so they are adjacent to assertion failures.	2025-03-19 08:49:52 -07:00
Sarah Spall	2c41a8e6d3	[HLSL] Fix bug in new clamp overloads (#131928 ) In some cases using the newly introduced clamp overloads, when floats were involved, clang would behave differently than DXC. To ensure the same behavior as DXC, require that for mix scalar/vector overloads the type of the scalar matches the type of the vector.	2025-03-19 08:47:56 -07:00
Joseph Huber	cb493d2bab	[OpenMP] Replace utilities with 'gpuintrin.h' definitions (#131644 ) Summary: Port more instructions. AMD version is at https://gist.github.com/jhuber6/235d7ee95f747c75f9a3cfd8eedac6aa	2025-03-19 10:47:21 -05:00
Peng Liu	e53bea5182	[libc++] Fix ambiguous call in {ranges, std}::count (#122529 ) This PR fixes an ambiguous call encountered while using the `std::ranges::count` and `std::count` algorithms with `vector<bool>` with small `size_type`s. The ambiguity arises from integral promotions during the internal bitwise arithmetic of the `count` algorithms for small integral types. This results in multiple viable candidates: `__libcpp_popcount(unsigned)`,` __libcpp_popcount(unsigned long)`, and `__libcpp_popcount(unsigned long long)`, leading to an ambiguous call error. To resolve this ambiguity, we introduce a dispatcher function, `__popcount`, which directs calls to the appropriate overloads of `__libcpp_popcount`. This closes #122528.	2025-03-19 11:36:29 -04:00
Craig Topper	d7879e524f	[ARM] Use DenseSet instead of DenseMap. NFC (#131978 ) The value in the map is set to "true" when something is added to the map. Techncally this: if (!DefRegs.contains(Reg)) will set the value in the map to false if it didn't already exist, and this is used to indicate the value wasn't in the map. This only occurs after all the "true" values have already been added to the map.	2025-03-19 08:32:09 -07:00
Joseph Huber	cc2a86ad35	[LLVM][Docs] Add language reference for `llvm.readsteadycounter` (#132040 ) Summary: This was not added in the original patch, add LLVM documentation.	2025-03-19 10:30:10 -05:00
Paul Kirth	93b7c1b4b2	[clang-doc] Add testing without --repository for HTML documentation (#131894 ) https://github.com/llvm/llvm-project/issues/131697 reported a crash when --repository was unused, so make sure we test the project in both configurations.	2025-03-19 08:24:10 -07:00
David Peixotto	58027fa989	[lldb] Require wide char support in unicode test (#131951 ) The unicode test sends some unicode input to lldb through pexpect and expects the output to be echoed back in an error message. This only works correctly when editline was compiled with wide character support. This commit modifies the test to require the necessary libedit configuration.	2025-03-19 08:17:59 -07:00
jeanPerier	cd0a2a3f1b	[flang] add QSORT extension intrinsic to the runtime (#132033 ) Add support for legacy Fortran intrinsic QSORT from lib 3f. This is a thin Fortran wrapper over libc qsort.	2025-03-19 16:14:37 +01:00
Florian Hahn	11b8699572	[LV] Don't skip instrs with side-effects in reg pressure computation. (#126415 ) calculateRegisterUsage adds end points for each user of an instruction to Ends and ignores instructions not added to it, i.e. instructions with no users. This means things like stores aren't included, which in turn means values that are only used in stores are also not included for consideration. This means we underestimate the register usage in cases where the only users are things like stores. Update the code to don't skip instructions without users (i.e. not in Ends) if they have side-effects. PR: https://github.com/llvm/llvm-project/pull/126415	2025-03-19 15:13:43 +00:00
Mariusz Sikora	4f5ccf22fa	[AMDGPU] Support image_bvh8_intersect_ray instruction and intrinsic. (#130041 ) Co-authored-by: Ivan Kosarev <ivan.kosarev@amd.com>	2025-03-19 16:08:08 +01:00
James Y Knight	379a8d1a83	Fix bazel build for 8437b7f5. (#132046 ) When using parse_headers validation, rpc_server.h fails to build, since we don't expose any of the RPC deps in the Bazel build. So at least for now, just exclude it.	2025-03-19 11:05:51 -04:00
Cassandra Beckley	1f152527e0	[SPIR-V] Add support for HLSL SV_GroupIndex (#130670 ) This PR lowers the `llvm.spv.flattened.thread.id.in.group` intrinsic as a `LocalInvocationIndex` builtin variable.	2025-03-19 11:04:41 -04:00
Aaron Ballman	5497709756	Update C++ DR status page	2025-03-19 11:01:51 -04:00
Luke Lau	f536f71580	[LV] Split RecurrenceDescriptor into RecurKind + FastMathFlags in LoopUtils. NFC (#132014 ) Split off from #131300, this splits up RecurrenceDescriptor arguments so that arbitrary recurrence kinds may be used down the line.	2025-03-19 22:56:57 +08:00
Ash Dobrescu	3bba268013	[BOLT] Support computed goto and allow map addrs inside functions (#120267 ) Create entry points for addresses referenced by dynamic relocations and allow getNewFunctionOrDataAddress to map addrs inside functions. By adding addresses referenced by dynamic relocations as entry points. This patch fixes an issue where bolt fails on code using computing goto's. This also fixes a mapping issue with the bugfix from this PR: https://github.com/llvm/llvm-project/pull/117766.	2025-03-19 14:55:59 +00:00
Fabian Mora	2b8f887915	[mlir][Ptr] Add the `MemorySpaceAttrInterface` interface and dependencies. (#86870 ) This patch introduces the `MemorySpaceAttrInterface` interface. This interface is responsible for handling the semantics of `ptr` operations. For example, this interface can be used to create read-only memory spaces, making any other operation other than a load a verification error, see `TestConstMemorySpaceAttr` for a possible implementation of this concept. This patch also introduces Enum dependencies `AtomicOrdering`, and `AtomicBinOp`, both enumerations are clones of the Enums with the same name in the LLVM Dialect. Also, see: - [[RFC] `ptr` dialect & modularizing ptr ops in the LLVM dialect](https://discourse.llvm.org/t/rfc-ptr-dialect-modularizing-ptr-ops-in-the-llvm-dialect/75142) for rationale. - https://github.com/llvm/llvm-project/pull/73057 for a prototype implementation of the full change. Note: Ignore the first commit, that's being reviewed in https://github.com/llvm/llvm-project/pull/86860 .	2025-03-19 10:55:24 -04:00
Simon Pilgrim	b3d280b887	[X86] fold AND(X,SEXT(SETCC())) -> SELECT(SETCC(),X,0) to support AVX512 predicated {k}{z} masks (#131788 ) We already do this for the ANDNP(SEXT(SETCC()),X) equivalent pattern. Fixes #109272	2025-03-19 14:53:23 +00:00
Jonas Devlieghere	4322d03e4a	[lldb] Avoid having to list all the headers in headers.swig (#131934 ) When adding SBLock, I realized you always have to add a new header in two places: LLDB.h and headers.swig. I can't think of a reason the latter can't use the umbrella header and avoid the duplication.	2025-03-19 07:45:25 -07:00
Nikita Popov	38e8dff84b	[AA][BasicAA] Move more call logic to BasicAA (#131144 ) Currently, the handling for calls is split between AA and BasicAA in an awkward way. BasicAA does argument alias analysis for non-escaping objects (but without considering MemoryEffects), while AA handles the generic case using MemoryEffects. However, fundamentally, both of these are really trying to do the same thing. The new merged logic first tries to remove the OtherMR component of the memory effects, which includes accesses to escaped memory. If a function-local object does not escape, OtherMR can be set to NoModRef. Then we perform the argument scan in basically the same way as AA previously did. However, we also need to look at the operand bundles. To support that, I've adjusted getArgModRefInfo to accept operand bundle arguments.	2025-03-19 15:44:52 +01:00
Iris	67a01131a8	[libc] Replace `__builtin_popcountl` with `cpp::popcount` in `sched_getcpucount` (#131952 ) Closes #128781.	2025-03-19 10:35:50 -04:00
Connector Switch	2d09028771	[libc] Fix `size_t` and `unsigned long` mismatch on riscv32 in test. (#132028 ) Fix regression: https://github.com/llvm/llvm-project/pull/131431#issuecomment-2735029529	2025-03-19 22:30:51 +08:00
Valentin Clement (バレンタインクレメン)	20feca47c1	[flang][cuda] Allow ieee_arithmetic on the device (#131930 ) - Allow ieee_arithmetic on the device - Add ignore_tkr(d) to ieee_is_finite	2025-03-19 07:20:06 -07:00
Lei Huang	ade22fc1d9	[PowerPC] Support conversion between f16 and f128 (#130158 ) Enables conversion between f16 and f128. Expanding on pre-Power9 targets and using HW instructions on Power9. Fixes https://github.com/llvm/llvm-project/issues/92866 Commandeer of: https://github.com/llvm/llvm-project/pull/97677 --------- Co-authored-by: esmeyi <esme.yi@ibm.com>	2025-03-19 10:19:57 -04:00
Alex Bradbury	be0a3b223a	[IR] Allow llvm.experimental.memset.pattern to take any sized type as the pattern argument (#132026 ) I initially thought starting with a more narrow definition and later expanding would make more sense. But as pointed out in review for PR #129220, this restriction is generating additional unnecessary work. This patch alters the intrinsic to accept patterns of any type. Future patches will update LoopIdiomRecognize and PreISelIntrinsicLowering to take advantage of this. The verifier will complain if an unsized type is used. I've additionally taken the opportunity to remove a comment from the LangRef about some bit widths potentially not being supported by the target. I don't think this is any more true than it is for arbitrary width loads/stores which don't carry a similar warning that I can see. A verifier check ensures that only sized types are used for the pattern.	2025-03-19 14:17:42 +00:00
cor3ntin	baef6fadbf	[Clang] Increase the default expression nesting limit (#132021 ) This iterates on #104717 (which we had to revert) In a bid to increase our chances of success, we try to avoid blowing up the stack by - Using `runWithSufficientStackSpace` in ParseCompoundStatement - Reducing the size of `StmtVector` a bit - Reducing the size of `DeclsInGroup` a bit - Removing a few `ParsedAttributes` from the stacks in places where they are not strictly necessary. `ParsedAttributes` is a _huge_ object On a 64 bits system, the following stack size reductions are observed ``` ParseStatementOrDeclarationAfterAttributes: 344 -> 264 ParseStatementOrDeclaration: 520 -> 376 ParseCompoundStatementBody: 1080 -> 1016 ParseDeclaration: 264 -> 120 ``` Fixes #94728	2025-03-19 15:16:38 +01:00
Simon Pilgrim	0cb9c5045b	[CostModel][X86] check fp<->int conversion cost kinds using -cost-kind=all	2025-03-19 14:16:08 +00:00
Kazu Hirata	364e60b6e4	[X86] Avoid repeated hash lookups (NFC) (#131963 )	2025-03-19 07:15:21 -07:00
Kazu Hirata	9705010b5e	[Vectorize] Avoid repeated hash lookups (NFC) (#131962 )	2025-03-19 07:14:54 -07:00
Kazu Hirata	fc278e406e	[Scalar] Avoid repeated hash lookups (NFC) (#131961 )	2025-03-19 07:14:28 -07:00
Kazu Hirata	f97a852d69	[IPO] Avoid repeated hash lookups (NFC) (#131960 )	2025-03-19 07:14:10 -07:00

1 2 3 4 5 ...

531239 Commits