The variables are all `constexpr`, which implies `inline`. Since they
aren't `constexpr` in C++03, they aren't `inline` there either, which is
why we currently define them out-of-line. Instead, we can use the C++17
extension of `inline` variables, which results in the same weak
definitions of the variables but without all the boilerplate.
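A minimal sketch of the difference (the names here are illustrative, not the actual libc++ variables):
```
// A static constexpr data member is implicitly inline in C++17, so the
// in-class initializer alone provides the (weak) definition.
struct __limits {
  static constexpr int __max_elements = 1024;
};

// Pre-C++17, odr-using __limits::__max_elements required this out-of-line
// definition boilerplate in exactly one translation unit:
// constexpr int __limits::__max_elements;
```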
This reverts commit 8a1ca6cad9cd0e972c322910cdfbbe9552c6c7ca.
I have fixed two things:
* The report is now sent via stdin, so we do not hit the limit on the
size of command-line arguments.
* The report is limited to 1MB in size; if we exceed that, we fall back
to listing only the totals, with a note telling you to check the full log.
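A rough POSIX sketch of the two fixes (the actual script and the `report-consumer` command are hypothetical stand-ins):
```
#include <cstddef>
#include <stdio.h>
#include <string>

constexpr std::size_t kMaxReportSize = 1024 * 1024; // the 1MB cap

// Feed the report to the consuming command via stdin rather than argv,
// so the kernel's limit on command-line size no longer applies.
bool sendReport(std::string report, const std::string &totalsOnly) {
  if (report.size() > kMaxReportSize)
    report = totalsOnly + "\nReport too large; check the full log for details.";
  FILE *pipe = popen("report-consumer", "w"); // hypothetical command
  if (!pipe)
    return false;
  fwrite(report.data(), 1, report.size(), pipe);
  return pclose(pipe) == 0;
}
```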
Related to PR #114423, this PR proposes to unify the naming of the
internal pointer members in `std::vector` and `std::__split_buffer` for
consistency and clarity.
Both `std::vector` and `std::__split_buffer` originally used a
`__compressed_pair<pointer, allocator_type>` member named `__end_cap_`
to store the internal capacity pointer and the allocator. However, the
naming has since diverged between the two classes:
- `std::vector` now uses `__cap_` and `__alloc_` for its internal
pointer and allocator members.
- In contrast, `std::__split_buffer` retains the name `__end_cap_` for
the capacity pointer, along with `__alloc_`.
This inconsistency between the names `__cap_` and `__end_cap_` has
caused confusion (especially for me when I was working on both
classes). I suggest unifying these names by renaming `__end_cap_` to
`__cap_` in `std::__split_buffer`.
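For reference, a simplified sketch of the resulting member layout (the real class uses compressed-pair-style storage and more machinery):
```
template <class _Tp, class _Allocator>
struct __split_buffer_sketch {
  _Tp *__first_;       // start of the allocation
  _Tp *__begin_;       // first constructed element
  _Tp *__end_;         // one past the last constructed element
  _Tp *__cap_;         // one past the end of the allocation (was __end_cap_)
  _Allocator __alloc_; // matches std::vector's __alloc_
};
```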
If the first operand is a physical register with no register class, take
the register class for the new virtual register from the second operand
of the sub in the genSubAdd2SubSub machine combine.
When retrieving the location of the function declaration, we were
dropping the file component on the floor, which resulted in an amusingly
confusing situation where we displayed the file containing the
implementation of the function but used the line number of the
declaration. This patch fixes that.
It required a small refactor of Function::GetStartLineSourceLineInfo to
return a SupportFile (instead of just the file spec), which in turn
necessitated changes in a couple of other places as well.
I've been contributing to the Clang Static Analyzer for a while now;
since around 2019, I think.
I've helped ensure the quality of the Static Analyzer for the last
~4-6 releases by testing, fixing, and backporting patches, and by
writing comprehensive release notes for each release.
I have a strong sense of ownership of the code I contribute.
I follow the issue tracker, and also try to follow and participate in
RFCs on Discourse if I'm not overloaded.
I also check Discord from time to time, but I rarely see anything there.
You can find the maintainer section of the LLVM DeveloperPolicy
[here](https://llvm.org/docs/DeveloperPolicy.html#maintainers) to read
more about the responsibilities.
- Add option (--report-failures-only) to generate a reduced report for
lit tests that only includes failing tests
- This is a continuation of proposed patches by @gregbedwell here:
- https://reviews.llvm.org/D143516
- https://reviews.llvm.org/D143519
---------
Co-authored-by: Greg Bedwell <greg.bedwell@sony.com>
Co-authored-by: James Henderson <James.Henderson@sony.com>
The structural equivalence check uses a cache to store already-found
non-equivalent values. This cache can be reused across calls (ASTImporter
does this). The value of `IgnoreTemplateParmDepth` can have an effect on
structural equivalence, so it is wrong to reuse the same cache for
checks with different values of `IgnoreTemplateParmDepth`. The current
change adds `IgnoreTemplateParmDepth` to the cache key to fix the
problem.
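A sketch of the shape of the fix (the types and names here are illustrative, not the actual StructuralEquivalenceContext code):
```
#include <set>
#include <tuple>

struct Decl; // stand-in for clang::Decl

// Before, the key was only the pair of declarations, so a non-equivalence
// result computed under one IgnoreTemplateParmDepth setting leaked into
// checks using the other setting. Adding the flag to the key keeps the
// cached results separate.
using NonEquivalentDeclsKey = std::tuple<Decl *, Decl *, bool>;

bool isCachedNonEquivalent(const std::set<NonEquivalentDeclsKey> &cache,
                           Decl *d1, Decl *d2,
                           bool ignoreTemplateParmDepth) {
  return cache.count({d1, d2, ignoreTemplateParmDepth}) != 0;
}
```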
On LoongArch64, the passing and returning of the type `complex16` is
similar to that of a structure type like `struct {fp128, fp128}`, meaning
values are passed and returned by reference. This behavior matches clang,
which makes it convenient to implement `iso_c_binding`.
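Conceptually, the ABI treats a `complex16` value like this C++ struct (illustrative only; `long double` is the IEEE quad `fp128` type on LoongArch64):
```
// Passed and returned indirectly (by reference) rather than in FP
// registers, mirroring struct { fp128, fp128 }.
struct Complex16 {
  long double re; // fp128 on LoongArch64
  long double im;
};
```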
Additionally, this patch fixes the failure in flang test
Integration/debug-complex-1.f90:
```
llvm-project/flang/lib/Optimizer/codeGen/Target.cpp:56:
not yet implemented: complex for this precision for return type
```
Since these `UndefValue::get` calls act as placeholders, I think it's
safe to replace them with poison values.
There are a lot of `UndefValue::get` calls in LLVM; I'll start by fixing
the ones in `unittests` while fixing the regression tests.
It's never set to true. Inheritable FDs are also dangerous, as they can
end up in processes which know nothing about them. It's better to
explicitly pass a specific FD to a specific subprocess, which we can
already mostly do using the ProcessLaunchInfo FileActions.
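A POSIX sketch of the policy (illustrative; in LLDB this is expressed through ProcessLaunchInfo file actions rather than raw syscalls):
```
#include <fcntl.h>
#include <unistd.h>

// Descriptors are opened close-on-exec, so no child inherits them by
// accident.
int openPrivateFd(const char *path) {
  return open(path, O_RDWR | O_CLOEXEC);
}

// A specific child gets a specific FD explicitly: dup2() in the child,
// between fork() and exec(), produces a descriptor without the
// close-on-exec flag, so only that child sees it.
void giveFdToChild(int fd, int child_fd) {
  dup2(fd, child_fd);
}
```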
This allows formatting large integers in a human-friendly way. Example:
"5321584" -> "5.32M".
Use it where such human-readable numbers are generated manually today.
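A sketch of the formatting rule implied by the example (the in-tree helper's name, suffixes, and rounding may differ):
```
#include <cstdio>
#include <string>

std::string humanize(unsigned long long n) {
  static const char *suffixes[] = {"", "K", "M", "G", "T", "P"};
  double value = static_cast<double>(n);
  unsigned i = 0;
  while (value >= 1000.0 && i + 1 < sizeof(suffixes) / sizeof(suffixes[0])) {
    value /= 1000.0;
    ++i;
  }
  char buf[32];
  if (i == 0)
    std::snprintf(buf, sizeof(buf), "%llu", n);
  else
    // e.g. 5321584 -> "5.32M"
    std::snprintf(buf, sizeof(buf), "%.2f%s", value, suffixes[i]);
  return buf;
}
```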
This PR re-introduces the functionality of
https://github.com/llvm/llvm-project/pull/113064, which was reverted in
0a68171b3c
due to memory lifetime issues.
Notice that I was not able to reproduce the ASan results myself, so I
have not been able to verify that this PR really fixes the issue.
---
Currently it is unsupported to:
1. Convert an MlirAttribute with type i1 to a numpy array
2. Convert a boolean numpy array to an MlirAttribute
Currently, the entire Python application crashes outright with a rather
poor error message (https://github.com/pybind/pybind11/issues/3336).
The complication in handling these conversions is that MlirAttribute
represents booleans as a bit-packed i1 type, whereas numpy represents
booleans as a byte array, with one byte (8 bits) per boolean.
This PR proposes the following approach:
1. When converting an i1-typed MlirAttribute to a numpy array, we cannot
directly use the underlying raw data backing the MlirAttribute as a
buffer to Python, as is done for other types. Instead, a copy of the data
is generated using numpy's unpackbits function, and the result is sent
back to Python.
2. When constructing an MlirAttribute from a numpy array, the Python
data is first read as uint8_t to convert it to the endianness used
internally in MLIR. Then the booleans are bit-packed using numpy's
packbits function, and the bit-packed array is saved as the MlirAttribute
representation.
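The core of both directions is plain bit packing and unpacking; a minimal C++ sketch (MSB-first order matches numpy's default `bitorder='big'`, but the order actually used must match MLIR's i1 packing):
```
#include <cstddef>
#include <cstdint>
#include <vector>

// One byte per boolean -> bit-packed, 8 booleans per byte (MSB first).
std::vector<uint8_t> packBits(const std::vector<uint8_t> &bools) {
  std::vector<uint8_t> packed((bools.size() + 7) / 8, 0);
  for (std::size_t i = 0; i < bools.size(); ++i)
    if (bools[i])
      packed[i / 8] |= static_cast<uint8_t>(1u << (7 - i % 8));
  return packed;
}

// Bit-packed -> one byte per boolean: the copy handed back to Python.
std::vector<uint8_t> unpackBits(const std::vector<uint8_t> &packed,
                                std::size_t numBools) {
  std::vector<uint8_t> bools(numBools);
  for (std::size_t i = 0; i < numBools; ++i)
    bools[i] = (packed[i / 8] >> (7 - i % 8)) & 1;
  return bools;
}
```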
Add a new helper function `isReachable` to `Block`. This function
traverses all successors of a block to determine if another block is
reachable from the current block.
This functionality has been reimplemented in multiple places in MLIR,
and possibly in additional copies in downstream projects. Therefore,
this commit moves it to a common place.
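A simplified sketch of the traversal (the real helper lives on `mlir::Block`; this stand-in `Block` models only the successor list, and whether a block counts as reachable from itself is a guess here):
```
#include <set>
#include <vector>

struct Block {
  std::vector<Block *> successors;
  bool isReachable(Block *other);
};

// Depth-first walk over the successor graph; the visited set makes the
// walk terminate on CFG cycles.
bool Block::isReachable(Block *other) {
  std::set<Block *> visited;
  std::vector<Block *> worklist;
  for (Block *succ : successors)
    if (visited.insert(succ).second)
      worklist.push_back(succ);
  while (!worklist.empty()) {
    Block *current = worklist.back();
    worklist.pop_back();
    if (current == other)
      return true;
    for (Block *succ : current->successors)
      if (visited.insert(succ).second)
        worklist.push_back(succ);
  }
  return false;
}
```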
In the case of Neon, if there exists an extractelement from a lane != 0
such that:
1. the extractelement does not necessitate a move from vector_reg -> GPR,
2. the extractelement result feeds into an fmul, and
3. the other operand of the fmul is a scalar or an extractelement from
lane 0 or a lane equivalent to 0,
then the extractelement can be merged with the fmul in the backend, and
it incurs no cost.
e.g.
```
define double @foo(<2 x double> %a) {
%1 = extractelement <2 x double> %a, i32 0
%2 = extractelement <2 x double> %a, i32 1
%res = fmul double %1, %2
ret double %res
}
```
`%2` and `%res` can be merged in the backend to generate:
`fmul d0, d0, v0.d[1]`
The change was tested with SPEC FP (C/C++) on Neoverse V2.
**Compile-time impact**: None.
**Performance impact**: Observing a 1.3-1.7% uplift on the lbm benchmark with -flto, depending upon the config.
This patch teaches extractCallsFromIR to recognize heap allocation
functions. Specifically, when we encounter a callee that is known to
be a heap allocation function like "new", we set the callee GUID to 0.
Note that I am planning to do the same for the caller-callee pairs
extracted from the profile. That is, when I encounter a frame that
does not have a callee, we assume that the frame is calling some heap
allocation function with GUID 0.
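A hedged sketch of the callee-GUID rule (not the actual extractCallsFromIR code; `isAllocationFn` is LLVM's existing helper from `llvm/Analysis/MemoryBuiltins.h`):
```
#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/TargetLibraryInfo.h"
#include "llvm/IR/Function.h"
#include "llvm/IR/InstrTypes.h"
#include <cstdint>

// GUID 0 is the sentinel for "some heap allocation function".
uint64_t calleeGUID(const llvm::CallBase &CB,
                    const llvm::TargetLibraryInfo &TLI) {
  if (llvm::isAllocationFn(&CB, &TLI))
    return 0;
  if (const llvm::Function *F = CB.getCalledFunction())
    return F->getGUID();
  return 0; // indirect call; also treated as unknown in this sketch
}
```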
Technically, I'm not recognizing enough functions in this patch.
TCMalloc is known to drop certain frames in the call stack immediately
above `new`. This patch is meant to lay the groundwork: setting up
GetTLI, plumbing it to extractCallsFromIR, and adjusting the unit
tests. I'll address the remaining issues in subsequent patches.
This patch introduces a new generic target, `gfx9-4-generic`. Since it doesn’t support FP8 and XF32-related instructions, the patch includes several code reorganizations to accommodate these changes.
Initially landed in 3ed4b0b0efca7a9467ce83fc62de9413da38006d.
Reverted in 375d1925dbd0c051fe2d4a86fe98ed08f4a502c5 because the
[`load-store.ll`](https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/NVPTX/load-store.ll)
test was not updated after 5e75880165553e9afb721239689a9c79ec84a108.
5e75880165553e9afb721239689a9c79ec84a108 is now updated in
7a99f2322c324972f2c5091dddd7752fa21d5a78.
Multiple `func.return` ops inside a `func.func` op are now supported
during bufferization. This PR extends the code base in three places:
- When inferring function return types, `memref.cast` ops are folded
away only if all `func.return` ops have matching buffer types. (E.g., we
don't fold if two `return` ops have operands with different layout
maps.)
- The alias sets of all `func.return` ops are merged. That's because
aliasing is a "may be" property.
- The equivalence sets of all `func.return` ops are taken only if they
match. If different `func.return` ops have different equivalence sets
for their operands, the equivalence information is dropped. That's
because equivalence is a "must be" property.
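A minimal sketch of the two merge rules above (illustrative only, with values identified by plain integer IDs rather than the actual analysis state):
```
#include <optional>
#include <set>
#include <vector>

using AliasSet = std::set<int>;

// Aliasing is a "may be" property: the merged result must cover every
// possibility, so the per-return sets are unioned.
AliasSet mergeAliasSets(const std::vector<AliasSet> &perReturn) {
  AliasSet merged;
  for (const AliasSet &s : perReturn)
    merged.insert(s.begin(), s.end());
  return merged;
}

// Equivalence is a "must be" property: it is kept only if every
// func.return agrees; otherwise the information is dropped.
std::optional<int>
mergeEquivalence(const std::vector<std::optional<int>> &perReturn) {
  if (perReturn.empty())
    return std::nullopt;
  for (const std::optional<int> &e : perReturn)
    if (!e || *e != *perReturn.front())
      return std::nullopt;
  return perReturn.front();
}
```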
This commit is in preparation for removing the deprecated
`func-bufferize` pass, which can bufferize functions with multiple
`return` ops.
The biggest change is assigning vector crypto instructions to the
correct processor resource.
The majority of these changes are guided by our RVV-capable
llvm-exegesis.