llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-16 18:16:35 +00:00

Author	SHA1	Message	Date
Petr Hosek	5304034f09	Revert "[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag" This reverts commit 55e65ad876e3ac0b1cb0410a5cce3554c009af65.	2023-03-28 08:28:17 +00:00
zhanglimin	cfdcdf05fe	[tsan] Derive the unmangled SP in longjmp with xor key on loongarch64 Introducing xor key to derive unmangled sp is here to follow the way that the glibc adds support for pointer mangling on loongarch in commit 1c9bc1b6e50293a1b7037a7bfbf835868a55baed. Reviewed By: SixWeining, wangleiat, xen0n Differential Revision: https://reviews.llvm.org/D146716	2023-03-28 16:22:49 +08:00
Juan Manuel MARTINEZ CAAMAÑO	488185cca3	[Clang][DebugInfo][AMDGPU] Emit zero size bitfields in the debug info to delimit bitfields in different allocation units. Consider the following sturctures when targetting: struct foo { int space[4]; char a : 8; char b : 8; char x : 8; char y : 8; }; struct bar { int space[4]; char a : 8; char b : 8; char : 0; char x : 8; char y : 8; }; Even if both structs have the same layout in memory, they are handled differenlty by the AMDGPU ABI. With the following code: // clang --target=amdgcn-amd-amdhsa -g -O1 example.c -S char use_foo(struct foo f) { return f.y; } char use_bar(struct bar b) { return b.y; } For use_foo, the 'y' field is passed in v4 ; v_ashrrev_i32_e32 v0, 24, v4 ; s_setpc_b64 s[30:31] For use_bar, the 'y' field is passed in v5 ; v_bfe_i32 v0, v5, 8, 8 ; s_setpc_b64 s[30:31] To make this distinction, we record a single 0-size bitfield for every member that is preceded by it. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D144870	2023-03-28 10:07:32 +02:00
Martin Braenne	0608541aa4	[clang][dataflow][NFC] Eliminate StmtToEnvMap interface. Instead, we turn StmtToEnvMap into a concrete class with the implementation that used to live in StmtToEnvMapImpl. The layering issue that originally required the indirection through the `StmtToEnvMap` interface no longer exists. Reviewed By: ymandel, xazax.hun, gribozavr2 Differential Revision: https://reviews.llvm.org/D146507	2023-03-28 08:05:57 +00:00
Martin Storsjö	0f4c6b120f	[lvm-windres] Try to match GNU windres regarding handling of unescaped quotes Some background context: GNU windres invokes the preprocessor in a subprocess. Some windres options are passed through to the preproocessor, e.g. -D options for predefining defines. When GNU windres passes these options onwards, it takes the options in exact the form they are received (in argv or similar) and assembles them into a single preprocessor command string which gets interpreted by a shell (IIRC via the popen() function, or similar). When LLVM invokes subprocesses, it does so via APIs that take properly split argument vectors, to avoid needing to worry about shell quoting/escaping/unescaping. But in the case of LLVM windres, we have to emulate the effect of the shell parsing done by popen(). Most of the relevant cases are already taken care of here, but this patch fixes an uncommon case encountered in https://github.com/llvm/llvm-project/issues/57334. (This case is uncommon since it doesn't do what one would want to; the quotes need to be escaped more to work as intended through the popen() shell). Differential Revision: https://reviews.llvm.org/D146848	2023-03-28 11:02:44 +03:00
Martin Storsjö	dc41f387e3	[llvm-rc] Remove transitional preprocessing fallback logic When preprocessing was integrated to llvm-rc in 2021, this was a new requirement (previously one could execute llvm-rc without a suitable preprocessing tool to be available). As a transitional helper, llvm-rc fell back on skipping preprocessing if no suitable tool was found (with a warning printed), but users could pass an llvm-rc specific option to silence the warning, if they explicitly want to run the tool without preprocessing. Now 2 years later, remove the transitional helper - error out if preprocessing failed. The option for disabling preprocessing remains. Differential Revision: https://reviews.llvm.org/D146797	2023-03-28 11:02:43 +03:00
Martin Storsjö	014e5c8d39	[llvm-rc] Fix the reference to the option for disabling preprocessing in a message This was the original option name from the first iteration of the patch that added the feature, but during review, a different name was suggested and preferred - but the reference in the helpful message was missed. Differential Revision: https://reviews.llvm.org/D146796	2023-03-28 11:02:43 +03:00
Martin Storsjö	282744a9ce	[llvm-rc] Look for "clang-<major>" when locating a suitable preprocessor In some cases, there's no adjacent executable named "clang" or "clang-cl", but one name "clang-<major>". This logic doesn't cover every possible deployment setup of course, but should cover more fairly common/reasonable cases. See `caaae171ac (commitcomment-105808524)` for discussion about a case where this would have been helpful. Differential Revision: https://reviews.llvm.org/D146794	2023-03-28 11:02:42 +03:00
Martin Storsjö	d2fa6b694c	[llvm-rc] Respect the executable specified in the --preprocessor command The arguments passed in this option were passed onto the child process, but we still blindly used the clang binary that we had found to sys::ExecuteAndWait as the intended executable to run. If the user hasn't specified any custom --preprocessor command, Args[0] is equal to the variable Clang. This doesn't affect any tests, since the tests only print the arguments it would try to execute (but not the first parameter to sys::ExecuteAndWait), but there's no testes for executing it (and validating that it did execute the right thing). Differential Revision: https://reviews.llvm.org/D146793	2023-03-28 11:02:41 +03:00
Nicolas Vasilache	7cf203e739	[mlir][Linalg][Transform] Drop spurious assertion in packGreedilyOp `transform.pack_greedily` supports skipping dimensions in which case we may well end up with e.g. a matvec innermost. We should not spuriously crash in such cases.	2023-03-28 00:56:44 -07:00
Adrian Kuegel	cdeaeeeb64	[mlir] Apply ClangTidy readability fix (NFC)	2023-03-28 09:47:44 +02:00
Bjorn Pettersson	ba8facfff0	[SimpleLoopUnswitch] Fix SCEV invalidation for unswitchTrivialSwitch When doing a trivial unswitch of a switch statement the code need to "invalidate SCEVs for the outermost loop reached by any of the exits", as indicated by code comments. Depending on if we find such an outermost loop or not we can limit the invalidation to some sub-loops or the full loop-nest. As shown in the added test case there seem to have been some bugs in the code that was finding the "outermost loop", so we could end up invalidating too few loops. Seems like commit 1bf8ae17f5e2714c8c87978 introduced the bug by moving the code that invalidates the loops above some of the code that computed 'OuterL'. This patch fixes that by also moving that computation of 'OuterL' so that we compute 'OuterL' properly before we use it for the SCEV invalidation. Differential Revision: https://reviews.llvm.org/D146963	2023-03-28 09:41:52 +02:00
pvanhout	d892521076	[AMDGPU] Break-up large PHIs for DAGISel DAGISel uses CopyToReg/CopyFromReg to lower PHI nodes. With large PHIs, this can result in poor codegen. This is because it introduces a need to have a build_vector before copying the PHI value, and that build_vector may have many undef elements. This can cause very high register pressure and abnormal stack usage in some cases. This scalarization/phi "break-up" can be easily tuned/disabled through CL options in case it's not beneficial for some users. It's also only enabled for DAGIsel and GlobalISel handles PHIs much better (as it works on the whole function). This can both scalarize (break a vector into its elements) and simplify (break a vector into smaller, more manageable subvectors) PHIs. Fixes SWDEV-321581 Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D143731	2023-03-28 09:38:47 +02:00
Petr Hosek	7765e5d9a1	[runtimes][CMake] Drop the check to see if linker works This isn't needed anymore. Differential Revision: https://reviews.llvm.org/D144440	2023-03-28 07:37:47 +00:00
pvanhout	6b971325e9	[AMDGPU] Fold more AGPR copies/PHIs in SIFoldOperands Generalize `tryFoldLCSSAPhi` into `tryFoldPhiAGPR` which works on any kind of PHI node (not just LCSSA ones) and attempts to create AGPR Phis more aggressively. Also adds a GFX908-only "cleanup" function `tryOptimizeAGPRPhis` which tries to minimize AGPR to AGPR copies on GFX908, which doesn't have a ACCVGPR MOV instruction (so AGPR-AGPR copies become 2 or 3 instructions as they need a VGPR temp). The reason why this is needed is because D143731 + the new `tryFoldPhiAGPR` may create a lot more PHIs (one 32xfloat PHI becomes 32 float phis), and if each PHI hits the same AGPR (like in `test_mfma_loop_agpr_init`) they will be lowered to 32 copies from the same AGPR, which will each become 2-3 instructions. Creating a VGPR cache in this case prevents all those copies from being generated (we have AGPR-VGPR copies instead which are trivial). This is a prepation patch intended to prevent regressions in D143731 when AGPRs are involved. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D144099	2023-03-28 09:33:12 +02:00
Max Kazantsev	a7dcf39f09	[Test] Add two tests showing unprofitable case of Guard Widening Guard Widening is ignorant about blocks frequency. As result, it may end up widening conditions from cold/effectively dead code into some much hotter place, harming average performance.	2023-03-28 14:30:42 +07:00
Florian Hahn	417fe52e6f	Revert "[SLP] Check with target before vectorizing GEP Indices." This reverts commit 1387a13e1d0bac94457626ef3e7427c84caf6e65. This introduced performance regressions on AArch64, when the cost of a vector GEP + extracts is offset by the benefits of vectorizing the rest of the tree. The test in llvm/test/Transforms/SLPVectorizer/AArch64/vector-getelementptr.ll illustrates the issue. It was extracted from code that regressed a SPEC benchmark by 15%.	2023-03-28 08:06:53 +01:00
Chris Cotter	bd7628461b	[clang-tidy] Ignore unevaluated exprs in rvalue-reference-param-not-moved Ignore unevaluated expressions in rvalue-reference-param-not-moved check since they are not actual uses of a move(). Reviewed By: PiotrZSL Differential Revision: https://reviews.llvm.org/D146929	2023-03-28 07:05:12 +00:00
Jakub Chlanda	ae3c981aa4	[NVPTX] Enforce half type support is present for builtins Differential Revision: https://reviews.llvm.org/D146715	2023-03-28 08:48:10 +02:00
Nicolas Vasilache	15f52c1502	[mlir][Linalg][Transform] Add support to let `transform.structured.pack_greedily` pad to the next multiple of a static constant This increase the flexibility of the transformation to allow mixed packing / padding specifications. Differential Revision: https://reviews.llvm.org/D146969	2023-03-27 23:37:13 -07:00
Tom Stellard	64c30dc9a2	workflows/release-tasks: Fix missing suffix on doxygen tarballs Reviewed By: thieta Differential Revision: https://reviews.llvm.org/D145997	2023-03-27 23:28:18 -07:00
Tom Stellard	c52e947f9c	workflows/release-tasks: Upload release notes as an artifact This make sure the docs are always available and can be manually uploaded if a later step fails. Reviewed By: thieta Differential Revision: https://reviews.llvm.org/D145996	2023-03-27 23:17:14 -07:00
Petr Hosek	55e65ad876	[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag These will be replaced by CMake's check_linker_flag once we update the minimum CMake version 3.20. Differential Revision: https://reviews.llvm.org/D145716	2023-03-28 06:07:35 +00:00
Jonas Devlieghere	568be31c9e	[dsymutil] Initialize the debug map before loading the main binary Fix a crash when a warning is emitted while loading the symbols from the main binary. The warning helper assumes that the resulting debug map is initialized, but this happened after loading the main binary. Since there's no dependency between the two the initialization can be moved up. rdar://107298776	2023-03-27 22:34:42 -07:00
Kazu Hirata	e844638946	[llvm] Use isIntOrFPConstant (NFC)	2023-03-27 22:32:23 -07:00
Johannes Doerfert	4d3f79f2ad	[OpenMP] Resolve const cast issue introduced in D123446	2023-03-27 22:13:38 -07:00
Johannes Doerfert	94d14536a9	[OpenMP][FIX] More AAExecutionDomain fixes We missed certain updates, mostly to call site information, and dependent AAs did not get recomputed. We also did not properly distinguish and propagate incoming and outgoing information of call sites. The runtime tests passes now, I'll add a proper test for AAExecutionDomain soon that covers all the cases and ensures we haven't forgotten more updates. To help unblock some apps, I'll put the fix first.	2023-03-27 21:36:21 -07:00
Johannes Doerfert	3a7cb3d45a	[OpenMP] Adjust generic state machine simplification CB This callback caused us to potentially miss out on call edges if we were expecting a custom state machine since the custom state machine was not created but the workers also did not enter the generic one. I have not observed an issue and don't know how to create a test for sure, but it is saver to err on the conservative side for now.	2023-03-27 21:30:23 -07:00
Johannes Doerfert	7ccf4d1ad7	[Attributor][FIX] Account for blocks w/o predecessors	2023-03-27 21:30:23 -07:00
Johannes Doerfert	7f7e1749c5	[OpenMP] Be smarter about the insertion point for deduplication We can use dominance and avoid the special handling of kernels and prevent inserting code before allocas accidentally (as happend in the runtime test).	2023-03-27 21:30:23 -07:00
Johannes Doerfert	5244617e3a	[OpenMP][NFC] Delete dead code This code may have served a purpose at some point but it has been dead for a long while. `FromMapperBase` was always `nullptr` which is `false` which makes the rest of the code dead. Since this has not affected tests, I delete it for now.	2023-03-27 21:30:23 -07:00
Johannes Doerfert	747af24155	[OpenMP] Allow more tests to run on AMDGPU This basically works around the printf issue to increase test coverage. Differential Revision: https://reviews.llvm.org/D146838	2023-03-27 21:30:22 -07:00
Lang Hames	151b58d802	[mlir] Fix unit tests after LLVM commit 8b1771bd9f3.	2023-03-27 19:38:21 -07:00
Lang Hames	557a0ea8af	[mlir] Update JitRunner, ExecutionEngine after LLVM commit 8b1771bd9f3. LLVM commit 8b1771bd9f3 replaced JITEvaluatedSymbol with ExecutorSymbolDef.	2023-03-27 19:18:04 -07:00
AmosLewis	4bdc9d9b2d	[mlir][tosa] Add TOSA f64 type support for const op [mlir][tosa] Add TOSA f64 type support for const op Reviewed By: eric-k256, jpienaar Differential Revision: https://reviews.llvm.org/D145336	2023-03-27 18:49:44 -07:00
Yeting Kuo	0676c6d91f	[RISCV] Support vector type strict_fma. Like D145900, the patch also supports fixed vector strict_fma nodes in RISC-V by customized lowering them to riscv_strict_vfmadd_vl nodes. riscv_strict_vfmadd_vl is created to avoid some riscv_vfmadd_vl optimizations happening to original strict_fma nodes. The patch also adds combine patterns for riscv_strict_fmadd_vl nodes with negation operands. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D146939	2023-03-28 09:01:46 +08:00
Lang Hames	9382bbad3d	[ORC] Add missing header. Accidentally left out of 8b1771bd9f3.	2023-03-27 17:43:58 -07:00
Lang Hames	8b1771bd9f	[ORC] Move most ORC APIs to ExecutorAddr, introduce ExecutorSymbolDef. ExecutorAddr was introduced in b8e5f918166 as an eventual replacement for JITTargetAddress. ExecutorSymbolDef is introduced in this patch as a replacement for JITEvaluatedSymbol: ExecutorSymbolDef is an (ExecutorAddr, JITSymbolFlags) pair, where JITEvaluatedSymbol was a (JITTargetAddress, JITSymbolFlags) pair. A number of APIs had already migrated from JITTargetAddress to ExecutorAddr, but many of ORC's internals were still using the older type. This patch aims to address that. Some public APIs are affected as well. If you need to migrate your APIs you can use the following operations: * ExecutorAddr::toPtr replaces jitTargetAddressToPointer and jitTargetAddressToFunction. * ExecutorAddr::fromPtr replace pointerToJITTargetAddress. * ExecutorAddr(JITTargetAddress) creates an ExecutorAddr value from a JITTargetAddress. * ExecutorAddr::getValue() creates a JITTargetAddress value from an ExecutorAddr. JITTargetAddress and JITEvaluatedSymbol will remain in JITSymbol.h for now, but the aim will be to eventually deprecate and remove these types (probably when MCJIT and RuntimeDyld are deprecated).	2023-03-27 17:37:58 -07:00
Peter Klausler	41a964cff0	[flang] Settle ambiguity between C795 and C721 C721 says that a type parameter value of '*' is permitted in the type-spec for a named constant; C795 says that such type parameters are allowed in type-specs only for a few kinds of things, not including named constants. The interpretation seems to depend on context, with C721 applying to intrinsic types (i.e., character) and C795 applying only to derived types. Differential Revision: https://reviews.llvm.org/D146586	2023-03-27 17:37:30 -07:00
Craig Topper	7b0c41841e	[RISCV] Move compressible registers to the beginning of the FP allocation order. We don't have very many compressible FP instructions, just load and store. These instruction require the FP register to be f8-f15. This patch changes the FP allocation order to prioritize f10-f15 first. These are also the FP argument registers. So I allocated them in reverse order starting at f15 to avoid taking the first argument registers. This appears to match gcc allocation order. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D146488	2023-03-27 17:29:28 -07:00
Peter Klausler	b0f02cee2b	[flang] Catch impure defined assignments in DO CONCURRENT The semantic checking of DO CONCURRENT bodies looks only at the parse tree, not the typed expressions produced from it, so it misses calls to defined assignment subroutines that arise from assignment statements that resolve via generic interfaces into subroutine calls. Extend the checking to peek into the typed assignment operations left on the parse tree by semantics. Differential Revision: https://reviews.llvm.org/D146585	2023-03-27 17:25:26 -07:00
Vadim Paretsky	30ce6fbfaa	[OpenMP] Fix an OpenMP Windows build problem When building OpenMP as part of LLVM, CMAKE was generating incorrect location references for OpenMP build's first step's artifacts being used in regenerating its Windows import library in the second step. The fix is to feed a dummy non-buildable, rather than buildable, source to CMAKE to satisfy its source requirements removing the need to reference the first step's artifacts in the second step altogether. Differential Revision:https://reviews.llvm.org/D146894	2023-03-27 17:20:54 -07:00
Peter Klausler	1b56f273b2	[flang] Detect image control statements in non-construct IF statements The utility routine in semantics that determines whether an executable construct constitutes an image control statement was not examining the single action statement controlled by a non-construct IF statement, e.g. 'IF(P) STOP'. Differential Revision: https://reviews.llvm.org/D146584	2023-03-27 17:13:20 -07:00
Usman Akinyemi	de7639ddb0	Added instruction to join the llvm discourse and discord group. Added instruction and link to join the llvm discord and discourse group in the CONTRIBUTING.md files Reviewed By: keith Differential Revision: https://reviews.llvm.org/D146877	2023-03-27 17:02:07 -07:00
Peter Klausler	bb6faec181	[flang] Tune handling of LEN type parameter discrepancies on ALLOCATE Presently, semantics doesn't check for discrepancies between known constant corresponding LEN type parameters between the declared type of an allocatable/pointer and either the type-spec or the SOURCE=/MOLD= on an ALLOCATE statement. This allows discrepancies between character lengths to go unchecked. Some compilers accept mismatched character lengths on SOURCE=/MOLD= and the allocate object, and that's useful and unambiguous feature that already works in f18 via truncation or padding. A portability warning should issue, however. But for mismatched character lengths between an allocate object and an explicit type-spec, and for any mismatch between derived type LEN type parameters, an error is appropriate. Differential Revision: https://reviews.llvm.org/D146583	2023-03-27 17:01:41 -07:00
Peiming Liu	36c8a9a983	[mlir][sparse] rephrase the documentation for sparse compiler create-sparse-deallocs option. To address post-submit comments in https://reviews.llvm.org/D147010. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D147014	2023-03-27 23:59:57 +00:00
Jonas Devlieghere	4d683f7fa7	[dsymutil] Add the ability to generate universal binaries with a fat64 header Add the ability to generate universal binaries with a fat64 header. rdar://107223939 Differential revision: https://reviews.llvm.org/D146879	2023-03-27 16:22:16 -07:00
Peter Klausler	1fa9ef620b	[flang] Consolidate and enhance pointer assignment checks Consolidate aspects of pointer assignment & structure constructor pointer component checking from Semantics/assignment.cpp and /expression.cpp into /pointer-assignment.cpp, and add a warning about data targets that are not definable objects but not hard errors. Specifically, a structure component pointer component data target is not allowed to be a USE-associated object in a pure context by a numbered constraint, but the right-hand side data target of a pointer assignment statement has no such constraint, and that's the new warning. Differential Revision: https://reviews.llvm.org/D146581	2023-03-27 16:19:54 -07:00
Peiming Liu	c44d307c55	[mlir][sparse] add create-sparse-deallocs options to match the create-deallocs in BufferizationOption. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D147010	2023-03-27 23:18:32 +00:00
Peter Klausler	1eb9948f02	[flang] Catch more bad DATA statement objects The data statement variable checker is missing some cases, like expressions that are not variables. Run the checker first to enjoy its very specific error messages, but when it finds no problems, still apply a general check that an expression is a "variable" and also not a constant expression at the top level as a backstop. Differential Revision: https://reviews.llvm.org/D146580	2023-03-27 16:10:03 -07:00

1 2 3 4 5 ...

455989 Commits