llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-29 15:06:10 +00:00

Author	SHA1	Message	Date
eopXD	c8eb535aed	[1/11][IR] Permit load/store/alloca for struct of the same scalable vector type This patch-set aims to simplify the existing RVV segment load/store intrinsics to use a type that represents a tuple of vectors instead. To achieve this, first we need to relax the current limitation for an aggregate type to be a target of load/store/alloca when the aggregate type contains homogeneous scalable vector types. Then to adjust the prolog of an LLVM function during lowering to clang. Finally we re-define the RVV segment load/store intrinsics to use the tuple types. The pull request under the RVV intrinsic specification is riscv-non-isa/rvv-intrinsic-doc#198 --- This is the 1st patch of the patch-set. This patch is originated from D98169. This patch allows aggregate type (StructType) that contains homogeneous scalable vector types to be a target of load/store/alloca. The RFC of this patch was posted in LLVM Discourse. https://discourse.llvm.org/t/rfc-ir-permit-load-store-alloca-for-struct-of-the-same-scalable-vector-type/69527 The main changes in this patch are: Extend `StructLayout::StructSize` from `uint64_t` to `TypeSize` to accommodate an expression of scalable size. Allow `StructType::isSized` to also return true for homogeneous scalable vector types. Let `Type::isScalableTy` return true when `Type` is `StructType` and contains scalable vectors. Extra description is added in the LLVM Language Reference Manual on the relaxation of this patch. Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Co-Authored-by: eop Chen <eop.chen@sifive.com> Reviewed By: craig.topper, nikic Differential Revision: https://reviews.llvm.org/D146872	2023-05-19 09:39:36 -07:00
Enna1	e4e6c6510b	[IR] Adds Instruction::setNoSanitizeMetadata() This patch adds a new method setNoSanitizeMetadata() for Instruction, and use it in SanitizerMetadata and SanitizerCoverage. Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D150632	2023-05-19 19:18:57 +08:00
Kazu Hirata	32ab0978dc	Partially revert "Use llvm::less_second (NFC)" This reverts part of commit e0039b8d6a5bd05e70203962f448569f2d2ef1c2. This should fix the issue reported in: https://github.com/llvm/llvm-project/issues/62546	2023-05-16 14:49:32 -07:00
Arthur Eubanks	ce90dfc74b	[StructuralHash] Track global variables Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D149209	2023-05-15 16:57:18 -07:00
Guillaume Chatelet	ce9b89f8be	[NFC] Refactor GlobalVariable Ctor Reuse logic from other ctor and remove code duplication. Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D150453	2023-05-15 07:30:07 +00:00
Christian Ulmann	794b58b467	[IR] Drop const in DILocation::getMergedLocation This commit removes constness from DILocation::getMergedLocation and fixes all its users accordingly. Having constness on the parameters forced the return type to be const as well, which does force usage of `const_cast` when the location needs to be used in metadata nodes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D149942	2023-05-15 07:21:43 +00:00
Sander de Smalen	3b95b81813	[AArch64][SME2/SVE2p1] Add predicate-as-counter intrinsics for ptrue/cntp These intrinsics are used to implement: * svptrue_c8(), svptrue_c16(), etc. * svcntp_c8(svcount_t pnn, uint64_t vl), svcntp_c16(...), etc. As described in https://github.com/ARM-software/acle/pull/217 Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D150263	2023-05-12 08:43:12 +00:00
Felipe de Azevedo Piovezan	becfcdfc81	[Verifier] Allow DW_OP_LLVM_entry_value in IR A follow up patch will make the CoroSplit pass introduce such operations in the IR level when it is safe to do so. Depends on D149748 Differential Revision: https://reviews.llvm.org/D149778	2023-05-10 14:35:04 -04:00
Hongtao Yu	b7d9322b49	[FS-AFDO] Load pseudo probe profile on MIR This change enables loading pseudo-probe based profile on MIR. Different from the IR profile loader, callsites are excluded from MIR profile loading since they are not assigned a FS discriminator. Using zero as the discriminator is not accurate and would undo the distribution work done by the IR loader based on pseudo probe distribution factor. We rely on block probes only for FS profile loading. Some refactoring is done to the IR profile loader so that `getProbeWeight` can be shared by both loaders. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D148584	2023-05-10 11:29:37 -07:00
Zain Jaffal	5d3a884229	[IRGen] Change annotation metadata to support inserting tuple of strings into annotation metadata array. Annotation metadata supports adding singular annotation strings to annotation block. This patch adds the ability to insert a tuple of strings into the metadata array. The idea here is that each tuple of strings represents a piece of information that can be all related. It makes it easier to parse through related metadata information given it will be contained in one tuple. For example in remarks any pass that implements annotation remarks can have different type of remarks and pass additional information for each. The original behaviour of annotation remarks is preserved here and we can mix tuple annotations and single annotations for the same instruction. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D148328	2023-05-09 17:51:28 +03:00
Kan Wu	b8d2f7177c	[MemProf] Add hot allocation type Add "Hot" AllocationType (in addition to existing cold, notcold). Use lifetime access density as metric to identify hot allocations. Treat hot as notcold for MemProfContextDisambiguation for now before the disambiguation for "hot" is done. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D149932	2023-05-08 10:34:53 -07:00
Teresa Johnson	1768898680	[MemProf] Control availability of hot/cold operator new from LTO link Adds an LTO option to indicate whether we are linking with an allocator that supports hot/cold operator new interfaces. If not, at the start of the LTO backends any existing memprof hot/cold attributes are removed from the IR, and we also remove memprof metadata so that post-LTO inlining doesn't add any new attributes. This is done via setting a new flag in the module summary index. It is important to communicate via the index to the LTO backends so that distributed ThinLTO handles this correctly, as they are invoked by separate clang processes and the combined index is how we communicate information from the LTO link. Specifically, for distributed ThinLTO the LTO related processes look like: ``` # Thin link: $ lld --thinlto-index-only obj1.o ... objN.o -llib ... # ThinLTO backends: $ clang -x ir obj1.o -fthinlto-index=obj1.o.thinlto.bc -c -O2 ... $ clang -x ir objN.o -fthinlto-index=objN.o.thinlto.bc -c -O2 ``` It is during the thin link (lld --thinlto-index-only) that we have visibility into linker dependences and want to be able to pass the new option via -Wl,-supports-hot-cold-new. This will be recorded in the summary indexes created for the distributed backend processes (*.thinlto.bc) and queried from there, so that we don't need to know during those individual clang backends what allocation library was linked. Since in-process ThinLTO and regular LTO also use a combined index, for consistency we query the flag out of the index in all LTO backends. Additionally, when the LTO option is disabled, exit early from the MemProfContextDisambiguation handling performed during LTO, as this is unnecessary. Depends on D149117 and D149192. Differential Revision: https://reviews.llvm.org/D149215	2023-05-08 08:02:21 -07:00
Akshay Khadse	5c7c3af1d0	Reapply [Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D149506	2023-05-08 21:19:40 +08:00
OCHyams	f9dba933c6	[Assignment Tracking] Skip scalable vectors in declare-to-assign pass Do not convert dbg.declares to dbg.assigns for variables backed by scalable vector allocas as this isn't yet supported. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D149959	2023-05-05 18:07:05 +01:00
Krzysztof Drewniak	f0415f2a45	Re-land "[AMDGPU] Define data layout entries for buffers"" Re-land D145441 with data layout upgrade code fixed to not break OpenMP. This reverts commit 3f2fbe92d0f40bcb46db7636db9ec3f7e7899b27. Differential Revision: https://reviews.llvm.org/D149776	2023-05-03 19:43:56 +00:00
Krzysztof Drewniak	3f2fbe92d0	Revert "[AMDGPU] Define data layout entries for buffers" This reverts commit f9c1ede2543b37fabe9f2d8f8fed5073c475d850. Differential Revision: https://reviews.llvm.org/D149758	2023-05-03 16:11:00 +00:00
Krzysztof Drewniak	f9c1ede254	[AMDGPU] Define data layout entries for buffers Per discussion at https://discourse.llvm.org/t/representing-buffer-descriptors-in-the-amdgpu-target-call-for-suggestions/68798, we define two new address spaces for AMDGCN targets. The first is address space 7, a non-integral address space (which was already in the data layout) that has 160-bit pointers (which are 256-bit aligned) and uses a 32-bit offset. These pointers combine a 128-bit buffer descriptor and a 32-bit offset, and will be usable with normal LLVM operations (load, store, GEP). However, they will be rewritten out of existence before code generation. The second of these is address space 8, the address space for "buffer resources". These will be used to represent the resource arguments to buffer instructions, and new buffer intrinsics will be defined that take them instead of <4 x i32> as resource arguments. ptr addrspace(8). These pointers are 128-bits long (with the same alignment). They must not be used as the arguments to getelementptr or otherwise used in address computations, since they can have arbitrarily complex inherent addressing semantics that can't be represented in LLVM. Even though, like their address space 7 cousins, these pointers have deterministic ptrtoint/inttoptr semantics, they are defined to be non-integral in order to prevent optimizations that rely on pointers being a [0, [addr_max]] value from applying to them. Future work includes: - Defining new buffer intrinsics that take ptr addrspace(8) resources. - A late rewrite to turn address space 7 operations into buffer intrinsics and offset computations. This commit also updates the "fallback address space" for buffer intrinsics to the buffer resource, and updates the alias analysis table. Depends on D143437 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D145441	2023-05-03 15:25:58 +00:00
Nick Desaulniers	3e3c6f24ff	Revert "[Demangle] make llvm::demangle take std::string_view rather than const std::string&" This reverts commit c117c2c8ba4afd45a006043ec6dd858652b2ffcc. itaniumDemangle calls std::strlen with the results of std::string_view::data() which may not be NUL-terminated. This causes lld/test/wasm/why-extract.s to fail when "expensive checks" are enabled via -DLLVM_ENABLE_EXPENSIVE_CHECKS=ON. See D149675 for further discussion. Back this out until the individual demanglers are converted to use std::string_view.	2023-05-02 15:54:09 -07:00
Teresa Johnson	e0577ce367	[MemProf] Removed unused allocation type Removes the 'notcoldandcold' allocation type summary (de)serialization support added in D135714, after realizing that this will never be generated in practice. There are 2 uses of the allocation type keywords in the summary. One is for the individual profiled memprof context summaries, and each context can only be assigned a single type of hotness. The second is in the clone version information produced by the MemProfContextDisambiguation whole program step, and we only create a clone for a specific allocation type. Differential Revision: https://reviews.llvm.org/D149669	2023-05-02 13:11:35 -07:00
Nick Desaulniers	c117c2c8ba	[Demangle] make llvm::demangle take std::string_view rather than const std::string& As suggested by @erichkeane in https://reviews.llvm.org/D141451#inline-1429549 There's potential for a lot more cleanups around these APIs. This is just a start. Callers need to be more careful about sub-expressions producing strings that don't outlast the expression using ``llvm::demangle``. Add a release note. Reviewed By: MaskRay, #lld-macho Differential Revision: https://reviews.llvm.org/D149104	2023-05-02 11:20:15 -07:00
Matt Arsenault	bc37be1855	LangRef: Add "dynamic" option to "denormal-fp-math" This is stricter than the default "ieee", and should probably be the default. This patch leaves the default alone. I can change this in a future patch. There are non-reversible transforms I would like to perform which are legal under IEEE denormal handling, but illegal with flushing zero behavior. Namely, conversions between llvm.is.fpclass and fcmp with zeroes. Under "ieee" handling, it is legal to translate between llvm.is.fpclass(x, fcZero) and fcmp x, 0. Under "preserve-sign" handling, it is legal to translate between llvm.is.fpclass(x, fcSubnormal\|fcZero) and fcmp x, 0. I would like to compile and distribute some math library functions in a mode where it's callable from code with and without denormals enabled, which requires not changing the compares with denormals or zeroes. If an IEEE function transforms an llvm.is.fpclass call into an fcmp 0, it is no longer possible to call the function from code with denormals enabled, or write an optimization to move the function into a denormal flushing mode. For the original function, if x was a denormal, the class would evaluate to false. If the function compiled with denormal handling was converted to or called from a preserve-sign function, the fcmp now evaluates to true. This could also be of use for strictfp handling, where code may be changing the denormal mode. Alternative name could be "unknown". Replaces the old AMDGPU custom inlining logic with more conservative logic which tries to permit inlining for callees with dynamic handling and avoids inlining other mismatched modes.	2023-04-29 08:44:59 -04:00
OCHyams	9391177cbc	[Assignment Tracking] Check getTypeSizeInBits result for scalable vector types Without this patch, in `getAssignmentInfo` the result of `getTypeSizeInBits` is cast to `uint64_t`, which a) is an operation that will eventually be unsupported by the API according to the comments, and b) causes an assertion failure if the type is a scalable vector. Don't cast the `TypeSize` to `uint64_t` and check `isScalable` before getting the fixed size. This can result in incorrect variable locations, see llvm.org/PR62346 (but is better than crashing). Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D149137	2023-04-28 09:10:38 +01:00
Mingming Liu	b3cb950cf3	[PGO]Implement metadata combine for 'branch_weights' of direct callsites when none of the instructions folds the rest away. - Merge cases are added for simplify-cfg {sink,hoist}, based on https://gcc.godbolt.org/z/avGvc38W7 and https://gcc.godbolt.org/z/dbWbjGhaE - When one instruction folds the others in, do not update branch_weights with sum (see test/Transforms/GVN/calls-readonly.ll) Differential Revision: https://reviews.llvm.org/D148877	2023-04-27 13:04:17 -07:00
Mircea Trofin	460ea85014	[nfc][thinlto] Handle global constant importing separately This makes the logic for referenced globals reusable for import criteria that don't use thresholds - in fact, we currently didn't consider any thresholds when importing. Differential Revision: https://reviews.llvm.org/D149298	2023-04-27 12:21:50 -07:00
ManuelJBrito	d22edb9794	[IR][NFC] Change UndefMaskElem to PoisonMaskElem Following the change in shufflevector semantics, poison will be used to represent undefined elements in shufflevector masks. Differential Revision: https://reviews.llvm.org/D149256	2023-04-27 18:01:54 +01:00
ManuelJBrito	8b56da5e9f	[IR] Change shufflevector undef mask to poison With this patch an undefined mask in a shufflevector will be printed as poison. This change is done to support the new shufflevector semantics for undefined mask elements. Differential Revision: https://reviews.llvm.org/D149210	2023-04-27 14:41:10 +01:00
Arthur Eubanks	891ce5bdce	[NFC][StructuralHash] Use hash_code	2023-04-25 14:12:06 -07:00
OCHyams	65d71ee3cf	[DebugInfo] Replace UndefValue with PoisonValue in DIArgList::handleChangedOperand This helps towards the effort to remove UndefValue from LLVM. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D140991	2023-04-25 16:18:41 +01:00
OCHyams	72776850ed	Revert "[DebugInfo] Print empty MDTuples wrapped in MetadataAsValue inline" This reverts commit 1e6fe677f8aa98518e05218affa16e468819f5ed (D140900). Buildbot: https://lab.llvm.org/buildbot/#/builders/196/builds/29937	2023-04-25 14:37:25 +01:00
OCHyams	1e6fe677f8	[DebugInfo] Print empty MDTuples wrapped in MetadataAsValue inline This improves the readability of debugging intrinsics. Instead of: call void @llvm.dbg.value(metadata !2, ...) !2 = !{} We will see: call void @llvm.dbg.value(metadata !{}, ...) !2 = !{} Note that we still get a numbered metadata entry for the node even if it's not used elsewhere. This is to avoid adding more context to the print functions. This is already legal IR - LLVM can parse and understand it - so there is no need to update the parser. The next patches in this stack will make such empty metadata operands more common and semantically important. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D140900	2023-04-25 14:13:47 +01:00
NAKAMURA Takumi	c49f850d55	Migrate `IIT_Info` into `Intrinsics.td` - Define `IIT_Info` in `Intrinsics.td` - Implement `EmitIITInfo` in `IntrinsicEmitter.cpp` - Use generated `IIT_Info` in `Function.cpp` Depends on D145873 and D146179 Differential Revision: https://reviews.llvm.org/D146914	2023-04-25 08:53:18 +09:00
Simon Pilgrim	0b7f53efec	[VP] IR expansion for fabs/fsqrt/fma/fmadd Add basic handling for VP ops that can expand to FP intrinsics Fixes #60464 Differential Revision: https://reviews.llvm.org/D149052	2023-04-24 15:20:07 +01:00
Tom Weaver	b63c08c773	Revert "[Coverity] Fix explicit null dereferences" This reverts commit 22b23a5213b57ce1834f5b50fbbf8a50297efc8a. This commit caused the following two build bots to start failing: https://lab.llvm.org/buildbot/#/builders/216/builds/20322 https://lab.llvm.org/buildbot/#/builders/123/builds/18511	2023-04-24 11:14:10 +01:00
Akshay Khadse	22b23a5213	[Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D148912	2023-04-23 12:07:11 +08:00
OCHyams	ea60ffc6d1	[NFC] Return unique dbg intrinsics from findDbgValues and findDbgUsers The out-param vector from findDbgValues and findDbgUsers should not include duplicates, which is possible if the debug intrinsic uses the value multiple times. This filter is already in place for multiple uses in a `DIArgLists`; extend it to cover dbg.assigns too because a Value may be used in both the address and value components. Additionally, refactor the duplicated functionality between findDbgValues and FindDbgUsers into a new function findDbgIntrinsics. Reviewed By: jmorse, StephenTozer Differential Revision: https://reviews.llvm.org/D148788	2023-04-20 14:18:46 +01:00
Nikita Popov	53500e333d	Reapply [SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating This exposed another miscompile in GVN, which was fixed by 20e9b31f88149a1d5ef78c0be50051e345098e41. ----- After D141386, violation of nonnull, range and align metadata results in poison rather than immediate undefined behavior, which means that these are now safe to retain when speculating. We only need to remove UB-implying metadata like noundef. This is done by adding a dropUBImplyingAttrsAndMetadata() helper, which lists the metadata which is known safe to retain on speculation. Differential Revision: https://reviews.llvm.org/D146629	2023-04-20 14:17:15 +02:00
Jay Foad	84f36b82f4	[IR] Remove dead code for unsupported ConstantExpr binops	2023-04-20 11:18:00 +01:00
OCHyams	04452e83de	Remove unused ValueTracking.h include from DebugInfo.cpp Buildbot: https://buildkite.com/llvm-project/upstream-bazel/builds/59967#01879985-8d44-4041-9cd0-a1e41371208e See https://reviews.llvm.org/D148536	2023-04-19 15:16:41 +01:00
OCHyams	571eaead17	Reapply "[Assignment Tracking] Fix fragment error for some DSE-shortened stores" This reverts commit 6db6ab4815a44bfcaabfcdd84a0ff458394f6f52 which reverts D148536. Build issues addressed in D148698.	2023-04-19 13:36:47 +01:00
OCHyams	ca10e73b53	[NFC] Rename isPointerOffset to getPointerOffsetFrom and move to Value.h Linking LLVMCore failed when building D148536 with shared libs enabled: https://lab.llvm.org/buildbot/#/builders/121/builds/29766 Make isPointerOffset a Value method and rename it to getPointerOffsetFrom. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D148698	2023-04-19 12:22:58 +01:00
OCHyams	6db6ab4815	Revert "[Assignment Tracking] Fix fragment error for some DSE-shortened stores" This reverts commit fca3e8e024f0015604d21e6f76f3e199345679c5. Buildbot: https://lab.llvm.org/buildbot/#/builders/121/builds/29766	2023-04-19 10:03:33 +01:00
OCHyams	fca3e8e024	[Assignment Tracking] Fix fragment error for some DSE-shortened stores `shortenAssignment` inserts dbg.assigns with fragments describing the dead part of a shortened store after each dbg.assign linked to the store. Without this patch it doesn't take into account that the dead part of a shortened store may be outside the bounds of a variable of a linked dbg.assign. It also doesn't correctly account for a non-zero offset in the address modifying `DIExpression` of the dbg.assign (which is possible for fragments now even though whole variables currently cannot have a non-zero offset in their alloca). Fix this by moving the dead slice into variable-space and performing an intersect of that adjusted slice with the existing fragment. This fixes a verifier error reported when building fuchsia with assignment tracking enabled: https://ci.chromium.org/ui/p/fuchsia/builders/ci/clang_toolchain.ci.core.x64-release/b8784000953022145169/overview Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D148536	2023-04-19 09:32:09 +01:00
Krasimir Georgiev	bf7f6b4436	Revert "Reapply [SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating" This reverts commit 6f7e5c0f1ac6cc3349a2e1479ac4208465b272c6. Seems to expose a miscompile in rust, possibly exposing a bug in LLVM somewhere. Investigation thread over at: https://rust-lang.zulipchat.com/#narrow/stream/187780-t-compiler.2Fwg-llvm/topic/LLVM.20D146629.20breakage	2023-04-19 08:28:48 +00:00
OCHyams	1950cb4b68	[Assignment Tracking] Skip empty-metadata dbg.declares in AssignmentTrackingPass Debug intrinsics sometimes end up with empty metadata location operands. The debug intrinsic interfaces return nullptr when retrieving location operand in this case. Skip empty-metadata dbg.declares to avoid dereferencing the nullptr. This doesn't affect the final debug info in any way. Reviewed By: jryans Differential Revision: https://reviews.llvm.org/D148204	2023-04-18 08:43:54 +01:00
Shraiysh Vaishay	7021182d6b	[nfc][llvm] Replace pointer cast functions in PointerUnion by llvm casting functions. This patch replaces the uses of PointerUnion.is function by llvm::isa, PointerUnion.get function by llvm::cast, and PointerUnion.dyn_cast by llvm::dyn_cast_if_present. This is according to the FIXME in the definition of the class PointerUnion. This patch does not remove them as they are being used in other subprojects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D148449	2023-04-17 13:40:51 -05:00
Nikita Popov	6f7e5c0f1a	Reapply [SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating This exposed a miscompile in GVN, which was fixed by D148129. ----- After D141386, violation of nonnull, range and align metadata results in poison rather than immediate undefined behavior, which means that these are now safe to retain when speculating. We only need to remove UB-implying metadata like noundef. This is done by adding a dropUBImplyingAttrsAndMetadata() helper, which lists the metadata which is known safe to retain on speculation. Differential Revision: https://reviews.llvm.org/D146629	2023-04-17 14:15:14 +02:00
Nikita Popov	62ef97e063	[llvm-c] Remove PassRegistry and initialization APIs Remove C APIs for interacting with PassRegistry and pass initialization. These are legacy PM concepts, and are no longer relevant for the new pass manager. Calls to these initialization functions can simply be dropped. Differential Revision: https://reviews.llvm.org/D145043	2023-04-14 12:12:48 +02:00
Nikita Popov	9fe78db4cd	[FunctionAttrs] Fix nounwind inference for landingpads Currently, FunctionAttrs treats landingpads as non-throwing, and will infer nounwind for functions with landingpads (assuming they can't unwind in some other way, e.g. via resume). There are two problems with this: * Non-cleanup landingpads with catch/filter clauses do not necessarily catch all exceptions. Unless there are catch ptr null or filter [0 x ptr] zeroinitializer clauses, we should assume that we may unwind past this landingpad. This seems like an outright bug. * Cleanup landingpads are skipped during phase one unwinding, so we effectively need to support unwinding past them. Marking these nounwind is technically correct, but not compatible with how unwinding works in reality. Fixes https://github.com/llvm/llvm-project/issues/61945. Differential Revision: https://reviews.llvm.org/D147694	2023-04-14 11:46:00 +02:00
Alexis Engelke	a2e596bdf8	[LegacyPM] Reduce number of calls to getName Repeatedly calling getName adds some overhead, which can be easily avoided by querying the name just once per function. The improvements are rather small (~0.5% back-end time in a compile-time optimized setting), but also very easy to achieve. Note that getting the name should be entirely avoidable in the common case, but would require more substantial changes. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D148145	2023-04-13 11:19:39 +02:00
Alexis Engelke	63a8ca3fe9	[LegacyPM] Call getPassName only when needed Even when time tracing is disabled, getPassName is currently still called. This adds an avoidable virtual function call for each pass. Fetching the pass name only when required slightly improves compile-time (particularly when LLVM is built without LTO). Reviewed By: nikic, MaskRay Differential Revision: https://reviews.llvm.org/D148022	2023-04-12 18:36:02 +02:00


5803 Commits