llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-26 04:26:08 +00:00

Author	SHA1	Message	Date
Thorsten Schütt	148fdc519c	[GlobalISel] Add G_ABDS and G_ABDU instructions (#118122 ) The DAG has the same instructions: the signed and unsigned absolute difference of it's input. For AArch64, they map to uabd and sabd for Neon and SVE. The Neon and SVE instructions will require custom patterns. They are pseudo opcodes and are not imported by the IRTranslator. We need combines to create them. PowerPC, ARM, and AArch64 have native instructions. /// i.e trunc(abs(sext(Op0) - sext(Op1))) becomes abds(Op0, Op1) /// or trunc(abs(zext(Op0) - zext(Op1))) becomes abdu(Op0, Op1) For GlobalISel, we are going to write the combines in MIR patterns. see: llvm/test/CodeGen/AArch64/abd-combine.ll - [ ] combine into abd - [ ] legalize and add td patterns	2024-12-04 12:53:15 +01:00
Luke Lau	5cd3e97362	[RISCV] Mark vmvNr.v as implicitly using vtype (#118414 ) This was pointed out in https://github.com/llvm/llvm-project/pull/118283#issuecomment-2512895919. We cannot move these between vtype definitions as they depend on SEW and require vill to be clear.	2024-12-04 15:10:57 +08:00
Alex Bradbury	8935165659	[NFCI][test] Use lit.local.cfg in test/MachineVerifier/RISCV to gate tests (#118217 ) 87cc4b48c08a627f330396f941b84671c5e591d5 added `REQUIRES: riscv64-registered-target` to the only test in this directory. It seems better (and also reflects what we do elsewhere, including in the AMDGPU/ subdir in MachineVerifier) to gate for all files in the directory via lit.local.cfg.	2024-12-02 16:33:57 +00:00
Piyou Chen	87cc4b48c0	[NFC] Fix buildbot fail by add riscv64-registered-target	2024-11-24 20:54:40 -08:00
Piyou Chen	7317a6e990	[RISCV][MachineVerifier] Use RegUnit for register liveness checking (#115980 ) For the RISC-V target, V14_V15 are not subregisters of v14m4, even though they share some registers. Currently, the MachineVerifier reports an error when checking register liveness for segment load/store operations. This patch adds additional register liveness checking, using RegUnit instead of subregisters, to prevent this error.	2024-11-25 12:43:39 +08:00
Thorsten Schütt	a5d09f4ad9	[GlobalISel] Add G_STEP_VECTOR instruction (#115598 ) aka llvm.stepvector Intrinsic	2024-11-11 10:45:02 +01:00
Matt Arsenault	1ceccbb0dd	VirtRegRewriter: Add implicit register defs for live out undef lanes (#112679 ) If an undef subregister def is live into another block, we need to maintain a physreg def to track the liveness of those lanes. This would manifest a verifier error after branch folding, when the cloned tail block use no longer had a def. We need to detect interference with other assigned intervals to avoid clobbering the undef lanes defined in other intervals, since the undef def didn't count as interference. This is pretty ugly and adds a new dependency on LiveRegMatrix, keeping it live for one more pass. It also adds a lot of implicit operand spam (we really should have a better representation for this). There is a missing verifier check for this situation. Added an xfailed test that demonstrates this. We may also be able to revert the changes in 47d3cbcf842a036c20c3f1c74255cdfc213f41c2. It might be better to insert an IMPLICIT_DEF before the instruction rather than using the implicit-def operand. Fixes #98474	2024-10-28 17:33:53 -07:00
Ellis Hoag	adaa603224	[MachineVerifier] Report errors from one thread at a time (#111605 ) Create the `ReportedErrors` class to track the number of reported errors during verification. The class will block reporting errors if some other thread is currently reporting an error. I've encountered a case where there were many different verifications reporting errors at the same time on different threads. This ensures that we don't start printing the error from one case until we are completely done printing errors from other cases. Most of the time `AbortOnError = true` so we usually abort after reporting the first error. Depends on https://github.com/llvm/llvm-project/pull/111602.	2024-10-11 11:53:44 -07:00
Aditi Medhane	60a8b2b1d0	[AMDGPU] Add MachineVerifier check to detect illegal copies from vector register to SGPR (#105494 ) Addition of a check in the MachineVerifier to detect and report illegal vector registers to SGPR copies in the AMDGPU backend, ensuring correct code generation. We can enforce this check only after SIFixSGPRCopies pass. This is half-fix in the pipeline with the help of isSSA MachineFuction property, the check is happening for passes after phi-node-elimination.	2024-09-19 13:57:44 +05:30
Craig Topper	009398b3b3	[MachineVerifier] Improve checks for G_INSERT_SUBVECTOR. (#109209 ) -Improve messages. -Remove redundant checks that are handled in generic code. -Add check that the subvector is smaller than the vector. -Add checks that subvector is smaller than the vector.	2024-09-18 18:31:16 -07:00
Craig Topper	e494e2a294	[MachineVerifier] Improve G_EXTRACT_SUBVECTOR checking (#109202 ) Check that the destination of G_EXTRACT_SUBVECTOR is smaller than the source. Improve wording of error messages.	2024-09-18 18:29:32 -07:00
Michael Maitland	e08c2178ef	[MachineVerifier] Fix bug in MachineVerifier for G_INSERT_SUBVECTOR (#109048 )	2024-09-17 16:57:41 -04:00
Michael Maitland	ee2add0683	[GISEL] Fix bugs and clarify spec of G_EXTRACT_SUBVECTOR (#108848 ) The implementation was missing the fact that `G_EXTRACT_SUBVECTOR` destination and source vector can be different types. Also fix a bug in the MIR builder for `G_EXTRACT_SUBVECTOR` to generate the correct opcode. Clarify the G_EXTRACT_SUBVECTOR specification.	2024-09-17 10:08:39 -04:00
Aditi Medhane	d24a2fd38e	[AMDGPU] Create dir for amdgpu specific machineverifier tests (#106960 ) Move the AMDGPU target specific testcases in MachineVerifier separately into new directory. Reference : https://github.com/llvm/llvm-project/pull/105494#discussion_r1735055750	2024-09-03 15:19:04 +05:30
Craig Topper	d5c292d8ef	[GISel][RISCV] Correctly handle scalable vector shuffles of pointer vectors in IRTranslator. (#106580 )	2024-08-29 12:35:50 -07:00
Sumanth Gundapaneni	b941ba1e12	llvm.lround: Update verifier to validate support of vector types. (#98950 ) Both IRVerifier and Machine Verifier are updated	2024-08-20 13:24:15 -05:00
Thorsten Schütt	1cc1072349	[GlobalIsel] Add G_SCMP and G_UCMP instructions (#98894 ) https://github.com/llvm/llvm-project/pull/83227	2024-07-18 16:22:37 +02:00
Thorsten Schütt	493d504b35	[GlobalIsel] Fix Machine Verifier errors (#99018 ) temporary solution. For discussion see https://github.com/llvm/llvm-project/pull/98894 Permanent solution could be: REQUIRES: default_triple	2024-07-17 06:04:46 +02:00
Karl-Johan Karlsson	31480b0cc8	[test] Avoid writing to a potentially write-protected dir (#89242 ) These tests just don't check the output written to the current directory. The current directory may be write protected e.g. in a sandboxed environment. The Testcases that use -emit-llvm and -verify only care about stdout/stderr and are in this patch changed to use -emit-llvm-only to avoid writing to an output file. The verify-inlineasmbr.mir testcase that also only care about stdout/stderr is in this patch changed to throw away the output file and just write to /dev/null.	2024-04-20 12:26:58 +02:00
Michael Maitland	8aa3a77eaf	[RISCV][GISEL] Legalize G_ZEXT, G_SEXT, and G_ANYEXT, G_SPLAT_VECTOR, and G_ICMP for scalable vector types This patch legalizes G_ZEXT, G_SEXT, and G_ANYEXT. If the type is a legal mask type, then the instruction is legalized as the element-wise select, where the condition on the select is the mask typed source operand, and the true and false values are 1 or -1 (for zero/any-extension and sign extension) and zero. If the type is a legal integer or vector integer type, then the instruction is marked as legal. The legalization of the extends may introduce a G_SPLAT_VECTOR, which needs to be legalized in this patch for the extend test cases to pass. A G_SPLAT_VECTOR is legal if the vector type is a legal integer or floating point vector type and the source operand is sXLen type. This is because the SelectionDAG patterns only support sXLen typed ISD::SPLAT_VECTORS, and we'd like to reuse those patterns. A G_SPLAT_VECTOR is cutom legalized if it has a legal s1 element vector type and s1 scalar operand. It is legalized to G_VMSET_VL or G_VMCLR_VL if the splat is all ones or all zeros respectivley. In the case of a non-constant mask splat, we legalize by promoting the scalar value to s8. In order to get the s8 element vector back into s1 vector, we use a G_ICMP. In order for the splat vector and extend tests to pass, we also need to legalize G_ICMP in this patch. A G_ICMP is legal if the destination type is a legal bool vector and the LHS and RHS are legal integer vector types.	2024-04-03 15:27:15 -07:00
Michael Maitland	da9f06c9b1	[GISEL] G_SPLAT_VECTOR can take a splat that is larger than the vector element (#86974 ) This is what SelectionDAG does. We'd like to reuse SelectionDAG patterns.	2024-04-01 08:46:22 -04:00
Evgenii Kudriashov	d365a45cb3	[GlobalISel] Introduce G_TRAP, G_DEBUGTRAP, G_UBSANTRAP (#84941 ) Here we introduce three new GMIR instructions to cover a set of trap intrinsics. The idea behind it is that generic intrinsics shouldn't be used with G_INTRINSIC opcode. These new instructions can match perfectly with existing trap ISD nodes. It allows X86, AArch64, RISCV and Mips to reuse SelectionDAG patterns for selection and avoid manual selection. However AMDGPU is an exception. It selects traps during legalization regardless SelectionDAG or GlobalISel. Since there are not many places where traps are used, this change attempts to clean up all the usages of G_INTRINSIC with trap intrinsics. So, there is no stage when both G_TRAP and G_INTRINSIC_W_SIDE_EFFECTS(@llvm.trap) are allowed.	2024-03-23 13:12:44 +01:00
Jonas Paulsson	7564566779	Reapply "Move assertion for AdjustsStack from PEI to MachineVerifier (#85698 )" - The check is now actually done in both PEI and the MachineVerifier. - More .mir tests trivially updated with "adjustsStack: true" as needed.	2024-03-21 20:24:57 -04:00
Jonas Paulsson	9ebd329ad8	Revert "Move assertion for AdjustsStack from PEI to MachineVerifier. (#85698 )" This reverts commit 05bde30585710a51592eee0a6cf6df8184d09c92. Reverting due to verifier complaints with expensive checks on build-bot.	2024-03-20 11:48:30 -04:00
Jonas Paulsson	05bde30585	Move assertion for AdjustsStack from PEI to MachineVerifier. (#85698 ) Have the verifier report a missing AdjustsStack flag rather than waiting until PEI asserts.	2024-03-20 10:29:12 -04:00
Michael Maitland	2f400a2fd7	[GISEL] Add G_VSCALE instruction (#84542 )	2024-03-12 20:22:49 -04:00
Michael Maitland	034cc2f5d0	[GISEL] Add G_INSERT_SUBVECTOR and G_EXTRACT_SUBVECTOR (#84538 ) G_INSERT and G_EXTRACT are not sufficient to use to represent both INSERT/EXTRACT on a subregister and INSERT/EXTRACT on a vector. We would like to be able to INSERT/EXTRACT on vectors in cases that INSERT/EXTRACT on vector subregisters are not sufficient, so we add these opcodes. I tried to do a patch where we treated G_EXTRACT as both G_EXTRACT_SUBVECTOR and G_EXTRACT_SUBREG, but ran into an infinite loop at this [point](`8b5b294ec2/llvm/lib/Target/RISCV/RISCVISelLowering.cpp (L9932)`) in the SDAG equivalent code.	2024-03-11 13:47:30 -04:00
Jay Foad	fd3eaf76ba	[GISel] Enforce G_PTR_ADD RHS type matching index size for addr space (#84352 )	2024-03-09 09:07:22 +00:00
Fangrui Song	66bd3cd75b	[AMDGPU,test] Change llc -march= to -mtriple= PR #75982 had been created before these tests were added, therefore some test were not updated.	2024-03-07 19:09:18 -08:00
Michael Maitland	96049fcf4e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) Recommits llvm/llvm-project#80378 which was reverted in llvm/llvm-project#84330. The problem was that the change in llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir used 217 as an opcode instead of a regex.	2024-03-07 09:10:03 -08:00
Michael Maitland	552da24843	Revert "[GISEL] Add IRTranslation for shufflevector on scalable vector types" (#84330 ) Reverts llvm/llvm-project#80378 causing Buildbot failures that did not show up with check-llvm or CI.	2024-03-07 10:16:31 -05:00
Michael Maitland	2b8aaef09e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) This patch is stacked on https://github.com/llvm/llvm-project/pull/80372, https://github.com/llvm/llvm-project/pull/80307, and https://github.com/llvm/llvm-project/pull/80306. ShuffleVector on scalable vector types gets IRTranslate'd to G_SPLAT_VECTOR since a ShuffleVector that has operates on scalable vectors is a splat vector where the value of the splat vector is the 0th element of the first operand, because the index mask operand is the zeroinitializer (undef and poison are treated as zeroinitializer here). This is analogous to what happens in SelectionDAG for ShuffleVector. `buildSplatVector` is renamed to`buildBuildVectorSplatVector`. I did not make this a separate patch because it would cause problems to revert that change without reverting this change too.	2024-03-07 09:50:29 -05:00
Sameer Sahasrabuddhe	60822637bf	Restore "Implement convergence control in MIR using SelectionDAG (#71785 )" This restores commit c7fdd8c11e54585dc9d15d63de9742067e0506b9. Previously reverted in f010b1bef4dda2c7082cbb41dbabf1f149cce306. LLVM function calls carry convergence control tokens as operand bundles, where the tokens themselves are produced by convergence control intrinsics. This patch implements convergence control tokens in MIR as follows: 1. Introduce target-independent ISD opcodes and MIR opcodes for convergence control intrinsics. 2. Model token values as untyped virtual registers in MIR. The change also introduces an additional ISD opcode CONVERGENCECTRL_GLUE and a corresponding machine opcode with the same spelling. This glues the convergence control token to SDNodes that represent calls to intrinsics. The glued token is later translated to an implicit argument in the MIR. The lowering of calls to user-defined functions is target-specific. On AMDGPU, the convergence control operand bundle at a non-intrinsic call is translated to an explicit argument to the SI_CALL_ISEL instruction. Post-selection adjustment converts this explicit argument to an implicit argument on the SI_CALL instruction.	2024-03-06 12:19:32 +05:30
Mitch Phillips	f010b1bef4	Revert "Restore "Implement convergence control in MIR using SelectionDAG (#71785 )"" This reverts commit c7fdd8c11e54585dc9d15d63de9742067e0506b9. Reason: Broke the sanitizer buildbots. See the comments at https://github.com/llvm/llvm-project/pull/71785 for more information.	2024-03-04 17:05:34 +01:00
Sameer Sahasrabuddhe	c7fdd8c11e	Restore "Implement convergence control in MIR using SelectionDAG (#71785 )" Original commit 79889734b940356ab3381423c93ae06f22e772c9. Perviously reverted in commit a2afcd5721869d1d03c8146bae3885b3385ba15e. LLVM function calls carry convergence control tokens as operand bundles, where the tokens themselves are produced by convergence control intrinsics. This patch implements convergence control tokens in MIR as follows: 1. Introduce target-independent ISD opcodes and MIR opcodes for convergence control intrinsics. 2. Model token values as untyped virtual registers in MIR. The change also introduces an additional ISD opcode CONVERGENCECTRL_GLUE and a corresponding machine opcode with the same spelling. This glues the convergence control token to SDNodes that represent calls to intrinsics. The glued token is later translated to an implicit argument in the MIR. The lowering of calls to user-defined functions is target-specific. On AMDGPU, the convergence control operand bundle at a non-intrinsic call is translated to an explicit argument to the SI_CALL_ISEL instruction. Post-selection adjustment converts this explicit argument to an implicit argument on the SI_CALL instruction.	2024-03-04 13:28:04 +05:30
Sameer Sahasrabuddhe	a2afcd5721	Revert "Implement convergence control in MIR using SelectionDAG (#71785 )" This reverts commit 79889734b940356ab3381423c93ae06f22e772c9. Encountered multiple buildbot failures.	2024-02-21 11:07:02 +05:30
Sameer Sahasrabuddhe	79889734b9	Implement convergence control in MIR using SelectionDAG (#71785 ) LLVM function calls carry convergence control tokens as operand bundles, where the tokens themselves are produced by convergence control intrinsics. This patch implements convergence control tokens in MIR as follows: 1. Introduce target-independent ISD opcodes and MIR opcodes for convergence control intrinsics. 2. Model token values as untyped virtual registers in MIR. The change also introduces an additional ISD opcode CONVERGENCECTRL_GLUE and a corresponding machine opcode with the same spelling. This glues the convergence control token to SDNodes that represent calls to intrinsics. The glued token is later translated to an implicit argument in the MIR. The lowering of calls to user-defined functions is target-specific. On AMDGPU, the convergence control operand bundle at a non-intrinsic call is translated to an explicit argument to the SI_CALL_ISEL instruction. Post-selection adjustment converts this explicit argument to an implicit argument on the SI_CALL instruction.	2024-02-21 10:06:37 +05:30
Shilei Tian	58e8c072aa	[FIX] Require AMDGPU target in test case `llvm/test/MachineVerifier/writelane_m0.mir`	2024-01-30 15:47:19 -05:00
Shilei Tian	6a21e00e39	[AMDGPU][AsmParser] Allow `v_writelane_b32` to use SGPR and M0 as source operands at the same time (#78827 ) Currently the asm parser takes `v_writelane_b32 v1, s13, m0` as illegal instruction for pre-gfx11 because it uses two constant buses while the hardware can only allow one. However, based on the comment of `AMDGPUInstructionSelector::selectWritelane`, it is allowed to have M0 as lane selector and a SGPR used as SRC0 because the lane selector doesn't count as a use of constant bus. In fact, codegen can already generate this form, but this inconsistency is not exposed because the validation of constant bus limitation only happens when paring an assembly but we don't have a test case when both SGPR and M0 used as source operands for the instruction.	2024-01-30 15:39:31 -05:00
Fangrui Song	9e9907f1cf	[AMDGPU,test] Change llc -march= to -mtriple= (#75982 ) Similar to 806761a7629df268c8aed49657aeccffa6bca449. For IR files without a target triple, -mtriple= specifies the full target triple while -march= merely sets the architecture part of the default target triple, leaving a target triple which may not make sense, e.g. amdgpu-apple-darwin. Therefore, -march= is error-prone and not recommended for tests without a target triple. The issue has been benign as we recognize $unknown-apple-darwin as ELF instead of rejecting it outrightly. This patch changes AMDGPU tests to not rely on the default OS/environment components. Tests that need fixes are not changed: ``` LLVM :: CodeGen/AMDGPU/fabs.f64.ll LLVM :: CodeGen/AMDGPU/fabs.ll LLVM :: CodeGen/AMDGPU/floor.ll LLVM :: CodeGen/AMDGPU/fneg-fabs.f64.ll LLVM :: CodeGen/AMDGPU/fneg-fabs.ll LLVM :: CodeGen/AMDGPU/r600-infinite-loop-bug-while-reorganizing-vector.ll LLVM :: CodeGen/AMDGPU/schedule-if-2.ll ```	2024-01-16 21:54:58 -08:00
Jay Foad	35ebd92d3d	[GlobalISel] Add G_PREFETCH (#74863 )	2023-12-11 11:06:50 +00:00
Matt Arsenault	c44dca15a4	MachineVerifier: Reject extra non-register operands on instructions (#73758 ) We were allowing extra immediate arguments, and only bothering to check if registers were implicit or not. Also consolidate extra operand checks in verifier, to make this testable. We had 3 different places checking if you were trying to build an instruction with more operands than allowed by the definition. We had an assertion in addOperand, a direct check in the MIRParser to avoid the assertion, and the machine verifier checks. Remove the assert and parser check so the verifier can provide a consistent verification experience, which will also handle instructions modified in place.	2023-11-30 22:33:42 +09:00
Michael Maitland	4832c88e49	[MachineVerifier] Fix COPY check in MachineVerifier for scalable vectors This change fixes #71518, which compared the KnownMinValue of the scalable virtual register with the FixedSize of the physical register in the wrong direction. It turns out that we cannot include this check at all since it will lead to a false failures. Test cases are added to show that the false failures no longer occur after this fix.	2023-11-07 14:49:38 -08:00
Michael Maitland	ac4ff6168a	[CodeGen][MachineVerifier] Use TypeSize instead of unsigned for getRe… (#70881 ) …gSizeInBits This patch changes getRegSizeInBits to return a TypeSize instead of an unsigned in the case that a virtual register has a scalable LLT. In the case that register is physical, a Fixed TypeSize is returned. The MachineVerifier pass is updated to allow copies between fixed and scalable operands as long as the Src size will fit into the Dest size. This is a precommit which will be stacked on by a change to GISel to generate COPYs with a scalable destination but a fixed size source. This patch is stacked on https://github.com/llvm/llvm-project/pull/70893 for the ability to use scalable vector types in MIR tests.	2023-11-07 14:38:46 -05:00
Craig Topper	9a7c26a399	[GISel] Restrict G_BSWAP to multiples of 16 bits. (#70245 ) This is consistent with the IR verifier and SelectionDAG's getNode. Update tests accordingly. I tried to keep some coverage of non-pow2 when possible. X86 didn't like a G_UNMERGE_VALUES from s48 to 3 s16 that got created when I tried s48.	2023-10-30 10:27:57 -07:00
Visoiu Mistrih Francis	cc9ba5600e	[test] -march -> -mtriple (#67741 ) Similar to 806761a	2023-09-29 10:43:23 -07:00
Fangrui Song	d20190e684	[test] Change llc -march=aarch64\|arm64 to -mtriple=aarch64\|arm64 Similar to commit 806761a7629df268c8aed49657aeccffa6bca449 to avoid issues due to object file format differences. These tests are currently benign.	2023-09-29 10:13:06 -07:00
Matt Arsenault	87117a2e20	MachineVerifier: Add tests which are incorrectly accepted In the process of splitting out the liveness tracking, I ran into these cases which should have been caught. There are still missing errors for some cases in the entry block. https://reviews.llvm.org/D127104	2023-09-01 09:06:57 -04:00
Harvin Iriawan	db158c7c83	[AArch64] Update generic sched model to A510 Refresh of the generic scheduling model to use A510 instead of A55. Main benefits are to the little core, and introducing SVE scheduling information. Changes tested on various OoO cores, no performance degradation is seen. Differential Revision: https://reviews.llvm.org/D156799	2023-08-21 12:25:15 +01:00
Jay Foad	5158097274	[X86] Fix checks for illegal physreg COPY instructions D105263 changed this test to not expect a MachineVerifier error on this instruction: ; FP16 reg is sub_reg of xmm %0:_(s16) = COPY $xmm0 D107082 changed the behaviour back again so that this instruction did cause an error, but the test was not updated to expect the error. Differential Revision: https://reviews.llvm.org/D148534	2023-04-18 12:03:06 +01:00

1 2 3

129 Commits