llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-16 18:46:37 +00:00

Author	SHA1	Message	Date
Victor Perez	98d5d3448d	[MLIR][GPU-LLVM] Define `-convert-gpu-to-llvm-spv` pass (#90972 ) Define pass for GPU to LLVM conversion for SPIR-V backend tool ingest. Supported operations: - `gpu.block_id` - `gpu.global_id` - `gpu.block_dim` - `gpu.thread_id` - `gpu.grid_dim` - `gpu.barrier` - `gpu.shuffle` --------- Signed-off-by: Victor Perez <victor.perez@codeplay.com>	2024-05-31 17:47:53 +02:00
Adam Siemieniuk	8f4d5a32ac	[mlir][tensor] Fold unpadding collapse_shape into extract_slice (#93554 )	2024-05-31 13:29:40 +02:00
Théo Degioanni	b86a9c5bf2	[mlir][irdl] Lookup symbols near dialects instead of locally (#92819 ) Because symbols cannot refer to operations outside of their symbol tables, it was impossible to refer to operations outside of the dialect currently being defined. This PR modifies the lookup logic to happen relative to the symbol table containing the dialect-defining operations. This is a bit of hack but should unblock the situation here.	2024-05-31 09:15:50 +01:00
Jeremy Kun	692ae5443b	[mlir][polynomial] verify from_tensor coeff type (#93243 ) Rebased over https://github.com/llvm/llvm-project/pull/93227 --------- Co-authored-by: Jeremy Kun <j2kun@users.noreply.github.com>	2024-05-30 11:03:36 -07:00
Andrzej Warzyński	435114f9fe	[mlir][test] Rename Vector integration tests for CPU (nfc) (#93521 ) To keep the test filenames consistent, this patch: * removes "test-" from file names (there used to be a mix of "test-feature-1.mlir" and "feature-2.mlir"), * replaces "_" with "-" (there used to be a mix of "feature-3.mlir" and "feature_4.mlir"). Only files under test/Integration/Dialect/Vector/CPU are updated.	2024-05-30 18:06:43 +01:00
Jay Foad	f6c8e7dc3e	[MLIR][AMDGPU] Remove support for old llvm.amdgcn.buffer.* intrinsics (#93838 ) They have been superseded by llvm.amdgcn.raw.buffer.* and llvm.amdgcn.struct.buffer.*.	2024-05-30 17:58:11 +01:00
Ryan Holt	1159e7645b	[mlir][linalg] Add folder for transpose(transpose) -> transpose (#93606 ) Back to back `linalg.transpose` can be rewritten to a single transpose	2024-05-30 10:41:29 -04:00
Jeremy Kun	1f46729a18	[polynomial] Move primitive root attribute to ntt/intt ops. (#93227 ) Better design to put semantics on the ops, and in this case the ntt/intt op can lower in multiple ways depending on the polynomial ring modulus (it can need an nth root of unity for cyclic polymul -> ntt, or a 2nth root for negacyclic polymul -> ntt) --------- Co-authored-by: Jeremy Kun <j2kun@users.noreply.github.com>	2024-05-30 07:09:18 -07:00
Cullen Rhodes	b49c0b8abc	[mlir][ArmSME] Simplify permutation map handling (#93515 ) In -convert-vector-to-arm-sme the permutation_map is explicitly checked for transpose when converting xfer ops, but for 2-D vector types the only non-identity permutation map is transpose so this can be simplified.	2024-05-30 13:22:43 +01:00
Mubashar Ahmad	bc946f5287	[mlir][vector] Add 1D vector.deinterleave lowering (#93042 ) This patch implements the lowering of vector.deinterleave for 1D vectors. For fixed vector types, the operation is lowered to two llvm shufflevector operations. One for even indexed elements and the other for odd indexed elements. A poison operation is used to satisfy the parameters of the shufflevector parameters. For scalable vectors, the llvm vector.deinterleave2 intrinsic is used for lowering. As such the results found by extraction and used to form the result struct for the intrinsic.	2024-05-30 09:42:35 +01:00
Oleksandr "Alex" Zinenko	67897d77ed	[mlir][py] invalidate nested operations when parent is deleted (#93339 ) When an operation is erased in Python, its children may still be in the "live" list inside Python bindings. After this, if some of the newly allocated operations happen to reuse the same pointer address, this will trigger an assertion in the bindings. This assertion would be incorrect because the operations aren't actually live. Make sure we remove the children operations from the "live" list when erasing the parent. This also concentrates responsibility over the removal from the "live" list and invalidation in a single place. Note that this requires the IR to be sufficiently structurally valid so a walk through it can succeed. If this invariant was broken by, e.g, C++ pass called from Python, there isn't much we can do.	2024-05-30 10:06:02 +02:00
Nikita Popov	d10b76552f	[ConstantFold] Remove notional over-indexing fold (#93697 ) The data-layout independent constant folding currently has some rather gnarly code for canonicalizing GEP indices to reduce "notional overindexing", and then infers inbounds based on that canonicalization. Now that we canonicalize to i8 GEPs, this canonicalization is essentially useless, as we'll discard it as soon as the GEP hits the data-layout aware constant folder anyway. As such, I'd like to remove this code entirely. This shouldn't have any impact on optimization capabilities.	2024-05-30 08:36:44 +02:00
Guy David	4bce270157	[mlir][llvm] Implement ConstantLike for ZeroOp, UndefOp, PoisonOp (#93690 ) These act as constants and should be propagated whenever possible. It is safe to do so for mlir.undef and mlir.poison because they remain "dirty" through out their lifetime and can be duplicated, merged, etc. per the LangRef. Signed-off-by: Guy David <guy.david@nextsilicon.com>	2024-05-30 08:21:08 +02:00
Christian Ulmann	6a3982f8b7	[MLIR][LLVM] Relax the LLVM dialect's inliner assuming UCF (#93514 ) This commit changes the LLVM dialect's inliner interface to stop assuming that the inlined function only contained unstructured control flow. This is not necessarily true, and it lead to not properly propagating the noalias information.	2024-05-30 07:58:13 +02:00
Mehdi Amini	e6821dd8c8	Revert "[MLIR][Python] add ctype python binding support for bf16" (#93771 ) Reverts llvm/llvm-project#92489 This broke the bots.	2024-05-29 23:21:04 -06:00
Bimo	89801c74c3	[MLIR][Python] add ctype python binding support for bf16 (#92489 ) Since bf16 is supported by mlir, similar to complex128/complex64/float16, we need an implementation of bf16 ctype in Python binding. Furthermore, to resolve the absence of bf16 support in NumPy, a third-party package [ml_dtypes ](https://github.com/jax-ml/ml_dtypes) is introduced to add bf16 extension, and the same approach was used in `torch-mlir` project. See motivation and discussion in: https://discourse.llvm.org/t/how-to-run-executionengine-with-bf16-dtype-in-mlir-python-bindings/79025	2024-05-29 22:01:40 -07:00
Mehdi Amini	5bec47c1ef	Revert "[mlir][spirv] Add integration test for `vector.interleave` and `vector.shuffle`" (#93732 ) Reverts llvm/llvm-project#93595 This broke the gcc-7 bot.	2024-05-29 14:29:01 -06:00
Valentin Clement (バレンタインクレメン)	428b9be648	[mlir] Align num elements type to LLVM ArrayType (#93230 ) MLIR LLMArrayType is using `unsigned` for the number of elements while LLVM ArrayType is using `uint64_t` `4ae896fe97/llvm/include/llvm/IR/DerivedTypes.h (L377)` This leads to silent truncation when we use it for globals in flang. ``` program test integer(8), parameter :: large = 2**30 real, dimension(large) :: bigarray common /c/ bigarray bigarray(999) = 666 end ``` The above program would result in a segfault since the global would be of size 0 because of the silent truncation. ``` fir.global common @c_(dense<0> : vector<4294967296xi8>) : !fir.array<4294967296xi8> ``` became ``` llvm.mlir.global common @c_(dense<0> : vector<4294967296xi8>) {addr_space = 0 : i32} : !llvm.array<0 x i8> ``` This patch updates the definition of MLIR ArrayType to take `uint64_t` as argument of the number of elements to be compatible with LLVM.	2024-05-29 13:05:44 -07:00
Angel Zhang	c9c244423f	[mlir][spirv] Add integration test for `vector.interleave` and `vector.shuffle` (#93595 ) - Add integration test for `vector.shuffle` and `vector.interleave`, mentioned in issue #91978 - Add `VectorToSPIRV` patterns to `GPUToSPIRVPass` --------- Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2024-05-29 12:19:32 -04:00
donald chen	0981dca777	[mlir][arith] Add neutral element support to arith.maxnumf/arith.minnumf (#93278 ) For maxnumf and minnumf, the result of calculations involving NaN will be another value, so their neutral element is set to NaN.	2024-05-29 10:20:49 -04:00
zjgarvey	42a0fb2333	[mlir][linalg] Add linalg.conv_2d_ngchw_gfchw_q to named ops (#92136 ) Adds a named op: linalg.conv_2d_ngchw_gfchw_q. This op is similar to linalg.conv_2d_ngchw_gfchw, but additionally incorporates zero point offset corrections.	2024-05-29 12:55:05 +02:00
Simon Camphausen	1594cebedd	[mlir][EmitC] Fix evaluation order of expressions (#93549 ) Expressions with the same precedence were not parenthesized and therefore were possibly evaluated in the wrong order depending on the shape of the expression tree. --------- Co-authored-by: Matthias Gehre <matthias.gehre@amd.com> Co-authored-by: Corentin Ferry <corentin.ferry@amd.com>	2024-05-29 11:42:06 +02:00
Guray Ozen	7f58ffd09b	[mlir][python] Yield results of `scf.for_` (#93610 ) Using `for_` is very hand with python bindings. Currently, it doesn't support results, we had to fallback to two lines scf.for. This PR yields results of scf.for in `for_` --------- Co-authored-by: Maksim Levental <maksim.levental@gmail.com>	2024-05-29 08:43:13 +02:00
Jakub Kuderski	5bfe4b93e1	[mlir][arith] Disallow casting tensor dimensions (#93349 ) Tighten the verifier for arith cast ops to disallow changing tensor dimensions, e.g., static to dynamic. After this change: * `arith.cast_op %x : tensor<4xi32> to tensor<4xf32>` remains valid * `arith.cast_op %x : tensor<4xi32> to tensor<?xf32>` becomes invalid * `arith.cast_op %x : tensor<?xi32> to tensor<4xf32>` becomes invalid This is mostly to simplify the op semantics. See the discussion thread for more context: https://discourse.llvm.org/t/rfc-remove-arith-math-ops-on-tensors/74357/63.	2024-05-28 20:04:41 -04:00
Spenser Bauman	6aeea700df	[mlir][dataflow] Fix for integer range analysis propagation bug (#93199 ) Integer range analysis will not update the range of an operation when any of the inferred input lattices are uninitialized. In the current behavior, all lattice values for non integer types are uninitialized. For operations like arith.cmpf ```mlir %3 = arith.cmpf ugt, %arg0, %arg1 : f32 ``` that will result in the range of the output also being uninitialized, and so on for any consumer of the arith.cmpf result. When control-flow ops are involved, the lack of propagation results in incorrect ranges, as the back edges for loop carried values are not properly joined with the definitions from the body region. For example, an scf.while loop whose body region produces a value that is in a dataflow relationship with some floating-point values through an arith.cmpf operation: ```mlir func.func @test_bad_range(%arg0: f32, %arg1: f32) -> (index, index) { %c4 = arith.constant 4 : index %c1 = arith.constant 1 : index %c0 = arith.constant 0 : index %3 = arith.cmpf ugt, %arg0, %arg1 : f32 %1:2 = scf.while (%arg2 = %c0, %arg3 = %c0) : (index, index) -> (index, index) { %2 = arith.cmpi ult, %arg2, %c4 : index scf.condition(%2) %arg2, %arg3 : index, index } do { ^bb0(%arg2: index, %arg3: index): %4 = arith.select %3, %arg3, %arg3 : index %5 = arith.addi %arg2, %c1 : index scf.yield %5, %4 : index, index } return %1#0, %1#1 : index, index } ``` The existing behavior results in the control condition %2 being optimized to true, turning the while loop into an infinite loop. The update to %arg2 through the body region is never factored into the range calculation, as the ranges for the body ops all test as uninitialized. This change causes all values initialized with setToEntryState to be set to some initialized range, even if the values are not integers. --------- Co-authored-by: Spenser Bauman <sabauma@fastmail>	2024-05-28 18:29:17 -04:00
Guray Ozen	51752ed0dd	[mlir][nvgpu] verify the module	2024-05-28 21:17:31 +02:00
Kunwar Grover	debdbeda15	[mlir] Remove dialect specific bufferization passes (Reland) (#93535 ) These passes have been depreciated for a long time and replaced by one-shot bufferization. These passes are also unsafe because they do not check for read-after-write conflicts. Relands https://github.com/llvm/llvm-project/pull/93488 which failed on buildbot. Fixes the failure by updating integration tests to use one-shot-bufferize instead.	2024-05-28 20:04:27 +01:00
Peiming Liu	99835922ca	[mlir][sparse] remove sparse encoding propagation pass. (#93593 )	2024-05-28 11:23:15 -07:00
Matthias Gehre	af22e274e9	TosaToTensor: Support reshape on tensors of unsigned integer (#91734 ) This adds - `mlir::tosa::populateTosaToLinalgTypeConversion` which converts tensors of unsigned integers into tensors of signless integers - modifies the `tosa.reshape` lowering in TosaToTensor to use the type converter correctly I choose to implement the type converter in `mlir/Conversion/TosaToLinalg/TosaToLinalg.h` instead of `mlir/Conversion/TosaToTensor/TosaToTensor.h` because I need the same type converter in the TosaToLinalg lowerings (future PR). Alternatively, I could duplicate the type converter so it exists both in TosaToLinalg and TosaToTensor. Let me know if you prefer that.	2024-05-28 17:59:23 +02:00
stefankoncarevic	94be801879	[mlir][ROCDL] Update the LLVM data layout for ROCDL lowering. (#92127 ) This change updates the dataLayout string to ensure alignment with the latest LLVM TargetMachine configuration. The aim is to maintain consistency and prevent potential compilation issues related to memory address space handling.	2024-05-28 10:17:02 -05:00
Ryan Holt	74ed79f7f1	[mlir][linalg] Add linalg.transpose constant folding (#92589 ) There was existing support for constant folding a `linalg.generic` that was actually a transpose. This commit adds support for the named op, `linalg.transpose`, as well by making use of the `LinalgOp` interface.	2024-05-28 10:42:32 -04:00
Sayan Saha	44861c7ac5	[mlir] [linalg] Check for dim shape to decide unit dim for each operand in dropUnitDims pass. (#93317 ) `mlir-opt --linalg-fold-unit-extent-dims` pass on the following IR ``` #map = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1 + d4, d2 + d5, d6)> #map1 = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d4, d5, d6, d3)> #map2 = affine_map<(d0, d1, d2, d3, d4, d5, d6) -> (d0, d1, d2, d3)> module { func.func @main(%arg0: tensor<1x?x?x1xf32>, %arg1: index) -> tensor<?x1x61x1xf32> { %cst = arith.constant dense<1.000000e+00> : tensor<1x1x1x1xf32> %0 = tensor.empty(%arg1) : tensor<?x1x61x1xf32> %1 = linalg.generic {indexing_maps = [#map, #map1, #map2], iterator_types = ["parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"]} ins(%arg0, %cst : tensor<1x?x?x1xf32>, tensor<1x1x1x1xf32>) outs(%0 : tensor<?x1x61x1xf32>) { ^bb0(%in: f32, %in_0: f32, %out: f32): %2 = arith.mulf %in, %in_0 : f32 %3 = arith.addf %out, %2 : f32 linalg.yield %3 : f32 } -> tensor<?x1x61x1xf32> return %1 : tensor<?x1x61x1xf32> } } ``` produces an incorrect tensor.expand_shape operation: ``` error: 'tensor.expand_shape' op expected dimension 0 of collapsed type to be dynamic since one or more of the corresponding dimensions in the expanded type is dynamic %1 = linalg.generic {indexing_maps = [#map, #map1, #map2], iterator_types = ["parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"]} ins(%arg0, %cst : tensor<1x?x?x1xf32>, tensor<1x1x1x1xf32>) outs(%0 : tensor<?x1x61x1xf32>) { ^ /mathworks/devel/sandbox/sayans/geckWorks/g3294570/repro.mlir:8:10: note: see current operation: %5 = "tensor.expand_shape"(%4) <{reassociation = [[0, 1, 2, 3]]}> : (tensor<61xf32>) -> tensor<?x1x61x1xf32> // -----// IR Dump After LinalgFoldUnitExtentDimsPass Failed (linalg-fold-unit-extent-dims) //----- // #map = affine_map<(d0) -> (0, d0)> #map1 = affine_map<(d0) -> ()> #map2 = affine_map<(d0) -> (d0)> "builtin.module"() ({ "func.func"() <{function_type = (tensor<1x?x?x1xf32>, index) -> tensor<?x1x61x1xf32>, sym_name = "main"}> ({ ^bb0(%arg0: tensor<1x?x?x1xf32>, %arg1: index): %0 = "arith.constant"() <{value = dense<1.000000e+00> : tensor<f32>}> : () -> tensor<f32> %1 = "tensor.collapse_shape"(%arg0) <{reassociation = [[0, 1], [2, 3]]}> : (tensor<1x?x?x1xf32>) -> tensor<?x?xf32> %2 = "tensor.empty"() : () -> tensor<61xf32> %3 = "tensor.empty"() : () -> tensor<61xf32> %4 = "linalg.generic"(%1, %0, %2, %3) <{indexing_maps = [#map, #map1, #map2, #map2], iterator_types = [#linalg.iterator_type<parallel>], operandSegmentSizes = array<i32: 3, 1>}> ({ ^bb0(%arg2: f32, %arg3: f32, %arg4: f32, %arg5: f32): %6 = "arith.mulf"(%arg2, %arg3) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 %7 = "arith.addf"(%arg4, %6) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 "linalg.yield"(%7) : (f32) -> () }) : (tensor<?x?xf32>, tensor<f32>, tensor<61xf32>, tensor<61xf32>) -> tensor<61xf32> %5 = "tensor.expand_shape"(%4) <{reassociation = [[0, 1, 2, 3]]}> : (tensor<61xf32>) -> tensor<?x1x61x1xf32> "func.return"(%5) : (tensor<?x1x61x1xf32>) -> () }) : () -> () }) : () -> () ``` The reason of this is because the dimension `d0` is determined to be an unit-dim that can be dropped based on the dimensions of operand `arg0` to `linalg.generic`. Later on when iterating over operand `outs` the dimension `d0` is determined to be an unit-dim even though the shape corresponding to it is `Shape::kDynamic`. For the `linalg.generic` to be valid `d0` of `outs` does need to be `1` but that isn't properly processed in the current implementation and the dimension is dropped resulting in `outs` operand to be `tensor<61xf32>` in the example. The fix is to also check that the dimension shape is actually `1` before dropping the dimension. The IR after the fix is: ``` #map = affine_map<()[s0, s1] -> (s0 * s1)> #map1 = affine_map<(d0) -> (0, d0)> #map2 = affine_map<(d0) -> ()> module { func.func @main(%arg0: tensor<1x?x?x1xf32>, %arg1: index) -> tensor<?x1x61x1xf32> { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %cst = arith.constant dense<1.000000e+00> : tensor<f32> %collapsed = tensor.collapse_shape %arg0 [[0, 1], [2, 3]] : tensor<1x?x?x1xf32> into tensor<?x?xf32> %0 = tensor.empty(%arg1) : tensor<?x61xf32> %1 = affine.apply #map()[%arg1, %c1] %2 = tensor.empty(%1) : tensor<?x61xf32> %3 = linalg.generic {indexing_maps = [#map1, #map2, #map1, #map1], iterator_types = ["parallel"]} ins(%collapsed, %cst, %0 : tensor<?x?xf32>, tensor<f32>, tensor<?x61xf32>) outs(%2 : tensor<?x61xf32>) { ^bb0(%in: f32, %in_0: f32, %in_1: f32, %out: f32): %4 = arith.mulf %in, %in_0 : f32 %5 = arith.addf %in_1, %4 : f32 linalg.yield %5 : f32 } -> tensor<?x61xf32> %expanded = tensor.expand_shape %3 [[0, 1], [2, 3]] output_shape [%c0, 1, 61, 1] : tensor<?x61xf32> into tensor<?x1x61x1xf32> return %expanded : tensor<?x1x61x1xf32> } } ```	2024-05-28 15:21:46 +02:00
Artem Kroviakov	01fbc5658c	[mlir][vector] Add support for linearizing Insert VectorOp in VectorLinearize (#92370 ) Building on top of [#88204](https://github.com/llvm/llvm-project/pull/88204), this PR adds support for converting `vector.insert` into an equivalent `vector.shuffle` operation that operates on linearized (1-D) vectors.	2024-05-28 14:54:37 +02:00
josel-amd	125bd061c3	[mlir][emitc] Support conversion of arith.divsi and arith.remsi to EmitC (#93450 )	2024-05-28 14:36:24 +02:00
Adrian Kuegel	9b79acedd6	[mlir][Bazel] Followup to 57c10fa564af44a5b236bc642c540d715b04448c It removed the dependency from the wrong target. Also, we need to remove the header include to be able to remove the dependency from VectorToSPIRV.	2024-05-28 11:36:26 +00:00
Abid Qadeer	d80383730c	[mlir] Add missing fields in DICompositeTypeAttr. (#93226 ) The fortran arrays use 'dataLocation', 'rank', 'allocated' and 'associated' fields of the DICompositeType. These were not available in 'DICompositeTypeAttr'. This PR adds the missing fields. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-05-28 12:22:44 +01:00
Kunwar Grover	39848d0a98	Revert "[mlir] Remove dialect specific bufferization passes" (#93528 ) Reverts llvm/llvm-project#93488 Buildbot failure: https://lab.llvm.org/buildbot/#/builders/220/builds/39911	2024-05-28 11:21:34 +01:00
Kunwar Grover	2fc5106437	[mlir] Remove dialect specific bufferization passes (#93488 ) These passes have been depreciated for a long time and replaced by one-shot bufferization. These passes are also unsafe because they do not check for read-after-write conflicts.	2024-05-28 11:12:58 +01:00
Cullen Rhodes	ea20647023	[mlir][ArmSME] NFC: -force-streaming-compatible-sve rename fixup (#93177 ) -force-streaming-compatible-sve was renamed in #92774 but this test was missed, no longer required so removing.	2024-05-28 09:07:07 +01:00
donald chen	a0fdb38a76	[mlir][linalg] Add more precise memory effects to linalg op (#92079 ) This patch add more precise memory effect to linalg op. Including the following points: 1. Remove the read side effects for operands that are not used. 2. Set the effect for all side effects to "full".	2024-05-28 08:39:19 +02:00
Angel Zhang	57c10fa564	[mlir][spirv] Add vector.interleave to spirv.VectorShuffle conversion (#93240 ) - Add `vector.interleave` to `spirv.VectorShuffle` conversion - Remove the `vector.interleave` to `vector.shuffle` conversion from `populateVectorToSPIRVPatterns` and CMake/Bazel dependencies --------- Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2024-05-27 19:44:57 -04:00
Jakub Kuderski	714aee31e1	[mlir][vector] Add result type to `interleave` assembly format (#93392 ) This is to make it more obvious for what the result type is, especially with some less trivial cases like 0-d inputs resulting in 1-d inputs or interaction with scalable vector types. Note that `vector.deinterleave` uses the same format with explicit result type. Also improve examples and clean up surrounding code.	2024-05-27 11:03:36 -04:00
Marius Brehler	8760d4ba4c	[mlir][EmitC] Update comment (NFC)	2024-05-27 12:25:06 +00:00
Michael Kruse	064391df49	[mlir] Revise IDE folder structure (#89749 ) Update the folder titles for targets in the monorepository that have not seen taken care of for some time. These are the folders that targets are organized in Visual Studio and XCode (`set_property(TARGET <target> PROPERTY FOLDER "<title>")`) when using the respective CMake's IDE generator. * Ensure that every target is in a folder * Use a folder hierarchy with each LLVM subproject as a top-level folder * Use consistent folder names between subprojects * When using target-creating functions from AddLLVM.cmake, automatically deduce the folder. This reduces the number of `set_property`/`set_target_property`, but are still necessary when `add_custom_target`, `add_executable`, `add_library`, etc. are used. A LLVM_SUBPROJECT_TITLE definition is used for that in each subproject's root CMakeLists.txt.	2024-05-25 17:32:42 +02:00
Beal Wang	4d60be0452	[mlir] Do not print empty property (#93379 ) Skip printing property as `<<<NULL ATTRIBUTE>>>` when operation has an empty property. Co-authored-by: Biao Wang <biaow@nvidia.com>	2024-05-25 09:24:12 -06:00
Jacques Pienaar	c26847dc81	[mlir][drr] Allow variadic in rewrite side (#93340 ) Enables writing patterns where one has op creation with variadic in result pattern more easily. Signed-off-by: Jacques Pienaar <jpienaar@google.com>	2024-05-24 15:53:25 -07:00
Jakub Kuderski	8e3be5c38d	[mlir][arith] Clean up select op implementation (#93351 ) Inline traits used by `arith.select` only into `ArithOps.td`. Trim trailing whitespace in op description.	2024-05-24 18:24:42 -04:00
Oleksandr "Alex" Zinenko	8f21909c2f	[mlir] expose -debug-only equivalent to C and Python (#93175 ) These are useful for finer-grain debugging and complement the already exposed global debug flag.	2024-05-24 23:15:18 +02:00
Christopher Bate	9ad5da2def	[mlir][Pass] Add new FileTreeIRPrinterConfig (#67840 ) This change expands the existing instrumentation that prints the IR before/after each pass to an output stream (usually stderr). It adds a new configuration that will print the output of each pass to a separate file. The files will be organized into a directory tree rooted at a specified directory. For existing tools, a CL option `-mlir-print-ir-tree-dir` is added to specify this directory and activate the new printing config. The created directory tree mirrors the nesting structure of the IR. For example, if the IR is congruent to the pass-pipeline "builtin.module(pass1,pass2,func.func(pass3,pass4),pass5)", and `-mlir-print-ir-tree-dir=/tmp/pipeline_output`, then then the tree file tree created will look like: ``` /tmp/pass_output ├── builtin_module_the_symbol_name │ ├── 0_pass1.mlir │ ├── 1_pass2.mlir │ ├── 2_pass5.mlir │ ├── func_func_my_func_name │ │ ├── 1_0_pass3.mlir │ │ ├── 1_1_pass4.mlir │ ├── func_func_my_other_func_name │ │ ├── 1_0_pass3.mlir │ │ ├── 1_1_pass4.mlir ``` The subdirectories are named by concatenating the relevant parent operation names and symbol name (if present). The printer keeps a counter associated with ops that are targeted by passes and their isolated-from-above parents. Each filename is given a numeric prefix using the counter value for the op that the pass is targeting and then prepending the counter values for each parent. This gives a naming where it is easy to distinguish which passes may have run concurrently vs. which have a clear ordering. In the above example, for both `1_1_pass4.mlir` files, the first `1` refers to the counter for the parent op, and the second refers to the counter for the respective function.	2024-05-24 10:01:48 -06:00
tyb0807	8178a3ad1b	[mlir] Replace MLIR_ENABLE_CUDA_CONVERSIONS with LLVM_HAS_NVPTX_TARGET (#93008 ) LLVM_HAS_NVPTX_TARGET is automatically set depending on whether NVPTX was enabled when building LLVM. Use this instead of manually defining MLIR_ENABLE_CUDA_CONVERSIONS (whose name is a bit misleading btw).	2024-05-24 17:31:28 +02:00

1 2 3 4 5 ...

19916 Commits