llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-24 11:46:07 +00:00

Author	SHA1	Message	Date
jeanPerier	42d49a1a5b	[flang] fix openmp tests after #132261 (#132391 ) I missed these tests in my test update after #132261 because they are under `! REQUIRES: openmp_runtime`. Broke build bots: https://lab.llvm.org/buildbot/#/builders/50/builds/11789	2025-03-21 14:04:37 +01:00
Tom Eccles	3bcab6f20a	[flang][OpenMP][Semantics] improve semantic checks for array sections (#132230 ) I'm not sure why strides were not allowed in array sections: the stride is explicitly allowed by the standard from the first version where array sections were introduced. The limitation is that the stride must not be negative. Here I have added the check for a negative stride and updated the test for a zero length section to take account of the stride.	2025-03-21 10:58:44 +00:00
jeanPerier	44261dae5b	[flang][NFC] use hlfir.declare first result when both results are raw pointers (#132261 ) Currently, the helpers to get fir::ExtendedValue out of hlfir::Entity use hlfir.declare second result (`#1`) in most cases. This is because this result is the same as the input and matches what FIR was getting before lowering to HLFIR. But this creates odd situations when both hlfir.declare are raw pointers and either result ends-up being used in the IR depending on whether the code was generated by a helper using fir::ExtendedValue, or via "pure HLFIR" helpers using the first result. This will typically prevent simple CSE and easy identification that two operation (e.g load/store) are touching the exact same memory location without using alias analysis or "manual detection" (looking for common hlfir.declare defining op). Hence, when hlfir.declare results are both raw pointers, use `#0` when producing `fir::ExtendedValue`. When `#0` is a fir.box, keep using `#1` because these are not the same. The only code change is in HLFIRTools.cpp and is pretty small, but there is a big test fallout of `#1` to `#0`.	2025-03-21 11:41:04 +01:00
Sergio Afonso	b231f6f862	[MLIR][OpenMP] Improve omp.map.info verification (#132066 ) This patch makes the `map_type` and `map_capture_type` arguments of the `omp.map.info` operation required, which was already an invariant being verified by its users via `verifyMapClause()`. This makes it clearer, as getters no longer return misleading `std::optional` values. Checks for the `mapper_id` argument are moved to a verifier for the operation, rather than being checked by users. Functionally NFC, but not marked as such due to a reordering of arguments in the assembly format of `omp.map.info`.	2025-03-20 15:48:45 +00:00
Krzysztof Parzyszek	68180d8d16	[flang][OpenMP] Use OmpDirectiveSpecification in standalone directives (#131163 ) This uses OmpDirectiveSpecification in the rest of the standalone directives.	2025-03-20 06:50:43 -05:00
Slava Zakharin	2c91f10362	[flang] Fixed repacking for TARGET and INTENT(OUT) (#131972 ) TARGET dummy arrays can be accessed indirectly, so it is unsafe to repack them. INTENT(OUT) dummy arrays that require finalization on entry to their subroutine must be copied-in by `fir.pack_arrays`. In addition, based on my testing results, I think it will be useful to document that `LOC` and `IS_CONTIGUOUS` will have different values for the repacked arrays. I still need to decide where to document this, so just added a note in the design doc for the time being.	2025-03-19 17:12:32 -07:00
Peter Klausler	a0ae88bffe	[flang] Disable 3 x86-64 tests on non-x86-64 (#132088 ) Now that COMPLEX(KIND=10) is properly disabled where 80-bit floating-point types are not available, three tests that were not peculiar to x86-64 are failing on other targets. Make them specific to x86-64.	2025-03-19 12:52:59 -07:00
Peter Klausler	6b9716b7f4	[flang] Catch bad usage case of whole assumed-size array (#132052 ) Whole assumed-size arrays are generally not allowed outside specific contexts, where expression analysis notes that they can appear. But contexts can nest, and in the case of an actual argument that turns out to be an array constructor, the permission to use a whole assumed-size array must be rescinded. Fixes https://github.com/llvm/llvm-project/issues/131909.	2025-03-19 12:02:34 -07:00
Peter Klausler	7f7d7d552b	[flang] Use local name for structure constructor (#132047 ) When reinterpreting an ambiguously parsed function reference as a structure constructor, use the original symbol of the type in the representation of the derived type spec of the structure constructor, not its ultimate resolution. The distinction turns out to matter when generating module files containing derived type constants as initializers when the derived types' names have undergone USE association renaming. Fixes https://github.com/llvm/llvm-project/issues/131579.	2025-03-19 12:02:03 -07:00
Peter Klausler	b99dab2587	[flang] Exempt construct entities from SAVE check for PURE (#131383 ) A PURE subprogram can't have a local variable with the SAVE attribute. An ASSOCIATE or SELECT TYPE construct entity whose selector is a variable will return true from IsSave(); exclude them from the local variable check. Fixes https://github.com/llvm/llvm-project/issues/131356.	2025-03-19 12:01:18 -07:00
Peter Klausler	9f284e1784	[flang] Disabling REAL kinds must also disable their COMPLEX (#131353 ) When disabling kinds of REAL in the TargetCharacteristics, one must also disable the corresponding kinds of COMPLEX. Fixes https://github.com/llvm/llvm-project/issues/131088.	2025-03-19 12:00:51 -07:00
Peter Klausler	329bfa91b0	[flang] Fix crash in CO_REDUCE semantics (#131211 ) A std::optional<> value was being accessed without first ensuring its presence.	2025-03-19 12:00:23 -07:00
Peter Klausler	3f04fb42aa	[flang] Complete semantic checks for FORM TEAM (#131022 ) Add remaining checking for the FORM TEAM statement, complete and enable a test.	2025-03-19 11:59:59 -07:00
Peter Klausler	1dc397deed	[flang] Enforce control flow restrictions on CHANGE TEAM (#131013 ) Like DO CONCURRENT and CRITICAL constructs, control flow into and out of a CHANGE TEAM construct is disallowed.	2025-03-19 11:59:39 -07:00
Peter Klausler	abebac5b86	[flang] Dig deeper to find more EVENT_TYPE/LOCK_TYPE misuse (#130687 ) Only objects may have these types, or have potential subobject components with these types.	2025-03-19 11:59:18 -07:00
Peter Klausler	587f997db7	[flang] Catch C15104(4) violations when coindexing is present (#130677 ) The value of a structure constructor component can't have a pointer ultimate component if it is a coindexed designator.	2025-03-19 11:58:59 -07:00
Krzysztof Parzyszek	cd26dd5595	[flang][OpenMP] Use OmpDirectiveSpecification in simple directives (#131162 ) The `OmpDirectiveSpecification` contains directive name, the list of arguments, and the list of clauses. It was introduced to store the directive specification in METADIRECTIVE, and could be reused everywhere a directive representation is needed. In the long term this would unify the handling of common directive properties, as well as creating actual constructs from METADIRECTIVE by linking the contained directive specification with any associated user code.	2025-03-19 11:34:40 -05:00
Valentin Clement (バレンタインクレメン)	20feca47c1	[flang][cuda] Allow ieee_arithmetic on the device (#131930 ) - Allow ieee_arithmetic on the device - Add ignore_tkr(d) to ieee_is_finite	2025-03-19 07:20:06 -07:00
Kiran Chandramohan	96b112fb61	Revert "[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses." (#132005 ) Reverts llvm/llvm-project#120584 Reverting due to CI failure https://lab.llvm.org/buildbot/#/builders/157/builds/22946	2025-03-19 11:13:52 +00:00
swatheesh-mcw	ee8a759bfb	[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses. (#120584 ) Adds Parser and Semantic Support for the below construct and clauses: - Interop Construct - Init Clause - Use Clause Note: The other clauses supported by Interop Construct such as Destroy, Use, Depend and Device are added already.	2025-03-19 10:49:17 +00:00
Tom Eccles	e7c6e3557b	[flang][OpenMP] Fix threadprivate pointer variable in common block (#131888 ) Fixes #112538 The problem was that the host associated symbol for the threadprivate variable doesn't have all of the symbol attributes (e.g. POINTER). This caused the lowering code to generate the wrong type, eventually hitting an assertion.	2025-03-19 10:12:52 +00:00
jeanPerier	b8271ec8b3	[flang] accept character type in fir::changeTypeShape (#131892 ) There is no reason for character element type to be forbidden in this helper. The assert was firing in character pointer assignment in FORALL after #130772 added a usage of this helper.	2025-03-19 10:01:24 +01:00
Slava Zakharin	fd0e20a64b	[flang] Generate fir.pack/unpack_array in Lowering. (#131704 ) Basic generation of array repacking operations in Lowering.	2025-03-18 21:26:33 -07:00
Slava Zakharin	9ed772cecc	[flang] Fixed computation of position of function's arg in AddDebugInfo. (#131672 ) I am working on `-frepack-array` feature (#127147), which produces non-trivial manipulations with arguments of `fir.declare`. In this case, we end up with CFG computation of the `fir.declare` argument, and AddDebugInfo pass incorrectly mapped two dummy arguments to the same arg index in the debug attributes. This patch makes sure that we assign the arg index only if we can prove that we've traced the block argument to the function's entry block. I believe this problem is not specific to `-frepack-arrays`, e.g. it may appear due to MLIR inlining as well.	2025-03-18 16:46:59 -07:00
Valentin Clement (バレンタインクレメン)	b7ed5c8e06	[flang][cuda] Check for ignore_tkr(d) when resolving generic call (#131923 )	2025-03-18 15:39:04 -07:00
Slava Zakharin	e0bcf3aa0b	[flang] Allow no type parameters for fir.pack_array. (#131662 ) Arrays with assumed-length types are represented with a box without explicit length parameters. This patch fixes the verification to allow it for `fir.pack_array`.	2025-03-18 07:59:04 -07:00
Akash Banerjee	cbc5c11fec	[MLIR][OpenMP] Add Lowering support for implicitly linking to default declare mappers (#131006 )	2025-03-18 13:17:10 +00:00
Kareem Ergawy	83658ddb1b	[flang][OpenMP] Enable delayed privatization by default for `omp.distribute` (#131574 ) Switches delayed privatization for `omp.distribute` to be on by default: controlled by the `-openmp-enable-delayed-privatization` instead of by `-openmp-enable-delayed-privatization-staging`. ### GFortran & Fujitsu test suite results: #### gfotran test-suite (this PR): ``` Testing Time: 34.51s Passed: 6569 ``` #### Fujitsu without changes (commit: 0813c5cf5f52): ``` Testing Time: 155.39s Passed : 88325 Failed : 156 Executable Missing: 408 ``` #### Fujitsu with changes (this PR): ``` Testing Time: 158.54s Passed : 88325 Failed : 156 Executable Missing: 408 ```	2025-03-18 14:07:41 +01:00
Kareem Ergawy	1094ffcafb	[flang][fir] Add MLIR op for `do concurrent` (#130893 ) Adds new MLIR ops to model `do concurrent`. In order to make `do concurrent` representation self-contained, a loop is modeled using 2 ops, one wrapper and one that contains the actual body of the loop. For example, a 2D `do concurrent` loop is modeled as follows: ```mlir fir.do_concurrent { %i = fir.alloca i32 %j = fir.alloca i32 fir.do_concurrent.loop (%i_iv, %j_iv) = (%i_lb, %j_lb) to (%i_ub, %j_ub) step (%i_st, %j_st) { %0 = fir.convert %i_iv : (index) -> i32 fir.store %0 to %i : !fir.ref<i32> %1 = fir.convert %j_iv : (index) -> i32 fir.store %1 to %j : !fir.ref<i32> } } ``` The `fir.do_concurrent` wrapper op encapsulates both the actual loop and the allocations required for the iteration variables. The `fir.do_concurrent.loop` op is a multi-dimensional op that contains the loop control and body. See the ops' docs for more info.	2025-03-18 10:53:44 +01:00
Valentin Clement (バレンタインクレメン)	e5ec7bb21b	[flang][cuda] Set correct offsets for multiple variables in dynamic shared memory (#131674 )	2025-03-17 17:13:06 -07:00
Valentin Clement (バレンタインクレメン)	74d4fc0a3e	[flang][cuda][NFC] Use ssa value for offset in shared memory op (#131661 ) Switch from attribute to a value as we need to support dynamic offset when multiple variables are used with dynamic shared memory.	2025-03-17 14:23:34 -07:00
Kiran Chandramohan	93e0df07c2	[Flang][OpenMP] Allow zero trait score (#131473 )	2025-03-17 09:49:08 +00:00
sharang.12492	7eb8b73178	[Flang][OpenMP][taskloop] Adding missing semantic checks in Taskloop (#128431 ) Below semantic checks for Taskloop clause mentioned in OpenMP [5.2] specification were missing, this patch contains the semantic checks, corresponding error messages and test cases: OpenMP standard [5.2]: [12.6] Taskloop Construct [Restrictions] Restrictions to the taskloop construct are as follows: • The reduction-modifier must be default. • The conditional lastprivate-modifier must not be specified. Authored-by: shkaushi <sharang.kaushik@amd.com>	2025-03-17 12:35:37 +05:30
Valentin Clement (バレンタインクレメン)	4fde8c341f	[flang][cuda] Lower CUDA shared variable with cuf.shared_memory op (#131399 ) Use `cuf.shared_memory` operation instead of `cuf.alloc` for CUDA shared variable. These variables do not need free operations.	2025-03-16 17:44:56 -07:00
Valentin Clement (バレンタインクレメン)	e86081b6c2	[flang][cuda] Convert cuf.shared_memory operation to LLVM ops (#131396 ) Convert the operation to `llvm.addressof` operation with `llvm.getelementptr` with the appropriate offset.	2025-03-14 19:34:55 -07:00
Valentin Clement (バレンタインクレメン)	4fb20b85fd	[flang][cuda] Compute offset on cuf.shared_memory ops (#131395 ) Add a pass to compute the size of the shared memory (static shared memory) and the offsets of each variables to be placed in shared memory. The global representing the shared memory is also created during this pass. In case of dynamic shared memory, the global as a type of `!fir.array<0xi8>` and the size of the memory is set at kernel launch.	2025-03-14 19:34:35 -07:00
Valentin Clement (バレンタインクレメン)	4818623924	[flang][cuda] Add cuf.shared_memory operation (#131392 ) Introduce `cuf.shared_memory` operation. The operation is used to get the pointer in shared memory for a specific variable. The shared memory is materialized as a global in address space 3 and the different variables are pointing to it at different offset. Follow up patches will add lowering and conversion of this operation.	2025-03-14 15:43:25 -07:00
Valentin Clement (バレンタインクレメン)	a862b6deae	[flang][cuda] Lower shared global to the correct NVVM address space (#131368 ) Global with the CUDA shared data attribute needs to be lowered to llvm globals with the correct address space (3). Address space is set from the `mlir::NVVM::NVVMMemorySpace::kSharedMemorySpace` enum from `mlir/Dialect/LLVMIR/NVVMDialect.h`	2025-03-14 15:28:32 -07:00
Slava Zakharin	00f9c855fb	[flang] Added fir.is_contiguous_box and fir.box_total_elements ops. (#131047 ) These are helper operations to aid with expanding of fir.pack_array.	2025-03-14 08:25:05 -07:00
jeanPerier	3ff3b29dd6	[flang] lower remaining cases of pointer assignments inside forall (#130772 ) Implement handling of `NULL()` RHS, polymorphic pointers, as well as lower bounds or bounds remapping in pointer assignment inside FORALL. These cases eventually do not require updating hlfir.region_assign, lowering can simply prepare the new descriptor for the LHS inside the RHS region. Looking more closely at the polymorphic cases, there is not need to call the runtime, fir.rebox and fir.embox do handle the dynamic type setting correctly. After this patch, the last remaining TODO is the allocatable assignment inside FORALL, which like some cases here, is more likely an accidental feature given FORALL was deprecated in F2003 at the same time than allocatable components where added.	2025-03-14 10:51:46 +01:00
Michael Kruse	bddf24ddbd	[Flang] Add omp_lib dependency to check-flang (#130975 ) With `LLVM_ENABLE_RUNTIMES=openmp`, flang enables the OpenMP regression tests, but `check-flang` was not ensuring that the OpenMP requirements are built first. Fix by adding a `libomp-mod` to `flang-test-depends`. Adding `libomp-mod` to extra_targets is necessary because there is no target from openmp/ that is reachable from the parent bootstrapping-build. `ninja openmp` fails because openmp/ has no `openmp` target. `check-openmp` would also run the OpenMP tests and does not even build `omp_lib.mod`. `runtimes` would build all the runtimes, not just OpenMP. Also fix the misleading CMake configure status messages that suggest the only way to build omp_lib.mod/.h is `LLVM_ENABLE_PROJECTS=openmp`.	2025-03-14 09:24:28 +01:00
Valentin Clement (バレンタインクレメン)	369da8421c	[flang][cuda] Allow assumed-size declaration for SHARED variable (#130833 ) Avoid triggering an assertion for shared variable using the assumed-size syntax. ``` attributes(global) subroutine sharedstar() real, shared :: s(*) ! ok. dynamic shared memory. end subroutine ```	2025-03-13 11:06:17 -07:00
Tom Eccles	01aca42363	[flang] Add support for -f[no-]verbose-asm (#130788 ) This flag provides extra commentary in the assembly output.	2025-03-13 15:22:13 +00:00
Kareem Ergawy	b003face11	[flang][OpenMP] Add `OutlineableOpenMPOpInterface` to `omp.teams` (#131109 ) Given the following input: ```fortran program rep_loopbind implicit none integer :: i real :: priv_val !$omp teams private(priv_val) !$omp distribute do i=1,1000 end do !$omp end teams end program ``` the `AllocaOpConversion` pattern in `FIRToLLVMLowering` would move the private allocations that belong to the `teams` directive (i.e. the allocations needed for the private copies of `priv_val` and the loop's iteration variable) from the the `omp.teams` op to the outside scope. This is not correct since these allocations should be eventually emitted inside the outlined region for the `teams` directive. Without this fix, these allocation would be emitted in the parent function (or the parent scope whatever it is).	2025-03-13 16:03:19 +01:00
Michael Klemm	28ffa7f6a4	[flang][OpenMP] Fix missing missing inode issue (#130798 ) When outlining an offload region, Flang creates a unique name by querying an inode ID. However, when the name of the actual source file does not match the logical file in a `#line` preprocessor directive, code-gen was failing as it could not determine the inode ID. This PR checks for this condition and if the logical file name does not exist, the inode is replaced with a hash value created from the source code itself.	2025-03-13 15:50:37 +01:00
Mats Petersson	d0188ebcc2	[flang][OpenMP]Add symbls omp_in, omp_out and omp_priv in DECLARE RED… (#129908 ) …UCTION This patch allows better parsing of the reduction and initializer components, including supporting derived types in both those places. There is more work needed here, but this is a definite improvement in what can be handled through parser and semantics. Note that declare reduction is still not supported in lowering, so any attempt to compile DECLARE REDUCTION code will end with a TODO aka "Not yet implemented" abort in the compiler. Note that this version of the code does not cover declaring multiple reductions using the same name with different types. This is will be fixed in a future patch. [This was also the case before this change]. One existing test modified to actually compile (as it didn't in the original form).	2025-03-13 09:39:45 +00:00
Krzysztof Parzyszek	f4fc2d731c	[flang][OpenMP] Map ByRef if size/alignment exceed that of a pointer (#130832 ) Improve the check for whether a type can be passed by copy. Currently, passing by copy is done via the OMP_MAP_LITERAL mapping, which can only transfer as much data as can be contained in a pointer representation.	2025-03-12 19:41:11 -05:00
Iñaki Amatria Barral	bdbe8fa1f3	[flang] Align `-x` language modes with `gfortran` (#130268 ) This PR addresses some of the issues described in https://github.com/llvm/llvm-project/issues/127617. Key changes: - Stop assuming fixed-form for `-x f95` unless the input is a `.i` file. This change ensures compatibility with `-save-temps` workflows while preventing unintended fixed-form assumptions. - Ensure `-x f95-cpp-input` enables `-cpp` by default, aligning Flang's behavior with `gfortran`.	2025-03-12 16:45:33 +01:00
Asher Mancinelli	982527eef0	[flang] Use saturated intrinsics for floating point to integer conversions (#130686 ) The saturated floating point conversion intrinsics match the semantics in the standard more closely than the fptosi/fptoui instructions. Case 2 of 16.9.100 is > INT (A [, KIND]) > If A is of type real, there are two cases: if \|A\| < 1, INT (A) has the value 0; if \|A\| ≥ 1, INT (A) is the integer whose magnitude is the largest integer that does not exceed the magnitude of A and whose sign is the same as the sign of A. Currently, converting a floating point value into an integer type too small to hold the constant will be converted to poison in opt, leaving us with garbage: ``` > cat t.f90 program main real(kind=16) :: f integer(kind=4) :: i f=huge(f) i=f print , i end program main # current upstream > for i in `seq 10`; do; ./a.out; done -862156992 -1497393344 -739096768 -1649494208 1761228608 -1959270592 -746244288 -1629194432 -231217344 382322496 ``` With the saturated fptoui/fptosi intrinsics, we get the appropriate values ``` # mine > flang -O2 ./t.f90 && ./a.out 2147483647 > perl -e 'printf "%d\n", (2 * 31) - 1' 2147483647 ``` One notable difference: NaNs being converted to ints will become zero, unlike current flang (and some other compilers). Newer versions of GCC have this behavior.	2025-03-12 08:14:46 -07:00
Tom Eccles	c851ee38ad	[flang][OpenMP] catch namelist access through equivalence (#130804 ) The standard prohibits privatising namelist variables. We also decided in #110671 to prohibit reductions of namelist variables. This commit prevents this rule from being circumvented through the use of equivalence statements. Fixes #122824	2025-03-12 11:45:15 +00:00

1 2 3 4 5 ...

5559 Commits