llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-25 20:56:06 +00:00

Author	SHA1	Message	Date
Valentin Clement (バレンタインクレメン)	ae8dd63681	[flang][cuda] Add interface and lowering for all_sync (#134001 )	2025-04-01 17:59:11 -07:00
Andre Kuhlenschmidt	b6edd25f17	[flang][intrinsics] NFC: make comment consistent (#133972 ) Just makes this named argument comment consistent with all the others in the file.	2025-04-01 14:30:10 -07:00
Slava Zakharin	58551faaf1	[flang] Inline fir.is_contiguous_box in some cases. (#133812 ) Added inlining for `rank == 1` and `innermost` cases.	2025-04-01 08:41:11 -07:00
Jean-Didier PAILLEUX	513a91a5f1	[flang/flang-rt] Implement PERROR intrinsic form GNU Extension (#132406 ) Add the implementation of the `PERROR(STRING) ` intrinsic from the GNU Extension to prints on the stderr a newline-terminated error message corresponding to the last system error prefixed by `STRING`. (https://gcc.gnu.org/onlinedocs/gfortran/PERROR.html)	2025-04-01 15:47:54 +02:00
Tom Eccles	e17d864f55	[flang][OpenMP][Lower] lower array subscripts for task depend (#132994 ) The OpenMP standard says that all dependencies in the same set of inter-dependent tasks must be non-overlapping. This simplification means that the OpenMP only needs to keep track of the base addresses of dependency variables. This can be seen in kmp_taskdeps.cpp, which stores task dependency information in a hash table, using the base address as a key. This patch generates a rebox operation to slice boxed arrays, but only the box data address is used for the task dependency. The extra box is optimized away by LLVM at O3. Vector subscripts are TODO (I will address in my next patch). This also fixes a bug for ordinary subscripts when the symbol was mapped to a box: Fixes #132647	2025-04-01 10:26:14 +01:00
Jean-Didier PAILLEUX	bae3577002	[flang] Define ERF, ERFC and ERFC_SCALED intrinsics with Q and D prefix (#125217 ) `ERF`, `ERFC` and `ERFC_SCALED` intrinsics prefixed by `Q` and `D` are missing. Codes such as `CP2K`(https://github.com/cp2k/cp2k) and `TurboRVB`(https://github.com/sissaschool/turborvb) use these intrinsics just like defined in the GNU standard and here: https://www.ibm.com/docs/fr/xl-fortran-aix/16.1.0?topic=reference-intrinsic-procedures These intrinsics are based on the existing intrinsics but apply a restriction on the type kind. - `DERF`, `DERFC` and `DERFC_SCALED` are for double précision only. - `QERF`, `QERFC` and `QERFC_SCALED` are for quad précision only.	2025-04-01 08:07:26 +02:00
Thirumalai Shaktivel	091dcb8fc2	[Flang] Make a private copy for the common block variables in copyin clause (#111359 ) Fixes: https://github.com/llvm/llvm-project/issues/82949	2025-04-01 11:35:44 +05:30
Slava Zakharin	5f268d04f9	[flang] Code generation for fir.pack/unpack_array. (#132080 ) The code generation relies on `ShallowCopyDirect` runtime to copy data between the original and the temporary arrays (both directions). The allocations are done by the compiler generated code. The heap allocations could have been passed to `ShallowCopy` runtime, but I decided to expose the allocations so that the temporary descriptor passed to `ShallowCopyDirect` has `nocapture` - maybe this will be better for LLVM optimizations.	2025-03-31 11:42:17 -07:00
Slava Zakharin	0ac8cb1b3d	[flang] Recognize fir.pack_array in LoopVersioning. (#133191 ) This change enables LoopVersioning when `fir.pack_array` is met in the def-use chain. It fixes a couple of huge performance regressions caused by enabling `-frepack-arrays`.	2025-03-31 11:41:43 -07:00
Thirumalai Shaktivel	374a5bea52	[Flang][OpenMP] Add PointerAssociateScalar to Cray Pointer used in the DSA (#133232 ) Issue: Cray Pointer is not associated to Cray Pointee, leading to Segmentation fault Fix: GetUltimate, retrieves the base symbol in the current scope, which gets passed all the references and returns the original symbol --------- Co-authored-by: Michael Klemm <michael.klemm@amd.com>	2025-03-29 15:39:12 +01:00
swatheesh-mcw	fe30cf18ab	Revert "Revert "[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses."" (#132343 ) Reverts llvm/llvm-project#132005	2025-03-28 15:21:52 +00:00
Nick Sarnie	48b7530273	[clang][flang][Triple][llvm] Add isOffload function to LangOpts and isGPU function to Triple (#126956 ) I'm adding support for SPIR-V, so let's consolidate these checks. --------- Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>	2025-03-28 14:19:20 +00:00
Joseph Huber	772173f548	[Clang][AMDGPU] Remove special handling for COV4 libraries (#132870 ) Summary: When we were first porting to COV5, this lead to some ABI issues due to a change in how we looked up the work group size. Bitcode libraries relied on the builtins to emit code, but this was changed between versions. This prevented the bitcode libraries, like OpenMP or libc, from being used for both COV4 and COV5. The solution was to have this 'none' functionality which effectively emitted code that branched off of a global to resolve to either version. This isn't a great solution because it forced every TU to have this variable in it. The patch in https://github.com/llvm/llvm-project/pull/131033 removed support for COV4 from OpenMP, which was the only consumer of this functionality. Other users like HIP and OpenCL did not use this because they linked the ROCm Device Library directly which has its own handling (The name was borrowed from it after all). So, now that we don't need to worry about backward compatibility with COV4, we can remove this special handling. Users can still emit COV4 code, this simply removes the special handling used to make the OpenMP device runtime bitcode version agnostic.	2025-03-28 07:35:16 -05:00
Bruno Cardoso Lopes	7c3ecffe9b	[MLIR][LLVMIR] Add support for the full form of global_{ctor,dtor} (#133176 ) Currently only ctor/dtor list and their priorities are supported. This PR adds support for the missing data field. Few implementation notes: - The assembly printer has a fixed form because previous `attr_dict` will sort the dict by key name, making global_dtor and global_ctor differ in the order of printed arguments. - LLVM's `ptr null` is being converted to `#llvm.zero` otherwise we'd have to create a region to use the default operation conversion from `ptr null`, which is silly given that the field only support null or a symbol.	2025-03-27 14:11:05 -07:00
Andre Kuhlenschmidt	077940621d	[flang][openacc] Make OpenACC block construct parse errors less verbose. (#131042 ) This PR does reduces the verbosity of parser errors for OpenACC block constructs that do not parse correctly because they are missing their trailing end block directive by: - Removing the redundant error messages created by parsing 3 different styles of directive tokens. - Providing a general mechanism of configuring the max number of contexts printed for every syntax error. - Not printing less specific contexts that are at the same location. Prior to the changes: ``` $ flang -fc1 -fopenacc -fsyntax-only flang/test/Parser/acc-data-statement.f90 2>&1 \| tee acc-data-statement.prior.log \| wc -l 262 ``` [acc-data-statement.prior.log](https://github.com/user-attachments/files/19298165/acc-data-statement.prior.log) ``` $ flang -fc1 -fopenacc -fsyntax-only flang/test/Parser/acc-data-statement.f90 2>&1 \| tee acc-data-statement.prior.log \| wc -l 73 ``` [acc-data-statement.post.log](https://github.com/user-attachments/files/19298181/acc-data-statement.post.log)	2025-03-26 12:36:04 -07:00
Andre Kuhlenschmidt	3ab70e3f90	[Flang] Change sizeof argument name to "x" (#130189 ) This closes #128610 by fixing the name of the argument to the sizeof function to be "x" and adds a test.	2025-03-26 12:34:36 -07:00
Peter Klausler	3bc8aa7823	[flang] Catch whole assumed-size array as RHS (#132819 ) The right-hand side expression of an intrinsic assignment statement may not be the name of an assumed-size array dummy argument.	2025-03-26 12:09:57 -07:00
Peter Klausler	6df27dd42d	[flang] Fix missed case of symbol renaming in module file generation (#132475 ) The map of symbols requiring new local aliases for USE association needs to use the symbols' ultimate resolutions to avoid missing cases that can arise in convoluted codes with lots of confusing renamings. Fixes https://github.com/llvm/llvm-project/issues/132435.	2025-03-26 12:09:38 -07:00
Peter Klausler	4ea5aa09de	[flang][NFC] Restore I/O runtime API header name (#132423 ) flang/include/flang/Runtime/io-api.h was changed into io-api-consts.h, then wrapped into a new io-api.h that includes io-api-consts.h, does some redundant includes and declarations, and then declares the prototype of one function, InquiryKeywordHashDecode. Make that function static in io-stmt.cpp prior to its sole call site, then undo the renaming, to reduce confusion and redundancy.	2025-03-26 12:09:16 -07:00
Peter Klausler	38207a52a7	[flang] Test SYNC IMAGES, increase checking (#132279 ) Add a test for the SYNC IMAGES statement, and add a check for invalid image numbers.	2025-03-26 12:08:48 -07:00
Peter Klausler	f3991e10bb	[flang] Allow macro replacement in numeric kind suffix (#132120 ) When a numeric value has a kind suffix containing an identifier, allow macro replacement for that identifier by treating it as its own token. Fixes https://github.com/llvm/llvm-project/issues/131548.	2025-03-26 12:08:26 -07:00
Valentin Clement (バレンタインクレメン)	e6dda9c23a	[flang][cuda] Only create shared memory global when needed (#132999 )	2025-03-26 09:26:50 -07:00
Kajetan Puchalski	529c5b71c6	[flang] Add -f[no-]slp-vectorize flags (#132801 ) Add -f[no-]slp-vectorize to the flang driver. Add corresponding -fvectorize-slp to the flang frontend. Enable -fslp-vectorize at -O2 and higher in flang to match the current behaviour in clang. --------- Signed-off-by: Kajetan Puchalski <kajetan.puchalski@arm.com>	2025-03-26 16:10:35 +00:00
Slava Zakharin	613a077b05	[flang] Generate quadmath_wrapper.h for Flang Evaluate. (#132817 ) When building Flang with Clang, we need to do the same quadmath.h wrapping as we do for flang-rt. I extracted the CMake code into FlangCommon.cmake, and cleaned up the arguments passing to execute_process (note that `-###` was treated as `-` in the original code, because `#` starts a comment). I believe the Clang command does not require the input source file, so I removed it as well.	2025-03-25 12:08:38 -07:00
Eugene Epshteyn	2c8e26081f	[flang] Add HOSTNM runtime and lowering intrinsics implementation (#131910 ) Implement GNU extension intrinsic HOSTNM, both function and subroutine forms. Add HOSTNM documentation to `flang/docs/Intrinsics.md`. Add lowering and semantic unit tests. (This change is modeled after GETCWD implementation.)	2025-03-25 13:17:17 -04:00
vdonaldson	92e0560347	[flang] ieee_denorm (#132307 ) Add support for the nonstandard ieee_denorm exception for real kinds 3, 4, 8 on x86 processors.	2025-03-25 13:02:43 -04:00
Valentin Clement (バレンタインクレメン)	5be9082fed	[flang][cuda] Carry over the dynamic shared memory size to gpu.launch_func (#132837 )	2025-03-24 18:37:19 -07:00
Krzysztof Parzyszek	c221d64206	[flang] Remove mentions of evaluate::Variable<T> (#132805 ) The template itself was not defined anywhere. The closest thing was a forward declaration in flang/include/flang/Evaluate/variable.h.	2025-03-24 18:26:57 -05:00
Leandro Lupori	ef56f4b5a0	[flang][OpenMP] Fix reduction of arrays with non-default lower bounds (#132228 ) Using LoopNest's indices with ShapeShifts that have non-default lower bounds results in accesses to incorrect array elements. To avoid having to adjust each index, a ShapeShift with default lower bounds can be used instead. Fixes #131751	2025-03-24 09:48:41 -03:00
Kareem Ergawy	4fa5ab382e	[flang][OpenMP] Skip multi-block `teams` regions when processing `loop` directives (#132687 ) Fixes a regression when the generic `loop` directive conversion pass encounters a multi-block `teams` region. At the moment, we skip such regions.	2025-03-24 07:43:11 -05:00
Kareem Ergawy	6328506536	[flang][fir] Add rewrite pattern to convert `fir.do_concurrent` to `fir.do_loop` (#132207 ) Rewrites `fir.do_concurrent` ops to a corresponding nest of `fir.do_loop ... unordered` ops.	2025-03-24 12:09:32 +01:00
Kareem Ergawy	c031579cb7	[flang][OpenMP] Hoist reduction info from nested `loop` ops to parent `teams` ops (#132003 ) Fixes a bug in reductions when converting `teams loop` constructs with `reduction` clauses. According to the spec (v5.2, p340, 36): ``` The effect of the reduction clause is as if it is applied to all leaf constructs that permit the clause, except for the following constructs: * .... * The teams construct, when combined with the loop construct. ``` Therefore, for a combined directive similar to: `!$omp teams loop reduction(...)`, the earlier stages of the compiler assign the `reduction` clauses only to the `loop` leaf and not to the `teams` leaf. On the other hand, if we have a combined construct similar to: `!$omp teams distribute parallel do`, the `reduction` clauses are assigned both to the `teams` and the `do` leaves. We need to match this behavior when we convert `teams` op with a nested `loop` op since the target set of constructs/ops will be incorrect without moving the reductions up to the `teams` op as well. This PR introduces a pattern that does exactly this. Given the following input: ``` omp.teams { omp.loop reduction(@red_sym %red_op -> %red_arg : !fir.ref<i32>) { omp.loop_nest ... { ... } } } ``` this pattern updates the `omp.teams` op in-place to: ``` omp.teams reduction(@red_sym %red_op -> %teams_red_arg : !fir.ref<i32>) { omp.loop reduction(@red_sym %teams_red_arg -> %red_arg : !fir.ref<i32>) { omp.loop_nest ... { ... } } } ``` Note the following: * The nested `omp.loop` is not rewritten by this pattern, this happens through `GenericLoopConversionPattern`. * The reduction info are cloned from the nested `omp.loop` op to the parent `omp.teams` op. * The reduction operand of the `omp.loop` op is updated to be the new reduction block argument of the `omp.teams` op.	2025-03-21 14:12:15 -05:00
Michael Kruse	123eb75cd4	[Flang] Do not emit numeric_storage_size into object file (#131463 ) The value of numeric_storage_size depends on compilation options and therefore its value is not yet known when building the builtins runtime. Instead, the parameter is folding a __numeric_storage_size() expression which is loaded into the user program. For the iso_fortran_env object file, omit the symbol as it is never used. Similar tests that ensure that __numeric_storage_size() is not folded until compiling the actual user program exist in FortranEvalutate: `1e6ba3cd2f/flang/lib/Evaluate/check-expression.cpp (L487-L492)` `1e6ba3cd2f/flang/lib/Evaluate/fold-integer.cpp (L1457-L1460)` Required for using CMake to compile the builtin module files. See RFC at https://discourse.llvm.org/t/rfc-building-flangs-builtin-mod-files/84626	2025-03-21 12:32:54 +01:00
Tom Eccles	3bcab6f20a	[flang][OpenMP][Semantics] improve semantic checks for array sections (#132230 ) I'm not sure why strides were not allowed in array sections: the stride is explicitly allowed by the standard from the first version where array sections were introduced. The limitation is that the stride must not be negative. Here I have added the check for a negative stride and updated the test for a zero length section to take account of the stride.	2025-03-21 10:58:44 +00:00
jeanPerier	44261dae5b	[flang][NFC] use hlfir.declare first result when both results are raw pointers (#132261 ) Currently, the helpers to get fir::ExtendedValue out of hlfir::Entity use hlfir.declare second result (`#1`) in most cases. This is because this result is the same as the input and matches what FIR was getting before lowering to HLFIR. But this creates odd situations when both hlfir.declare are raw pointers and either result ends-up being used in the IR depending on whether the code was generated by a helper using fir::ExtendedValue, or via "pure HLFIR" helpers using the first result. This will typically prevent simple CSE and easy identification that two operation (e.g load/store) are touching the exact same memory location without using alias analysis or "manual detection" (looking for common hlfir.declare defining op). Hence, when hlfir.declare results are both raw pointers, use `#0` when producing `fir::ExtendedValue`. When `#0` is a fir.box, keep using `#1` because these are not the same. The only code change is in HLFIRTools.cpp and is pretty small, but there is a big test fallout of `#1` to `#0`.	2025-03-21 11:41:04 +01:00
Sergio Afonso	b231f6f862	[MLIR][OpenMP] Improve omp.map.info verification (#132066 ) This patch makes the `map_type` and `map_capture_type` arguments of the `omp.map.info` operation required, which was already an invariant being verified by its users via `verifyMapClause()`. This makes it clearer, as getters no longer return misleading `std::optional` values. Checks for the `mapper_id` argument are moved to a verifier for the operation, rather than being checked by users. Functionally NFC, but not marked as such due to a reordering of arguments in the assembly format of `omp.map.info`.	2025-03-20 15:48:45 +00:00
Krzysztof Parzyszek	68180d8d16	[flang][OpenMP] Use OmpDirectiveSpecification in standalone directives (#131163 ) This uses OmpDirectiveSpecification in the rest of the standalone directives.	2025-03-20 06:50:43 -05:00
Slava Zakharin	2c91f10362	[flang] Fixed repacking for TARGET and INTENT(OUT) (#131972 ) TARGET dummy arrays can be accessed indirectly, so it is unsafe to repack them. INTENT(OUT) dummy arrays that require finalization on entry to their subroutine must be copied-in by `fir.pack_arrays`. In addition, based on my testing results, I think it will be useful to document that `LOC` and `IS_CONTIGUOUS` will have different values for the repacked arrays. I still need to decide where to document this, so just added a note in the design doc for the time being.	2025-03-19 17:12:32 -07:00
Peter Klausler	6b9716b7f4	[flang] Catch bad usage case of whole assumed-size array (#132052 ) Whole assumed-size arrays are generally not allowed outside specific contexts, where expression analysis notes that they can appear. But contexts can nest, and in the case of an actual argument that turns out to be an array constructor, the permission to use a whole assumed-size array must be rescinded. Fixes https://github.com/llvm/llvm-project/issues/131909.	2025-03-19 12:02:34 -07:00
Peter Klausler	7f7d7d552b	[flang] Use local name for structure constructor (#132047 ) When reinterpreting an ambiguously parsed function reference as a structure constructor, use the original symbol of the type in the representation of the derived type spec of the structure constructor, not its ultimate resolution. The distinction turns out to matter when generating module files containing derived type constants as initializers when the derived types' names have undergone USE association renaming. Fixes https://github.com/llvm/llvm-project/issues/131579.	2025-03-19 12:02:03 -07:00
Peter Klausler	b99dab2587	[flang] Exempt construct entities from SAVE check for PURE (#131383 ) A PURE subprogram can't have a local variable with the SAVE attribute. An ASSOCIATE or SELECT TYPE construct entity whose selector is a variable will return true from IsSave(); exclude them from the local variable check. Fixes https://github.com/llvm/llvm-project/issues/131356.	2025-03-19 12:01:18 -07:00
Peter Klausler	329bfa91b0	[flang] Fix crash in CO_REDUCE semantics (#131211 ) A std::optional<> value was being accessed without first ensuring its presence.	2025-03-19 12:00:23 -07:00
Peter Klausler	3f04fb42aa	[flang] Complete semantic checks for FORM TEAM (#131022 ) Add remaining checking for the FORM TEAM statement, complete and enable a test.	2025-03-19 11:59:59 -07:00
Peter Klausler	1dc397deed	[flang] Enforce control flow restrictions on CHANGE TEAM (#131013 ) Like DO CONCURRENT and CRITICAL constructs, control flow into and out of a CHANGE TEAM construct is disallowed.	2025-03-19 11:59:39 -07:00
Peter Klausler	abebac5b86	[flang] Dig deeper to find more EVENT_TYPE/LOCK_TYPE misuse (#130687 ) Only objects may have these types, or have potential subobject components with these types.	2025-03-19 11:59:18 -07:00
Peter Klausler	587f997db7	[flang] Catch C15104(4) violations when coindexing is present (#130677 ) The value of a structure constructor component can't have a pointer ultimate component if it is a coindexed designator.	2025-03-19 11:58:59 -07:00
Sergio Afonso	ac9e4e9b33	[Flang][OpenMP] Simplify entry block creation for BlockArgOpenMPOpInterface ops, NFC (#132036 ) This patch adds the `OpWithBodyGenInfo::blockArgs` field and updates `createBodyOfOp()` to prevent the need for `BlockArgOpenMPOpInterface` operations to pass the same callback, minimizing chances of introducing inconsistent behavior.	2025-03-19 17:29:40 +00:00
Krzysztof Parzyszek	cd26dd5595	[flang][OpenMP] Use OmpDirectiveSpecification in simple directives (#131162 ) The `OmpDirectiveSpecification` contains directive name, the list of arguments, and the list of clauses. It was introduced to store the directive specification in METADIRECTIVE, and could be reused everywhere a directive representation is needed. In the long term this would unify the handling of common directive properties, as well as creating actual constructs from METADIRECTIVE by linking the contained directive specification with any associated user code.	2025-03-19 11:34:40 -05:00
Valentin Clement (バレンタインクレメン)	20feca47c1	[flang][cuda] Allow ieee_arithmetic on the device (#131930 ) - Allow ieee_arithmetic on the device - Add ignore_tkr(d) to ieee_is_finite	2025-03-19 07:20:06 -07:00
Kiran Chandramohan	96b112fb61	Revert "[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses." (#132005 ) Reverts llvm/llvm-project#120584 Reverting due to CI failure https://lab.llvm.org/buildbot/#/builders/157/builds/22946	2025-03-19 11:13:52 +00:00

1 2 3 4 5 ...

7476 Commits