llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-18 23:56:49 +00:00

Author	SHA1	Message	Date
Brad Smith	9b7a7e4b9e	[OpenMP] Add support for Haiku (#133034 ) Co-authored-by: Jérôme Duval <jerome.duval@gmail.com>	2025-03-26 15:16:55 -04:00
Josep Pinot	cd6b7448d5	Revert "Update OpenMP runtime to adopt taskgraph clause from 6.0 Specs" (#131571 )	2025-03-21 10:11:36 +01:00
Sergio Afonso	6085f3f6a8	[OpenMP] Address __kmp_dist_for_static_init issue (#129902 ) This patch attempts to provide a fix for an issue that appears when the `__kmp_dist_for_static_init` function is called from a serialized team. This is triggered by code generated by flang for `distribute parallel do` constructs whenever an `if` clause for the `parallel` leaf construct is present. This results in the introduction of a call to `__kmpc_fork_call_if` in place of `__kmpc_fork_call`. When it evaluates to `false`, it defers execution to `__kmp_serialized_parallel`, which creates a new serial team that is picked up by `__kmp_dist_for_static_init`, resulting in an incorrect `team` pointer that causes the `nteams == (kmp_uint32)team->t.t_parent->t.t_nproc` assertion to fail. The sequence of calls replicating this issue can be summarized as: - `__kmpc_fork_teams` - `__kmpc_fork_call_if` - `__kmpc_dist_for_static_init_` Since I am not familiar with the implementation of the OpenMP runtime, it is possible that the above sequence of calls is incorrect, or that the bug can be better fixed in another way, so I am open to discussing this. The following Fortran program can be compiled with flang to show the issue:
```f90
! Compile and run: flang -fopenmp test.f90 -o test && ./test
! Check LLVM IR: flang -fc1 -emit-llvm -fopenmp test.f90 -o -
program main
  implicit none
  integer, parameter :: n = 10
  integer :: i, idx(n)

  !$omp teams
  !$omp distribute parallel do if(.false.)
  do i=1,n
    idx(i) = i
  end do
  !$omp end teams

  print *, idx
end program
```
	2025-03-17 11:44:29 +00:00
Josep Pinot	77ad061923	[OpenMP] Update OpenMP runtime to adopt taskgraph clause from 6.0 Specs (#130751 ) Updating OpenMP runtime taskgraph support (record/replay mechanism): - Adds a `graph_reset` bit in `kmp_taskgraph_flags_t` to discard existing TDG records. - Switches from a strict index-based TDG ID/IDX to a more flexible integer-based ID, which can be any integer (e.g. hashed). - Adds helper functions like `__kmp_find_tdg`, `__kmp_alloc_tdg`, and `__kmp_free_tdg` to manage TDGs by their IDs. These changes pave the way for the integration of OpenMP taskgraph (spec 6.0). Taskgraphs are still recorded in an array with a lookup efficiency reduced to O(n), where n ≤ `__kmp_max_tdgs`. This can be optimized by moving the TDGs to a hashtable, making lookups more efficient. The provided helper routines facilitate easier future optimizations.	2025-03-14 08:02:23 +01:00
Rémy Neveu	3aa96f52cf	[OpenMP] [Taskgraph] Differentiating task ids from the taskgraph and from the debugger (#130660 ) This PR creates a new member for task data, which is used to identify the task in its taskgraph (when ompx taskgraph is enabled). It aims to remove the overloading of the td_task_id member, which was used both by the debugger and the taskgraph. This overloading made the identifier non-unique in the case of multiple taskgraphs. Co-authored-by: Rémy Neveu <rem2007@free.fr>	2025-03-12 11:39:02 -07:00
Omair Javaid	3a41c7b483	[OpenMP] Mark Failing OpenMP Tests as XFAIL on Windows (#129040 ) This patch marks specific OpenMP runtime tests as XFAIL on Windows due to failures reported in #129023	2025-03-10 19:23:10 +05:00
foxtran	3f48d34dff	[OpenMP][runtime] Fix comparison of integer expressions of different signedness (#128204 ) This PR fixes a warning that occurs when the OpenMP runtime is compiled with GCC: ``` warning: comparison of integer expressions of different signedness: 'kmp_intptr_t' {aka 'long int'} and 'long unsigned int' [-Wsign-compare] ```	2025-03-03 08:54:57 +01:00
Joachim	12a9e2adc3	[OpenMP][OMPT][OMPD] Fix frame flags for OpenMP tool APIs (#114118 ) In several cases the flags entries in ompt_frame_t are not initialized. According to @jdelsign the address provided as reenter and exit address is the canonical frame address (cfa) rather than a "framepointer". This patch makes sure that the flags entry is always initialized and changes the value from ompt_frame_framepointer to ompt_frame_cfa. The assertion in the tests makes sure that the flags are always set, when a tool (callback.h in this case) looks at the value. Fixes #89058	2025-02-27 18:47:57 +01:00
Jonathan Peyton	1c4e9863fa	[OpenMP][NFC] Remove unused debug lock (#127928 )	2025-02-20 08:47:59 -06:00
Jonathan Peyton	851177c2e3	[OpenMP][NFC] Remove unused __kmp_dispatch_lock global (#127686 )	2025-02-19 13:32:00 -06:00
Jonathan Peyton	b1f882f86a	[OpenMP][NFC] Remove unused clock function types and globals (#127684 )	2025-02-19 13:31:40 -06:00
Brad Smith	0b8bd472b0	[OpenMP][libomp] Add OpenBSD, NetBSD and DragonFly stdarg handling (#126182 ) Fixes build on OpenBSD/aarch64.
```
FAILED: openmp/runtime/src/CMakeFiles/omp.dir/kmp_runtime.cpp.o
/home/brad/tmp/llvm-build/bin/clang++ --target=aarch64-unknown-openbsd7.6 -D_DEBUG -D_GLIBCXX_ASSERTIONS -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Domp_EXPORTS -I/home/brad/tmp/llvm-build/runtimes/runtimes-bins/openmp/runtime/src -I/home/brad/tmp/llvm-brad/openmp/runtime/src -I/home/brad/tmp/llvm-brad/openmp/runtime/src/i18n -I/home/brad/tmp/llvm-brad/openmp/runtime/src/include -I/home/brad/tmp/llvm-brad/openmp/runtime/src/thirdparty/ittnotify -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -Wctad-maybe-unsupported -fdiagnostics-color -ffunction-sections -fdata-sections -Wall -fcolor-diagnostics -Wcast-qual -Wformat-pedantic -Wimplicit-fallthrough -Wsign-compare -Wno-extra -Wno-pedantic -fno-semantic-interposition -fdata-sections -O3 -DNDEBUG -std=c++17 -fPIC -D _GNU_SOURCE -D _REENTRANT -U_GLIBCXX_ASSERTIONS -UNDEBUG -fno-exceptions -fno-rtti -Wno-covered-switch-default -Wno-frame-address -Wno-strict-aliasing -Wno-switch -Wno-uninitialized -Wno-return-type-c-linkage -Wno-cast-qual -Wno-int-to-void-pointer-cast -MD -MT openmp/runtime/src/CMakeFiles/omp.dir/kmp_runtime.cpp.o -MF openmp/runtime/src/CMakeFiles/omp.dir/kmp_runtime.cpp.o.d -o openmp/runtime/src/CMakeFiles/omp.dir/kmp_runtime.cpp.o -c /home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1449:47: error: value of type 'kmp_va_list' (aka '__builtin_va_list') is not contextually convertible to 'bool'
 1449 |   return (master_th->th.th_teams_microtask && ap &&
      |                                               ^~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1449:44: error: invalid operands to binary expression ('microtask_t' (aka 'void (*)(int *, int *, ...)') and 'kmp_va_list' (aka '__builtin_va_list'))
 1449 |   return (master_th->th.th_teams_microtask && ap &&
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^  ~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1457:15: warning: comparison between NULL and non-pointer ('kmp_va_list' (aka '__builtin_va_list') and NULL) [-Wnull-arithmetic]
 1457 |   return ((ap == NULL && active_level == 0) ||
      |            ~~ ^  ~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1457:15: error: invalid operands to binary expression ('kmp_va_list' (aka '__builtin_va_list') and 'long')
 1457 |   return ((ap == NULL && active_level == 0) ||
      |            ~~ ^  ~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1458:12: error: value of type 'kmp_va_list' (aka '__builtin_va_list') is not contextually convertible to 'bool'
 1458 |           (ap && teams_level > 0 && teams_level == level));
      |            ^~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1458:15: error: invalid operands to binary expression ('kmp_va_list' (aka '__builtin_va_list') and 'bool')
 1458 |           (ap && teams_level > 0 && teams_level == level));
      |            ~~ ^  ~~~~~~~~~~~~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1735:9: error: invalid argument type 'kmp_va_list' (aka '__builtin_va_list') to unary expression
 1735 |     if (!ap) {
      |         ^~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:2169:66: warning: comparison between NULL and non-pointer ('kmp_va_list' (aka '__builtin_va_list') and NULL) [-Wnull-arithmetic]
 2169 |             !(microtask == (microtask_t)__kmp_teams_master || ap == NULL))
      |                                                               ~~ ^  ~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:2169:66: error: invalid operands to binary expression ('kmp_va_list' (aka '__builtin_va_list') and 'long')
 2169 |             !(microtask == (microtask_t)__kmp_teams_master || ap == NULL))
      |                                                               ~~ ^  ~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:2284:9: error: value of type 'kmp_va_list' (aka '__builtin_va_list') is not contextually convertible to 'bool'
 2284 |     if (ap) {
      |         ^~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:2302:58: error: invalid argument type 'kmp_va_list' (aka '__builtin_va_list') to unary expression
 2302 |     __kmp_fork_team_threads(root, team, master_th, gtid, !ap);
      |                                                          ^~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:2363:9: error: value of type 'kmp_va_list' (aka '__builtin_va_list') is not contextually convertible to 'bool'
 2363 |     if (ap) {
      |         ^~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:7803:3: error: no matching function for call to '__kmp_fork_call'
 7803 |   __kmp_fork_call(loc, gtid, fork_context_intel, team->t.t_argc,
      |   ^~~~~~~~~~~~~~~
/home/brad/tmp/llvm-brad/openmp/runtime/src/kmp_runtime.cpp:1927:5: note: candidate function not viable: no known conversion from 'long' to 'kmp_va_list' (aka '__builtin_va_list') for 7th argument
 1927 | int __kmp_fork_call(ident_t *loc, int gtid,
      |     ^
 1928 |                     enum fork_context_e call_context, // Intel, GNU, ...
 1929 |                     kmp_int32 argc, microtask_t microtask, launch_t invoker,
 1930 |                     kmp_va_list ap) {
      |                     ~~~~~~~~~~~~~~
2 warnings and 11 errors generated.
```
	2025-02-17 17:46:02 -05:00
Julian Brown	2fdf191e24	[OpenMP] Fix crash with task stealing and task dependencies (#126049 ) This patch series demonstrates and fixes a bug that causes crashes with OpenMP 'taskwait' directives in heavily multi-threaded scenarios. TLDR: The early return from __kmpc_omp_taskwait_deps_51 missed the synchronization mechanism in place for the late return. Additional debug assertions check for the implied invariants of the code. @jpeyton52 found the timing hole as this sequence of events:
> 1. THREAD 1: A regular task with dependences is created, call it T1
> 2. THREAD 1: Call into `__kmpc_omp_taskwait_deps_51()` and create a stack based depnode (`NULL` task), call it T2 (stack)
> 3. THREAD 2: Steals task T1 and executes it getting to `__kmp_release_deps()` region.
> 4. THREAD 1: During processing of dependences for T2 (stack) (within `__kmp_check_deps()` region), a link is created T1 -> T2. This increases T2's (stack) `nrefs` count.
> 5. THREAD 2: Iterates through the successors list: decrement the T2's (stack) npredecessor count. BUT HASN'T YET `__kmp_node_deref()`-ed it.
> 6. THREAD 1: Now when finished with `__kmp_check_deps()`, it returns false because npredecessor count is 0, but T2's (stack) `nrefs` count is 2 because THREAD 2 still references it!
> 7. THREAD 1: Because `__kmp_check_deps()` returns false, early exit.
>
> _Now the stack based depnode is invalid, but THREAD 2 still references it._
>
> We've reached improper stack referencing behavior. Varied results/crashes/asserts can occur if THREAD 1 comes back and recreates the exact same depnode in the exact same stack address during the same time THREAD 2 calls `__kmp_node_deref()`.
	2025-02-14 10:55:59 +01:00
Matt	a1826b4d26	[OpenMP][SIMD][FIX] Use conservative "omp simd ordered" lowering (#126172 ) A proposed fix for the issue #95611, [OpenMP][SIMD] ordered has no effect in a loop SIMD region as of LLVM 18.1.0
Changes:
- Implement new lowering behavior: Conservatively serialize "omp simd" loops that have `omp simd ordered` directive to prevent incorrect vectorization (which results in incorrect execution behavior of the miscompiled program).
Implementation outline:
- We start with the optimistic default initial value of `LoopStack.setParallel(/*Enable=*/true);` in `CodeGenFunction::EmitOMPSimdInit(const OMPLoopDirective &D)`.
- We only disable the loop parallel memory access assumption with `if (HasOrderedDirective) LoopStack.setParallel(/*Enable=*/false);` using the `HasOrderedDirective` (which tests for the presence of an `OMPOrderedDirective`).
- This results in no longer incorrectly vectorizing the loop when the `omp simd ordered` directive is present.
Motivation: We'd like to prevent incorrect vectorization of the loops marked with the `#pragma omp ordered simd` directive which has previously resulted in miscompiled code. At the same time, we'd like the usage outside of the `#pragma omp ordered simd` context to remain unaffected: Note that in the test "clang/test/OpenMP/ordered_codegen.cpp" we only "lose" the `!llvm.access.group` metadata in `foo_simd` alone. This is conservative, in that it's possible some of the loops would be possible to vectorize, but we prefer to avoid miscompilation of the loops that are currently illegal to vectorize. A concrete example follows:
```cpp
// "test.c"
#include <float.h>
#include <math.h>
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

int compare_float(float x1, float x2, float scalar) {
  const float diff = fabsf(x1 - x2);
  x1 = fabsf(x1);
  x2 = fabsf(x2);
  const float l = (x2 > x1) ? x2 : x1;
  if (diff <= l * scalar * FLT_EPSILON)
    return 1;
  else
    return 0;
}

#define ARRAY_SIZE 256

__attribute__((noinline)) void initialization_loop(
    float X[ARRAY_SIZE][ARRAY_SIZE], float Y[ARRAY_SIZE][ARRAY_SIZE]) {
  const float max = 1000.0;
  srand(time(NULL));
  for (int r = 0; r < ARRAY_SIZE; r++) {
    for (int c = 0; c < ARRAY_SIZE; c++) {
      X[r][c] = ((float)rand() / (float)(RAND_MAX)) * max;
      Y[r][c] = X[r][c];
    }
  }
}

__attribute__((noinline)) void omp_simd_loop(float X[ARRAY_SIZE][ARRAY_SIZE]) {
  for (int r = 1; r < ARRAY_SIZE; ++r) {
    for (int c = 1; c < ARRAY_SIZE; ++c) {
#pragma omp simd
      for (int k = 2; k < ARRAY_SIZE; ++k) {
#pragma omp ordered simd
        X[r][k] = X[r][k - 2] + sinf((float)(r / c));
      }
    }
  }
}

__attribute__((noinline)) int comparison_loop(float X[ARRAY_SIZE][ARRAY_SIZE],
                                              float Y[ARRAY_SIZE][ARRAY_SIZE]) {
  int totalErrors_simd = 0;
  const float scalar = 1.0;
  for (int r = 1; r < ARRAY_SIZE; ++r) {
    for (int c = 1; c < ARRAY_SIZE; ++c) {
      for (int k = 2; k < ARRAY_SIZE; ++k) {
        Y[r][k] = Y[r][k - 2] + sinf((float)(r / c));
      }
    }
    // check row for simd update
    for (int k = 0; k < ARRAY_SIZE; ++k) {
      if (!compare_float(X[r][k], Y[r][k], scalar)) {
        ++totalErrors_simd;
      }
    }
  }
  return totalErrors_simd;
}

int main(void) {
  float X[ARRAY_SIZE][ARRAY_SIZE];
  float Y[ARRAY_SIZE][ARRAY_SIZE];
  initialization_loop(X, Y);
  omp_simd_loop(X);
  const int totalErrors_simd = comparison_loop(X, Y);
  if (totalErrors_simd) {
    fprintf(stdout, "totalErrors_simd: %d \n", totalErrors_simd);
    fprintf(stdout, "%s : %d - FAIL: error in ordered simd computation.\n",
            __FILE__, __LINE__);
  } else {
    fprintf(stdout, "Success!\n");
  }
  return totalErrors_simd;
}
```
Before:
```
$ clang -fopenmp-simd -O3 -ffast-math -lm test.c -o test && ./test
totalErrors_simd: 15408
test.c : 76 - FAIL: error in ordered simd computation.
```
clang 19.1.0: https://godbolt.org/z/6EvhxqEhe
After:
```
$ clang -fopenmp-simd -O3 -ffast-math test.c -o test && ./test
Success!
```
Co-authored-by: Matt P. Dziubinski <matt-p.dziubinski@hpe.com>	2025-02-12 08:53:47 -05:00
Brad Smith	52a02b6d1e	[openmp] Fix for 32-bit PowerPC (#126412 )	2025-02-10 04:04:26 -05:00
Joseph Huber	daefb1b012	[OpenMP] Make `omp.h` work when compiled with `-ffreestanding` (#125618 ) Summary: Freestanding builds have `stddef.h` and `stdint.h` but not `stdlib.h`. We don't actually use any `stdlib.h` definitions in the OpenMP headers, and some definitions from this header are usable without the OpenMP runtime (allocators) so we should be able to do this. This ignores the include if possible, removing the implicit include would possibly break some applications so it stays here.	2025-02-04 06:48:23 -06:00
Christian Clauss	89c5576ff9	OpenMP: Fix Python 3 SyntaxErrors (#123940 ) 1. `print()` is a function in Python 3. 2. New-style exceptions are required in Python 3.	2025-01-27 14:45:54 -08:00
Nikita Popov	90a05f3216	[openmp] Support CET in z_Linux_asm.S (#123213 ) When libomp is built with -cf-protection, add endbr instructions to the start of functions for Intel CET support.	2025-01-17 09:26:49 +01:00
Kelvin Li	01ee66ea62	[flang][OMP] change malloc.h to stdlib.h in collapse_test.inc (NFC) (#122711 )	2025-01-13 14:44:29 -05:00
Brad Smith	ec27eb8c6b	[OpenMP] Fix interoperability test compilation on OpenBSD (#119053 )	2024-12-09 11:13:37 -05:00
Christian Oliveros	05bcf83c5c	[OpenMP][Build][Wasm][116552] Fixed build problem when compiling with Emscripten on Windows (#116874 )	2024-11-20 07:40:21 -05:00
Martin Storsjö	dc3156d8e6	[OpenMP] Don't hardcode _WIN32_WINNT for MinGW targets (#115708 ) Instead respect what the toolchain default is (or what the user sets via CMAKE_CXX_FLAGS). This fixes builds with libcxx, with mingw toolchains targeting msvcrt.dll, after 5d8be4c036aa5ce4a94f1f37a9155d5c877e23db; after that commit, the libcxx public headers reference symbols such as iswspace_l, which are unavailable when targeting msvcrt.dll on older versions of Windows (it's only available in msvcrt.dll since Windows Vista).	2024-11-16 11:23:15 +02:00
Daniel Chen	d3d8103d53	[OpenMP] Using `SimpleVLA` to handle vla usage in ompt-general.cpp. (#114583 ) The `openmp` runtime failed to build on LoP with LLVM18 due to the addition of `-Wvla-cxx-extension`:
```
llvm-project/openmp/runtime/src/ompt-general.cpp:711:15: error: variable length arrays in C++ are a Clang extension [-Werror,-Wvla-cxx-extension]
  711 |   int tmp_ids[ids_size];
      |               ^~~~~~~~
llvm-project/openmp/runtime/src/ompt-general.cpp:711:15: note: function parameter 'ids_size' with unknown value cannot be used in a constant expression
llvm-project/openmp/runtime/src/ompt-general.cpp:704:65: note: declared here
  704 | OMPT_API_ROUTINE int ompt_get_place_proc_ids(int place_num, int ids_size,
      |                                                                 ^
1 error generated.
```
This patch is to ignore the checking against this usage.	2024-11-04 12:42:16 -05:00
c8ef	b57b3f6425	[NFC] Simple typo correction. (#114548 )	2024-11-02 00:40:57 +08:00
Ye Luo	eccdb24894	[OpenMP] Create versioned libgomp softlinks (#112973 ) Add libgomp.1.dylib for MacOS and libgomp.so.1 for Linux. Linkers on Mac and Linux pick up versioned libgomp dynamic library files. The existing softlinks (libgomp.dylib for MacOS and libgomp.so for Linux) are insufficient. This helps alleviate the issue of mixing libgomp and libomp at runtime.	2024-10-25 13:19:58 -05:00
Shilei Tian	5d07162bba	[OpenMP] Fix the test issue when `libomp` is built as a static library (#113522 )	2024-10-24 12:52:17 -04:00
Luke Drummond	b55c52c047	Revert "Renormalize line endings whitespace only after dccebddb3b80" This reverts commit 9d98acb196a40fee5229afeb08f95fd36d41c10a.	2024-10-18 21:16:50 +01:00
Josep Pinot	af1e9c81f4	[OpenMP] Fix missing gtid argument in __kmp_print_tdg_dot function (#111986 ) This patch modifies the signature of the `__kmp_print_tdg_dot` function in `kmp_tasking.cpp` to include the global thread ID (gtid) as an argument. The gtid is now correctly passed to the function. - Updated the function declaration to accept the gtid parameter. - Modified all calls to `__kmp_print_tdg_dot` to pass the correct gtid value. This change addresses issues encountered when compiling with `OMPX_TASKGRAPH` enabled. No functional changes are expected beyond successful compilation.	2024-10-17 10:01:28 -04:00
Luke Drummond	9d98acb196	Renormalize line endings whitespace only after dccebddb3b80 Line ending policies were changed in the parent, dccebddb3b80. To make it easier to resolve downstream merge conflicts after line-ending policies are adjusted this is a separate whitespace-only commit. If you have merge conflicts as a result, you can simply `git add --renormalize -u && git merge --continue` or `git add --renormalize -u && git rebase --continue` - depending on your workflow.	2024-10-17 14:49:26 +01:00
Nikita Popov	4722c6b87c	[openmp] Use core_siblings_list if physical_package_id not available (#111831 ) On powerpc, physical_package_id may not be available. Currently, this causes openmp to fall back to flat topology and various affinity tests fail. Fix this by parsing core_siblings_list to determine which cpus belong to the same socket. This matches what the testing code does. The code to parse the CPU list format thankfully already exists. Fixes https://github.com/llvm/llvm-project/issues/111809.	2024-10-14 09:23:41 +02:00
Xing Xue	c62e61acb4	[libomp][AIX] Use SO version "1" for AIX libomp (#111384 ) For `libomp` on AIX, we build shared object `libomp.so` first and then archive it into libomp.a. This patch changes to use SO version `1` and name the shared object `libomp.so.1` so that it is consistent with the naming of other shared objects in AIX libraries, e.g., `libc++.so.1` in `libc++.a`. With this change, the change made in commit bde51d9b0d473447ea12fb14924f14ea167eec85 to ensure only `libomp.a` is published on AIX is no longer necessary and is removed.	2024-10-08 06:04:13 -04:00
Xing Xue	bde51d9b0d	[libomp][AIX] Ensure only libomp.a is published on AIX (#109016 ) For `libomp` on AIX, we build shared object `libomp.so` first and then archive it into `libomp.a`. Due to a CMake for AIX problem, the install step also tries to publish `libomp.so`. While we use a script to build `libomp.a` out-of-tree for Clang and avoided the problem, this chokes the in-tree build for Flang. The issue will be reported to CMake but before a fixed CMake is available, this patch ensures only `libomp.a` is published.	2024-09-18 16:12:39 -04:00
Brad Smith	37e109c6f8	[OpenMP] Support setting POSIX thread name on *BSD's and Solaris (#106489 )	2024-08-31 16:53:33 -04:00
Hansang Bae	9e0ee0e104	[OpenMP] Add support for pause with omp_pause_stop_tool (#97100 ) This patch adds support for pause resource with a new enumerator omp_pause_stop_tool. The expected behavior of this enumerator is * omp_pause_resource: not allowed * omp_pause_resource_all: equivalent to omp_pause_hard	2024-08-15 11:44:50 -05:00
Hansang Bae	5989709047	[OpenMP] Miscellaneous small code improvements (#95603 ) Removes a few uninitialized variables, possible resource leaks, and redundant code.	2024-08-15 10:42:22 -05:00
HighW4y2H3ll	0160d817c2	[OpenMP] Rename worker threads for improved debuggability (#102065 ) Rename the worker threads "openmp_worker" --------- Co-authored-by: h2h <h2h@meta.com> Co-authored-by: Matthias Braun <matze@braunis.de>	2024-08-13 22:20:11 -07:00
Tulio Magno Quites Machado Filho	0aa22dcd2f	[OpenMP][AArch64] Fix branch protection in microtasks (#102317 ) Start __kmp_invoke_microtask with PACBTI in order to identify the function as a valid branch target. Before returning, SP is authenticated. Also add the BTI and PAC markers to z_Linux_asm.S. With this patch, libomp.so can now be generated with DT_AARCH64_BTI_PLT when built with -mbranch-protection=standard. The implementation is based on the code available in compiler-rt.	2024-08-13 15:34:41 -03:00
Alexandre Ganea	20baa9a9ec	[openmp][runtime] Silence warnings This fixes several of those when building with MSVC on Windows: ``` [3625/7617] Building CXX object projects\openmp\runtime\src\CMakeFiles\omp.dir\kmp_affinity.cpp.obj C:\src\git\llvm-project\openmp\runtime\src\kmp_affinity.cpp(2637): warning C4062: enumerator 'KMP_HW_UNKNOWN' in switch of enum 'kmp_hw_t' is not handled C:\src\git\llvm-project\openmp\runtime\src\kmp.h(628): note: see declaration of 'kmp_hw_t' ```	2024-08-11 19:01:12 -04:00
arsnyder16	f7b2c2e49f	[openmp][WebAssembly] Allow openmp to compile and run under emscripten toolchain (#95169 ) * Separate wasi and emscripten as they have different constraints and abilities * Emscripten mimics Linux/POSIX by statically linking the musl runtime. This allows nearly all KMP_OS_LINUX code paths to work correctly. There are only a few places that need to be adjusted related to dynamic linking (dl_open) * Internally link openmp globals * With CommonLinkage it is needed to emit them in an assembly file, now they are defined and used within each compilation unit * With ExternalLinkage they suffer from duplicate symbols during linking for unnamed globals like reduction/critical * Interestingly this aligns with the TODO comment above this code	2024-08-07 13:00:37 -05:00
Jonathan Peyton	916a91578f	[OpenMP] Assign thread ids in the cpuinfo topology method (#91013 ) On non-hyperthreaded machines, the thread id is not always explicit in the /proc/cpuinfo file. This patch adds a check to ensure the thread ids are put in.	2024-07-29 09:52:02 -05:00
Jonathan Peyton	77ff969e5d	[OpenMP] Add topology and affinity changes for Meteor Lake (#91012 ) These are Intel-specific changes for the CPUID leaf 31 method for detecting machine topology.
* Cleanup known levels usage in x2apicid topology algorithm
  Change to be a constant mask of all Intel topology type values.
* Take unknown ids into account when sorting them
  If a hardware id is unknown, then put it further down the hardware thread list so it will take last priority when assigning to threads.
* Have sub ids printed out for hardware thread dump
* Add caches to topology
  New `kmp_cache_ids_t` class helps create cache ids which are then put into the topology table after regular topology type ids have been put in.
* Allow empty masks in place list creation
  Have enumeration information and place list generation take into account that certain hardware threads may be lacking certain layers.
* Allow different procs to have different number of topology levels
  Accommodates possible situation where CPUID.1F has different depth for different hardware threads. Each hardware thread has a topology description which is just a small set of its topology levels. These descriptions are tracked to see if the topology is uniform or not.
* Change regular ids with logical ids
  Instead of keeping the original sub ids that the x2apicid topology detection algorithm gives, change each id to its logical id which is a number: [0, num_items - 1]. This makes inserting new layers into the topology significantly simpler.
* Insert caches into topology
  This change takes into account that most topologies are uniform and therefore can use the quicker method of inserting caches as equivalent layers into the topology.
	2024-07-29 09:51:42 -05:00
Jonathan Peyton	2e57e63666	[OpenMP][libomp] Fix tasking debug assert (#95823 ) The debug assert is meant to check that the index is valid, which means the runtime needs to check against the size of the array instead of the number of threads. A free()-ed thread put back in the thread pool may index into anywhere inside the task team's available array from 0 to tt_max_threads potentially. Fixes: #94260	2024-07-24 12:25:21 -05:00
Michael Kruse	5c93a94f5a	[Clang][OpenMP] Add interchange directive (#93022 ) Add the interchange directive which will be introduced in the upcoming OpenMP 6.0 specification. A preview has been published in [Technical Report 12](https://www.openmp.org/wp-content/uploads/openmp-TR12.pdf).	2024-07-19 09:24:40 +02:00
Michael Kruse	80865c01e1	[Clang][OpenMP] Add reverse directive (#92916 ) Add the reverse directive which will be introduced in the upcoming OpenMP 6.0 specification. A preview has been published in [Technical Report 12](https://www.openmp.org/wp-content/uploads/openmp-TR12.pdf). --------- Co-authored-by: Alexey Bataev <a.bataev@outlook.com>	2024-07-18 10:35:32 +02:00
Hansang Bae	7a72856af8	[OpenMP] Use new OMPT state and sync kinds for barrier events (#95602 ) This change makes the runtime use new OMPT state and sync kinds introduced in OpenMP 5.1 in place of the deprecated implicit state and sync kinds. Events from implicit barriers use different enumerators for workshare, parallel, and teams.	2024-07-16 09:52:20 -05:00
Alexandre Ganea	be26e54542	[openmp] Silence warning when building the x64 Windows LLVM release package This fixes: ``` MASM : warning A4018:invalid command-line option : -U_GLIBCXX_ASSERTIONS ```	2024-07-05 21:16:04 -04:00
Hansang Bae	d4f3d24e7f	[OpenMP] Add ompt_start_tool declaration in omp-tools.h (#97099 ) The function ompt_start_tool is a globally-visible C function according to the specification.	2024-07-03 12:59:34 -05:00
Joachim	a707d0883b	[OpenMP][OMPT] Indicate loop schedule for worksharing-loop events (#97429 ) Use more specific values from `ompt_work_t` to allow the tool to identify the schedule of a worksharing-loop. With this patch, the runtime will report the schedule chosen by the runtime rather than necessarily the schedule literally requested by the clause. E.g., for guided + just one iteration per thread, the runtime would choose and report static. Fixes issue #63904	2024-07-03 09:33:19 +02:00
Gheorghe-Teodor Bercea	f0567702aa	[OpenMP] Add missing export for dynamic tracking patch (#97419 ) Add missing export for OpenMP non-offloading builds.	2024-07-02 10:09:27 -04:00
dhruvachak	946f5d111d	[OpenMP] [OMPT] Callback registration should not depend on the device init callback. (#96371 ) Even if the device init callback is not registered, a tool should be allowed to register other callbacks.	2024-07-01 10:07:05 -07:00

1597 Commits