llvm-project

mirror of https://github.com/llvm/llvm-project.git synced 2025-04-25 07:46:05 +00:00

Author	SHA1	Message	Date
Joseph Huber	de41b137dd	[Offload] Provide a CMake cache file to easily build offloading (#115074 ) Summary: This patch adds a cache file that will automatically enable openpm, offload, and all the fancy GPU libraries.	2024-11-07 15:35:29 -06:00
Baodi Shan	4123050b96	[Offload][Doc] Add 'offload' in OpenMP target doc (#110141 ) Fix #106399	2024-10-02 12:55:28 -04:00
Joel E. Denny	54b10555c3	[OpenMP] LIBOMPTARGET_DEVICE_ARCHITECTURES requires semicolons (#107454 ) If I use commas to delimit architectures in `LIBOMPTARGET_DEVICE_ARCHITECTURES`, cmake for the runtimes complains: ``` Unknown GPU architecture 'sm_70,sm_80,sm_90' ``` Semicolons are required instead.	2024-09-05 12:51:00 -07:00
Joseph Huber	74d23f15b6	[OpenMP] Implement 'omp_alloc' on the device (#102526 ) Summary: The 'omp_alloc' function should be callable from a target region. This patch implemets it by simply calling `malloc` for every non-default trait value allocator. All the special access modifiers are unimplemented and return null. The null allocator returns null as the spec states it should not be usable from the target.	2024-08-14 13:38:55 -05:00
Johannes Doerfert	9a1013220b	[Offload] Allow to record kernel launch stack traces (#100472 ) Similar to (de)allocation traces, we can record kernel launch stack traces and display them in case of an error. However, the AMD GPU plugin signal handler, which is invoked on memroy faults, cannot pinpoint the offending kernel. Insteade print `<NUM>`, set via `OFFLOAD_TRACK_NUM_KERNEL_LAUNCH_TRACES=<NUM>`, many traces. The recoding/record uses a ring buffer of fixed size (for now 8). For `trap` errors, we print the actual kernel name, and trace if recorded.	2024-07-31 11:49:50 -07:00
Johannes Doerfert	c95abe94ae	[Offload] Implement double free (and other allocation error) reporting (#100261 ) As a first step towards a GPU sanitizer we now can track allocations and deallocations in order to report double frees, and other problems during deallocation.	2024-07-30 10:10:57 -07:00
Tobias Hieta	10c6d6349e	Clear release notes for upcoming LLVM 20 dev cycle	2024-07-23 11:04:06 +02:00
Tim Gymnich	597d2f7662	[OpenMP] Add Environment Variable to disable Reuse of Blocks for High Loop Trip Counts (#89239 ) Sometimes it might be beneficial to spawn more thread blocks instead of reusing existing for multiple loop iterations. Alternatives considered: Make `DefaultNumBlocks` settable via an environment variable. --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>	2024-06-14 07:35:23 -07:00
Michael Kruse	8bdc577667	[openmp] Revise IDE folder structure (#89750 ) Update the folder titles for targets in the monorepository that have not seen taken care of for some time. These are the folders that targets are organized in Visual Studio and XCode (`set_property(TARGET <target> PROPERTY FOLDER "<title>")`) when using the respective CMake's IDE generator. * Ensure that every target is in a folder * Use a folder hierarchy with each LLVM subproject as a top-level folder * Use consistent folder names between subprojects * When using target-creating functions from AddLLVM.cmake, automatically deduce the folder. This reduces the number of `set_property`/`set_target_property`, but are still necessary when `add_custom_target`, `add_executable`, `add_library`, etc. are used. A LLVM_SUBPROJECT_TITLE definition is used for that in each subproject's root CMakeLists.txt.	2024-05-25 17:34:28 +02:00
Joseph Huber	c618ae1734	[Offload] Rework handling for loading vendor runtimes (#93073 ) Summary: We previously had multiple options for this, this patch replaces them with `LIBOMPTARGET_DLOPEN_PLUGINS=` to be a list of plugins to dynamically use. It defaults to everything right now. This ignores the `host` plugin because the `libffi` dependency is going to be removed soon hopefully in https://github.com/llvm/llvm-project/pull/91264.	2024-05-22 13:04:52 -05:00
Sirraide	c44fa3e8a9	[Clang] Refactor `__attribute__((assume))` (#84934 ) This is a followup to #81014 and #84582: Before this patch, Clang would accept `__attribute__((assume))` and `[[clang::assume]]` as nonstandard spellings for the `[[omp::assume]]` attribute; this resulted in a potentially very confusing name clash with C++23’s `[[assume]]` attribute (and GCC’s `assume` attribute with the same semantics). This pr replaces every usage of `__attribute__((assume))` with `[[omp::assume]]` and makes `__attribute__((assume))` and `[[clang::assume]]` alternative spellings for C++23’s `[[assume]]`; this shouldn’t cause any problems due to differences in appertainment and because almost no-one was using this variant spelling to begin with (a use in libclc has already been changed to use a different attribute).	2024-05-22 17:58:48 +02:00
Jonathan Peyton	2ff3850ea1	[OpenMP] Add absolute KMP_HW_SUBSET functionality (#85326 ) Users can put a : in front of KMP_HW_SUBSET to indicate that the specified subset is an "absolute" subset. Currently, when a user puts KMP_HW_SUBSET=1t. This gets translated to KMP_HW_SUBSET="s,c,1t", where * means "use all of". If a user wants only one thread as the entire topology they can now do KMP_HW_SUBSET=:1t. Along with the absolute syntax is a fix for newer machines and making them easier to use with only the 3-level topology syntax. When a user puts KMP_HW_SUBSET=1s,4c,2t on a machine which actually has 4 layers, (say 1s,2m,3c,2t as the entire machine) the user gets an unexpected "too many resources asked" message because KMP_HW_SUBSET currently translates the "4c" value to mean 4 cores per module. To help users out, the runtime can assume that these newer layers, module in this case, should be ignored if they are not specified, but the topology should always take into account the sockets, cores, and threads layers.	2024-04-03 11:43:23 -05:00
Fangrui Song	9936ac3083	[docs] Prefer --gcc-install-dir= to deprecated GCC_INSTALL_PREFIX (#85458 ) Setting GCC_INSTALL_PREFIX leads to a warning (#77537). Link: https://discourse.llvm.org/t/add-gcc-install-dir-deprecate-gcc-toolchain-and-remove-gcc-install-prefix/65091 Link: https://discourse.llvm.org/t/correct-cmake-parameters-for-building-clang-and-lld-for-riscv/72833	2024-03-18 13:11:44 -07:00
Tom Stellard	987087df90	Bump trunk version to 19.0.0git	2024-01-23 19:00:11 -08:00
Johannes Doerfert	b8b2a279d0	[OpenMP][NFC] Encapsulate profiling logic (#74003 ) This simply puts the profiling logic into the `Profiler` class and allows non-RAII profiling via `beginSection` and `endSection`.	2023-11-30 15:52:02 -08:00
Michael Halkenhaeuser	19fa27605c	[NFC][docs] Add AMDGPU documentation for `LIBOMPTARGET_STACK_SIZE` Add documentation w.r.t. changes by #72606, which allows to set the dynamic callstack size.	2023-11-28 14:09:42 -05:00
Johannes Doerfert	d3921e4670	[OpenMP] Basic BumpAllocator for (AMD)GPUs (#69806 ) The patch contains a basic BumpAllocator for (AMD)GPUs to allow us to run more tests. The allocator implements `malloc`, both internally and externally, while we continue to default to the NVIDIA `malloc` when we target NVIDIA GPUs. Once we have smarter or customizable allocators we should consider this choice, for now, this allocator is better than none. It traps if it is out of memory, making it easy to debug. Heap size is configured via `LIBOMPTARGET_HEAP_SIZE` and defaults to 512MB. It allows to track allocation statistics via `LIBOMPTARGET_DEVICE_RTL_DEBUG=8` (together with `-fopenmp-target-debug=8`). Two tests were added, and one was enabled. This is the next step towards fixing https://github.com/llvm/llvm-project/issues/66708	2023-10-21 14:49:30 -07:00
Joseph Huber	ccb1d183c3	[OpenMP][Docs] Remove old entry saying static libraries are unsupported Summary: Static libraries have been supported since LLVM 15.0, this entry is misleading and should be removed.	2023-08-30 06:48:57 -05:00
Anton Rydahl	c1b5674fbb	[OpenMP] Change OpenMP default version in documentation and help text for -fopenmp-version As discussed on the weekly OpenMP meeting on the second of August 2023, the default version in the OpenMP documentation shoud be changed from OpenMP 5.0 to 5.1. Differential Revision: https://reviews.llvm.org/D156901	2023-08-28 19:05:55 -07:00
Kazu Hirata	11e2975810	Fx typos in documentation	2023-08-18 23:36:04 -07:00
Terry Wilmarth	f0221fb1d7	[OpenMP] Add option to use different units for blocktime This change adds the option of using different units for blocktimes specified via the KMP_BLOCKTIME environment variable. The parsing of the environment now recognizes units suffixes: ms and us. If a units suffix is not specified, the default unit is ms. Thus default behavior is still the same, and any previous usage still works the same. Internally, blocktime is now converted to microseconds everywhere, so settings that exceed INT_MAX in microseconds are considered "infinite". kmp_set/get_blocktime are updated to use the units the user specified with KMP_BLOCKTIME, and if not specified, ms are used. Added better range checking and inform messages for the two time units. Large values of blocktime for default (ms) case (beyond INT_MAX/1000) are no longer allowed, but will autocorrect with an INFORM message. The delay for determining ticks per usec was lowered. It is now 1 million ticks which was calculated as ~450us based on 2.2GHz clock which is pretty typical base clock frequency on X86: (1e6 Ticks) / (2.2e9 Ticks/sec) * (1e6 usec/sec) = 454 usec Really short benchmarks can be affected by longer delay. Update KMP_BLOCKTIME docs. Portions of this commit were authored by Johnny Peyton. Differential Revision: https://reviews.llvm.org/D157646	2023-08-18 14:01:13 -05:00
Michael Halkenhaeuser	7eba3e58d5	[OpenMP][AMDGPU] Add Envar for controlling HSA busy queue tracking If the Envar is set to true (default), busy HSA queues will be actively avoided when assigning a queue to a Stream. Otherwise, we will initialize a new HSA queue for each requested Stream, then default to round robin once the set maximum has been reached. Reviewed By: jdoerfert, kevinsala Differential Revision: https://reviews.llvm.org/D156996	2023-08-07 10:48:02 -04:00
Joseph Huber	46642cc83d	[Libomptarget] Remove debug RAII from libomptarget This feature was supposed to allow you to trace execution inside of Libomptarget. However, this never really worked properly. The printing was always reoganized, only worked for single threads, and pretty much only told you a handful of things about a runtime library that's an implementation detail to all users. Despite this, it contributed about 40% of the total filesize of the deviceRTL. This patch simply removes this functionalit which I think was past due. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D157001	2023-08-03 09:37:47 -05:00
Michael Halkenhaeuser	5b19f42b63	[OpenMP][AMDGPU] Single eager resource init + HSA queue utilization tracking This patch lazily initializes queues/streams/events since their initialization might come at a cost even if we do not use them. To further benefit from this, AMDGPU/HSA queue management is moved into the AMDGPUStreamManager of an AMDGPUDevice. Streams may now use different HSA queues during their lifetime and identify busy queues. When a Stream is requested from the resource manager, it will search for and try to assign an idle queue. During the search for an idle queue the manager may initialize more queues, up to the set maximum (default: 4). When no idle queue could be found: resort to round robin selection. With contributions from Johannes Doerfert <johannes@jdoerfert.de> Depends on D156245 Reviewed By: kevinsala Differential Revision: https://reviews.llvm.org/D154523	2023-08-02 08:22:26 -04:00
Anton Rydahl	5c0f98cd2a	[OpenMP][Docs] Added offloading command line reference to OpenMP FAQ This command adds an OpenMP offloading specific command line reference. The OpenMP FAQ links to the .rst new file. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D156387	2023-07-29 17:40:28 -07:00
antonrydahl	daf36b54b4	Revert "[OpenMP][Docs] Added offloading command line reference to OpenMP FAQ" This reverts commit 4166ff6107d76eb3f9acd337f1fcd254de733477. I accidentally pushed an old version of this patch.	2023-07-28 18:28:29 -07:00
Anton Rydahl	b880552dc1	[OpenMP][Docs] Updated the OpenMP documentation about building the OpenMP documentation with Sphinx When I was trying to improve the OpenMP documentation, I found that the information in `OpenMP/docs/README.md` did not contain up-to-date information about how to build the OpenMP documentation with Sphinx. When I ran `make docs-openmp-html`, the command failed because there were a few syntax errors in `openmp/docs/design/Runtimes.rst`. This commit fixes the syntax errors and updates the documentation on building the OpenMP documentation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D156470	2023-07-28 18:04:21 -07:00
antonrydahl	4166ff6107	[OpenMP][Docs] Added offloading command line reference to OpenMP FAQ I have added a few things to the OpenMP FAQ which I think were missing. Feel free to suggest some changes. Are there missing options in the offloading command line reference? And what do you think about the section "Q: Why is my build taking a long time"? Differential Revision: https://reviews.llvm.org/D156387	2023-07-28 18:04:21 -07:00
Tobias Hieta	4706251a31	Clear release notes for 18.x	2023-07-25 13:58:49 +02:00
Michael Halkenhaeuser	5fa5c39871	[OpenMP] Add OMPT release note OMPT release note addition for LLVM 17 Differential Revision: https://reviews.llvm.org/D156191	2023-07-24 20:38:04 -04:00
Joseph Huber	8db184ae8c	[OpenMP] Add a few release notes Summary: Release notes	2023-07-24 13:26:44 -05:00
Joseph Huber	48da62617e	[OpenMP] Add documentation on using the `libc` in OpenMP This points users to the `libc` documentation and explains the basics of how it's used inside the runtime. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D155318	2023-07-14 13:28:29 -05:00
Joseph Huber	e90ab9148b	[OpenMP] Delete old plugins It's time to remove the old plugins as the next-gen has already been set to default in LLVM 16. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D142820	2023-07-05 17:39:47 -05:00
Johannes Doerfert	6629a96a8c	[OpenMP] Improve default block count selection fow low block counts If a combined loop has insufficient parallelism (= low trip count), we might end up with too few teams/blocks. To counter that we can reduce the number of threads per team we use. This patch implements a heuristic and exposes a new environment variable to control the minimum of threads to be employed in this case. Issue reported by: Felipe Cabarcas Jaramillo <cabarcas@udel.edu> (@fel-cab). Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D152014	2023-06-05 16:35:44 -07:00
Kazu Hirata	a82f2b2db3	Fix typos in documentation	2023-05-28 13:13:12 -07:00
Mark de Wever	cbaa3597aa	Reland "[CMake] Bumps minimum version to 3.20.0. This reverts commit d763c6e5e2d0a6b34097aa7dabca31e9aff9b0b6. Adds the patch by @hans from https://github.com/llvm/llvm-project/issues/62719 This patch fixes the Windows build. d763c6e5e2d0a6b34097aa7dabca31e9aff9b0b6 reverted the reviews D144509 [CMake] Bumps minimum version to 3.20.0. This partly undoes D137724. This change has been discussed on discourse https://discourse.llvm.org/t/rfc-upgrading-llvms-minimum-required-cmake-version/66193 Note this does not remove work-arounds for older CMake versions, that will be done in followup patches. D150532 [OpenMP] Compile assembly files as ASM, not C Since CMake 3.20, CMake explicitly passes "-x c" (or equivalent) when compiling a file which has been set as having the language C. This behaviour change only takes place if "cmake_minimum_required" is set to 3.20 or newer, or if the policy CMP0119 is set to new. Attempting to compile assembly files with "-x c" fails, however this is workarounded in many cases, as OpenMP overrides this with "-x assembler-with-cpp", however this is only added for non-Windows targets. Thus, after increasing cmake_minimum_required to 3.20, this breaks compiling the GNU assembly for Windows targets; the GNU assembly is used for ARM and AArch64 Windows targets when building with Clang. This patch unbreaks that. D150688 [cmake] Set CMP0091 to fix Windows builds after the cmake_minimum_required bump The build uses other mechanism to select the runtime. Fixes #62719 Reviewed By: #libc, Mordante Differential Revision: https://reviews.llvm.org/D151344	2023-05-27 12:51:21 +02:00
Tobias Hieta	f98ee40f4b	[NFC][Py Reformat] Reformat python files in the rest of the dirs This is an ongoing series of commits that are reformatting our Python code. This catches the last of the python files to reformat. Since they where so few I bunched them together. Reformatting is done with `black`. If you end up having problems merging this commit because you have made changes to a python file, the best way to handle that is to run git checkout --ours <yourfile> and then reformat it with black. If you run into any problems, post to discourse about it and we will try to help. RFC Thread below: https://discourse.llvm.org/t/rfc-document-and-standardize-python-code-style Reviewed By: jhenderson, #libc, Mordante, sivachandra Differential Revision: https://reviews.llvm.org/D150784	2023-05-25 11:17:05 +02:00
Nico Weber	d763c6e5e2	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" This reverts commit 65429b9af6a2c99d340ab2dcddd41dab201f399c. Broke several projects, see https://reviews.llvm.org/D144509#4347562 onwards. Also reverts follow-up commit "[OpenMP] Compile assembly files as ASM, not C" This reverts commit 4072c8aee4c89c4457f4f30d01dc9bb4dfa52559. Also reverts fix attempt "[cmake] Set CMP0091 to fix Windows builds after the cmake_minimum_required bump" This reverts commit 7d47dac5f828efd1d378ba44a97559114f00fb64.	2023-05-17 10:53:33 -04:00
Mark de Wever	65429b9af6	Reland "[CMake] Bumps minimum version to 3.20.0." The owner of the last two failing buildbots updated CMake. This reverts commit e8e8707b4aa6e4cc04c0cffb2de01d2de71165fc.	2023-05-13 11:42:25 +02:00
Mark de Wever	e8e8707b4a	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" Unfortunatly not all buildbots are updated. This reverts commit ffb807ab5375b3f78df198dc5d4302b3b552242f.	2023-05-06 17:03:56 +02:00
Mark de Wever	ffb807ab53	Reland "[CMake] Bumps minimum version to 3.20.0." All build bots should be updated now. This reverts commit 44d38022ab29a3156349602733b3459df5beef93.	2023-05-06 11:43:02 +02:00
Timm Bäder	eadf6db585	[docs] Hide collaboration and include graphs in doxygen docs They don't convey any useful information and make the documentation unnecessarily hard to read. Differential Revision: https://reviews.llvm.org/D149641	2023-05-04 12:26:51 +02:00
gregrodgers	f238a98e84	[OpenMP][libomptarget][AMDGPU] Enable active HSA wait state Adds HSA timeout hint of 2 seconds to the AMDGPU nextgen-plugin to improve performance of small kernels. The HSA runtime may stay in HSA_WAIT_STATE_ACTIVE for up to the timeout value before switching to HSA_WAIT_STATE_BLOCKED. This can improve latency from which small kernels can benefit. The value was determined via experimentation w/ different benchmarks. The timeout value can be overriden using the environment variable LIBOMPTARGET_AMDGPU_STREAM_BUSYWAIT with a value in microseconds. Original author: Greg Rodgers <Gregory.Rodgers@amd.com> Contributions from: JP Lehr <JanPatrick.Lehr@amd.com> Differential Revision: https://reviews.llvm.org/D148808	2023-05-04 06:01:14 -04:00
Mark de Wever	44d38022ab	Revert "Revert "Revert "[CMake] Bumps minimum version to 3.20.0.""" This reverts commit 1ef4c3c859728008cf707cad8d67f45ae5070ae1. Two buildbots still haven't been updated.	2023-04-15 20:12:24 +02:00
Mark de Wever	1ef4c3c859	Revert "Revert "[CMake] Bumps minimum version to 3.20.0."" This reverts commit 92523a35a827539db8557bbc3ecab7f9ea3f6ade. Reland to see whether CIs are updated.	2023-04-15 13:12:04 +02:00
Joseph Huber	d2f22fb841	[OpenMP][Docs] Replace broken design document link with the git repo Summary: At some point we stopped copying this file to the server, but realistically this is just a static `.pdf` hosted in the LLVM repository so we can link it directly.	2023-04-14 11:11:11 -05:00
Joseph Huber	0979ea9235	[OpenMP][Docs] Add documentation for using configuration files We recently reverted a patch that automatically set the rpath on OpenMP executables. This was used because the `libomptarget.so` library is only expected to work with the same version of compiler that will be using it. This patch adds some documentation for how to get similar behaviour as before using a clang configuration file. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D147943	2023-04-14 09:39:05 -05:00
Mark de Wever	d0398d3593	Revert "Reland "[CMake] Bumps minimum version to 3.20.0."" This reverts commit a72165e5df59032cdd54dcb18155f2630d73abd1. Some buildbots have not been updated yet.	2023-03-18 20:32:43 +01:00
Mark de Wever	a72165e5df	Reland "[CMake] Bumps minimum version to 3.20.0." This reverts commit 92523a35a827539db8557bbc3ecab7f9ea3f6ade. Test whether all CI runners are updated.	2023-03-18 13:33:42 +01:00
Kevin Sala	09a5915e51	[OpenMP][libomptarget][NFC] Add documentation regarding NextGen plugins Differential Revision: https://reviews.llvm.org/D144975	2023-03-14 16:01:02 +01:00

1 2 3

132 Commits